Gene Information |
Locus tag | Franean1_0478 |
Symbol | |
ID | 5668898 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 563583 |
End bp | 564650 |
Gene Length | 1068 bp |
Protein Length | 355 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641239408 |
Product | integrase catalytic region |
Protein accession | YP_001504846 |
Protein GI | 158312338 |
COG category | |
COG ID | |
TIGRFAM ID | |
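
The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon; neither convention is stated in the record itself, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(563583, 564650)
print(length, protein_length(length))  # prints: 1068 355
```

Under these assumptions the computed values agree with the Gene Length (1068 bp) and Protein Length (355 aa) fields.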
| |
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCGTAC GCCTGCTGTA CCTGATCTTC GTTCGGGTCT GTGGCTGGCT GGTTCTTCTC GGCCGTTCGT CGGCGTCGAA GGACATCGAG TTGCTGGTGC TGCGGCATGA GGTCGCGGTG CTGCGCCGTA CCCAGCCCAA GCCCCGGTGG GACTGGGCGG ACCGGGCGGT CCTCGCCGCA CTCATCCGGC TCCTGCCCAG GGCGCTGCGA GCGCACCGGC TGGTCACGCC CGGCACCGTC CTACGGTGGC ACCGCCGCCT GATCACACGG AAATGGACCC ACCCGCAGCG GACCGGACGG CCACCGATCG GCACGGAGAT CGCCACGCTG ATCGAGCGGC TCGCGACCGA GAACACGACA TGGGGCTACC AGCGAATCCA GGGCGAGCTC CTCACACTCG GCCACCGGGT GAGCGCCTCC ACGATCCGCC GGGTCCTGAA GACCCTCGGG CTGCCCCCGG CACCGAAACG GCAGACCGAC ACGACGTGGC GACAGTTCCT GCGTACACAG GCATCGACCA TGCTGGCCGT CGACTTCTTC CACGTGGACT GCGCCGTGAC ACTCCGGCGT CTGCACTGCT TCTTCGTCAT AGAGGTCGAC TCCCGCACCG TCCACATCCT CGGAGTCACC GCCCACCCCG ACGGACCATG GACCACCCAA CAAGCCCGGA ACCTCCTCAT GGACCTCGGT GATCAGGCGG CCGACTTCCA GTTCCTGATC CGCGACCGCG CCGGCCAGTT CACCGCGTCG TTCGACACGG TCCTCGCCGA CGCCGGCATC ACCGCCGTCA CGATCCCACC CCGGGCTCCC CGGGCGAACG CCTACGCGGA ACGGTTCGTC CGCACCGTCC GGACCGAGGT CACCGACCGC ATGCTGATCG TCGGCGAGCG GCATCTGCGC ATGGTCCTGG CCGAGTACGC ACGGCACTAC AACGGACGAC GACCCCACCG CGGCCGCGAC CTTCAACCAC CCCGGCCCGA CCACCCCGTC GCAGACCTGA CCCAGGAACG GATCAAGCGC CAGCCCGTCC TCGGTGGCTT GATCAACGAA TACGAACGAG CCGCCTAA
Protein sequence | MSVRLLYLIF VRVCGWLVLL GRSSASKDIE LLVLRHEVAV LRRTQPKPRW DWADRAVLAA LIRLLPRALR AHRLVTPGTV LRWHRRLITR KWTHPQRTGR PPIGTEIATL IERLATENTT WGYQRIQGEL LTLGHRVSAS TIRRVLKTLG LPPAPKRQTD TTWRQFLRTQ ASTMLAVDFF HVDCAVTLRR LHCFFVIEVD SRTVHILGVT AHPDGPWTTQ QARNLLMDLG DQAADFQFLI RDRAGQFTAS FDTVLADAGI TAVTIPPRAP RANAYAERFV RTVRTEVTDR MLIVGERHLR MVLAEYARHY NGRRPHRGRD LQPPRPDHPV ADLTQERIKR QPVLGGLINE YERAA
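
The sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch, using hypothetical helper names, of how such a field could be cleaned, its GC fraction computed, and its translation compared with the protein field. It assumes the gene sequence is given 5' to 3' on the coding strand (it begins with ATG) and uses the standard codon-to-amino-acid assignments, which translation table 11 shares; alternative start codons are not handled.

```python
BASES = "TCAG"
# Standard amino-acid assignments in TCAG codon order; translation table 11
# uses the same assignments for elongation codons.
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove grouping whitespace and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence field above.
print(translate("ATGTCCGTAC GCCTGCTGTA CCTGATCTTC"))  # prints: MSVRLLYLIF
```

Passing the full Gene sequence field to `translate` and comparing the result with `clean` applied to the Protein sequence field would check that the two fields agree; `gc_fraction` could be compared with the GC content field in the same way.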