Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_4337 |
Symbol | |
ID | 5672692 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 5180367 |
End bp | 5182997 |
Gene Length | 2631 bp |
Protein Length | 876 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641243210 |
Product | hypothetical protein |
Protein accession | YP_001508627 |
Protein GI | 158316119 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG4591] ABC-type transport system, involved in lipoprotein release, permease component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.412663 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTGAACG GCGGCGGACG GCGCGGCGCG GTGCTCGGGG AGACACCCGG CGGTGACGGC CATGCCGCGA CACCCGGCAA CGGGAACAGC GGCCGTGGCG CGGCGCCCGG CAGCGGCCGG CGCGGCGCGG CATCCGGGGA CGACGGCGGG CCGGGCCGGG GCGGGGCGGG GTCGGCGGGG CTGTGGCTGC GGTGGTCGTG GCGCGACCTG CGGGCGCGGC TGCTGCTGGT GGTGGCGCTC GCCGCCGTCA TCGGCGAGGG AACCGGCCTG TACGCCGGCC TGACCAGCAC GTCGCGGTGG CGCTACGAGT CCTACGACGC CAGCTTCGCC GGGCTGAACG TGCACGACCT GCGGATCAGC GTGGACGCGG GCGCCACGGT GCCGCGGGGG CGGCTGCGCG ACGTCGTCGC GGCCCTGCCC GACCCGGCCG CCGTGGATGC GAGCGCCGAG CGCCTGATGT TCCCCACCGA GATCGAGGCG AGCCGACCAG GGCACAAGGA GGTGCTCGTC CGCGGCGAGG TCGTCGGCGT CGACCTGACC GCGCGCCCGC TCGTGGACGG GATCTCGGTC GCCGCCGGCC GCGCGCTCAC CACGGCCGAC CGGGGACGGC CCGTCGGCGT CCTGGACCAC GGGTTCGCCC AGGCCAACCA CCTGCCGGCG ACGGGCACGG TACGGATCAG CGGGGACCGC GAGATCGGAT ACGTCGGCGA GGGGCAGTCC CCGGAGTACT TCGTGCTGCC CGGCGAGCAG CCCGGACTGA TCACCCAGGC CGGCTTCGGC GTGCTCTACA CCTCGCTGGA GACCGCGCAG AACCTCGCCG GCAAACCGGG AGCGGTCAAC GACCTGGTCC TCACCGCGCG GCCCGGGACC GACGTCGCGG TCCTCCAGCG GCAGCTCACC GCGGCCGTCA CCGAGCGGCT CCCCGGGGTG AGCACCACGA TCACCAACCG TGACGACATC GCGTCCCGGC AGATCATGTA CGACTCGATC GAGAGCAACC AGGCGCTGTG GAACGCGCTC GCGCTGCTCG TCCTGGTCGG TGCCATGTTC GCCGCGTTCA ACCTGGTGGG ACGGGTGGTG GACGCGCAGC GACGGGAGAT CGGCATCGGC ATGGCGCTCG GGGTGCGGTC CCGGATGCTG GCGTTGCGCC CGTTGCTGCT GGGGCTGCAG ATCGGGATCC TCGGGGTCCT CGCGGGGCTG GTCACGGGGA TGATCATCAC TGCCGCGATG GGCGCGATGC TGCGCGACGT GTGGCCGCTA CCGGACTGGC GCACCGGCTT CCAGGTCGGG GTGTTCGCCC GCGCCGCGGC GGTCGGCCTG CTGCTGCCGC TGGTCGCGGC GGTCCACCCG GTGTGGCGGG CGGTGCGGGT CGAGCCGGTG CAGGCGATCC ACGCGTCGGC CATCTCCGGC TCCACCCGCG CCCGTGCCCG CGCCCGCCCG CGGCGGCGTC GGCGCGGGTT CCCGCTGCCC GGGGGGAGCC TGGCCCGAAT GCCGGCGCGC AACCTGGCGC GCGCCCCGCG GCGGATGCTG CTCACCGCCC TGGGAATCGC CGCGGCGATC ACGGCACAGG TCGTGTTCAC CGGCCAGCTC GACACCTTCA CCCGGACGAC CGACGCCGCC GAGACCGAGC TGACGTCCAC CAGCCCCGAC CGGCTGCGGG TGACCCTGCC GTCGGTGCAG CCCGTCACCT CGCCCACCGT CACCGCGGTC ACCGGCTCAC CGGCGGTCCG CGGCTCCGAC GCCACGCTCG CCCTGCCCAC CAGGCTGCTG GGCCCGCCCG GTTCGGCACC GCACGAACCG ATCGACACCC TGACCTACAT CCTGGACGTC GACAACCACA TCTGGTCACC CTCGATCACC GAGGGCAGCG CGACAGGCGG CCTGCTGCTC GCCGCCAAGG CCGCCCACGA CCTGGGCGTG GGCGTCGGCG GGACGGTCCT GCTGCGCCAT CCGCGCCGGG CCGGCGACGG CTACCAGACG GTCGACACCC CGCTGCGCGT CGCCGGCATC CACAGCTTCC CGGTCCGCTC GGTGGCCTTC CTCGACGCCG CCGACGCCGG CTCGTTCGGC CTCGCCGGCC TGACGAACGT GCTGACCGTC CTGCCCGCGC CCGGCTACGA CCAGCTCGAC GCGATGCGCA CGCTCGCGGC CGTCCCCGGG GTCGGCTCCG TCGTCCCGGC GACGGGGAGC ATCGAGGAGA TCCGTGCGCT GCTACGCACG TTCGTCGGCA TCCTGCGGAT CGCCGAGATC GCGGTGCTCC TGCTCGCCCT GCTGATCGCC TACAACGCGA TGAGCATCGC GATGGACGAG CGCCGCCGGG AGCAGGCGAC GATGCTCGCC TTCGGGCTGG CCCCGCGCCG GGTGCTGGCG CTCGCCGTCG CCGAGAGCGC GCTGATCGGC CTGCTCGGGA CGGTGATCGG CCTGGTAGCC GGCTACTGGA CGCTGCGCTG GACCGTCGAG GTCCTGCTCG CCGACACCCT GCCCGACCTG GGCATCCGGG CCGTGCTGTC GGTTCCCACC CTGCTGACCA CGCTGCTGCT GGGGGTCTTC GCGGTCGCCG TCGCCCCGCT GCTGTCCGCC CGGCGGGTAC GCCGGATGGA CGTCCCGTCC ACCCTGCGGG TGATCGAGTA G
|
Protein sequence | MVNGGGRRGA VLGETPGGDG HAATPGNGNS GRGAAPGSGR RGAASGDDGG PGRGGAGSAG LWLRWSWRDL RARLLLVVAL AAVIGEGTGL YAGLTSTSRW RYESYDASFA GLNVHDLRIS VDAGATVPRG RLRDVVAALP DPAAVDASAE RLMFPTEIEA SRPGHKEVLV RGEVVGVDLT ARPLVDGISV AAGRALTTAD RGRPVGVLDH GFAQANHLPA TGTVRISGDR EIGYVGEGQS PEYFVLPGEQ PGLITQAGFG VLYTSLETAQ NLAGKPGAVN DLVLTARPGT DVAVLQRQLT AAVTERLPGV STTITNRDDI ASRQIMYDSI ESNQALWNAL ALLVLVGAMF AAFNLVGRVV DAQRREIGIG MALGVRSRML ALRPLLLGLQ IGILGVLAGL VTGMIITAAM GAMLRDVWPL PDWRTGFQVG VFARAAAVGL LLPLVAAVHP VWRAVRVEPV QAIHASAISG STRARARARP RRRRRGFPLP GGSLARMPAR NLARAPRRML LTALGIAAAI TAQVVFTGQL DTFTRTTDAA ETELTSTSPD RLRVTLPSVQ PVTSPTVTAV TGSPAVRGSD ATLALPTRLL GPPGSAPHEP IDTLTYILDV DNHIWSPSIT EGSATGGLLL AAKAAHDLGV GVGGTVLLRH PRRAGDGYQT VDTPLRVAGI HSFPVRSVAF LDAADAGSFG LAGLTNVLTV LPAPGYDQLD AMRTLAAVPG VGSVVPATGS IEEIRALLRT FVGILRIAEI AVLLLALLIA YNAMSIAMDE RRREQATMLA FGLAPRRVLA LAVAESALIG LLGTVIGLVA GYWTLRWTVE VLLADTLPDL GIRAVLSVPT LLTTLLLGVF AVAVAPLLSA RRVRRMDVPS TLRVIE
|
| |