Gene Information |
Locus tag | Franean1_6589 |
Symbol | |
ID | 5674904 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 8018594 |
End bp | 8019595 |
Gene Length | 1002 bp |
Protein Length | 333 aa |
Translation table | 11 |
GC content | 78% |
IMG OID | 641245440 |
Product | ApbE family lipoprotein |
Protein accession | YP_001510832 |
Protein GI | 158318324 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG1477] Membrane-associated lipoprotein involved in thiamine biosynthesis |
TIGRFAM ID | |
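
The coordinate and length fields in this record can be cross-checked against each other. The sketch below is not part of the database record. It assumes that Start bp and End bp are 1-based, inclusive positions and that the final codon of the CDS is a stop codon that is not counted in Protein Length. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Values taken from the record above (Start bp, End bp).
gene_bp = gene_length_bp(8018594, 8019595)
protein_aa = protein_length_aa(gene_bp)
print(gene_bp, protein_aa)  # 1002 333
```

Under these assumptions the computed values agree with the Gene Length (1002 bp) and Protein Length (333 aa) fields.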
| |
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.162386 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCTAGGG CCGGCGCCGG CCAGGACGCG CGCCCGTCGC GCCCGTTGCA CCCGTCGCGC CCGTCGCGAC CGGGGCGCAG CCACGCCGAA CCGGTGATGG GCACGGTTGT CAGCATCGAC GTCCGCTCGC CGCTCGACGC GGTCGGCCTG GACGCGGCGA TCGGTGCCGC GGTCCGTGTG CTGCATCGGG TCGACGAGGA TTTCAGCACG TTCCGGGCGA CGTCCTGGGT GTCGCGGCTG CGCCGCGGCG AGATCGAGCT CGGCGACTGT CCGGATCACG TCCGCGAGGT GTACCGGGCG GCCGCCGAAT GCCGGGAGCA GACCGGGGGA TGGTTCGACC CCGCCTGGCG CGGGGACGGC ACGCTGGACC CGACCGGGCT GGTCAAGGGC TGGGCGGCCG ACGCCGCGTC GACGGCCCTG ACCGCCGCGG GTGCCCCGAG CCACTGCGTC AGCGCCGCCG GTGACCTGCG GGTGCGGGGC ACCTCGGGGT GGGCGCCGGG ACGTCCATGG CGGATCGGGA TCGCCGATCC GTTCGACCGG GCCAGGCTGG TCGCCGTGGT GGAGGGCACC GAGCTGGCCG TCGCGACCTC GGGGGTCGCC GAGCGGGGCG CGCACGTCGT CGACCCGCGT ACCGGCGCCC CGGCGACGGG CCTCGCCTCG GTCACCCTCG TGGGCGCCGA CCTCGTGGGT GCCGACGCCA CGCTCGCGGA CGGGTTCGCC ACCGCCGCCC TCGCCGCCGG GCCGGAGGCG CCGGCCCTGC TCACCCACCT CGCCCGCCGG GGATGGGAGT GGCTGACGGT CGACACCACC GGGCGGCTCA CGCACTCCGC CGGCTTCCCG GGCCAGGCCA CCACGACCGC CGCGGGGCCG GTCGCCGAGA CAGCCCCGCG GCCGGCCGTC CGAGCGACAG CCCCGCGGCC GGCCGTCCGA GCGACAGCCC CGAGGCCAGC CGTGCGAGCG ACCGCCCCGA GGCCGGCCGC CGGACGTCCC GACGGGAGAT GA
|
Protein sequence | MSRAGAGQDA RPSRPLHPSR PSRPGRSHAE PVMGTVVSID VRSPLDAVGL DAAIGAAVRV LHRVDEDFST FRATSWVSRL RRGEIELGDC PDHVREVYRA AAECREQTGG WFDPAWRGDG TLDPTGLVKG WAADAASTAL TAAGAPSHCV SAAGDLRVRG TSGWAPGRPW RIGIADPFDR ARLVAVVEGT ELAVATSGVA ERGAHVVDPR TGAPATGLAS VTLVGADLVG ADATLADGFA TAALAAGPEA PALLTHLARR GWEWLTVDTT GRLTHSAGFP GQATTTAAGP VAETAPRPAV RATAPRPAVR ATAPRPAVRA TAPRPAAGRP DGR
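
The sequences in this record are displayed in space-separated blocks of ten characters. As a minimal sketch (not part of the record, with hypothetical function names), the blocks can be joined and a GC percentage computed as follows. The example runs on the first two blocks of the gene sequence only, so its result is not the whole-gene GC content reported above.

```python
def clean_sequence(raw: str) -> str:
    """Remove the whitespace between display blocks and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two display blocks of the gene sequence in this record.
first_blocks = clean_sequence("GTGTCTAGGG CCGGCGCCGG")
print(len(first_blocks), gc_percent(first_blocks))  # 20 80.0
```

Passing the full gene sequence string to the same two functions would give a value that can be compared with the GC content field.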