Gene Information |
Locus tag | Franean1_1424 |
Symbol | |
ID | 5675680 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 1717585 |
End bp | 1719195 |
Gene Length | 1611 bp |
Protein Length | 536 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641240345 |
Product | thiamine pyrophosphate binding domain-containing protein |
Protein accession | YP_001505772 |
Protein GI | 158313264 |
COG category | [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism |
COG ID | [COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] |
TIGRFAM ID | |
|
|
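The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the stop codon; the variable names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information table above.
start_bp = 1717585
end_bp = 1719195

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the final codon is the stop codon
# and does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values match the listed Gene Length (1611 bp) and Protein Length (536 aa).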
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTGGGTG AAGGATCAAC GGTTCGCACG GTGACCTACG ACCTGCTGAG GTCGTTGGGG ATGACCACCG TCTTCGGTAA CCCCGGGTCT ACCGAAGAGC CGTTCCTCCA GAAGTTTCCG GACGACTTCA CCTATGTTCT TGGTCTTCAG GAGGCGTCCG TTATCGCCAT GGCGGACGGT TTCGCGCAGA CCACGCGGCG TCCGGCGCTG GTGAACGTGC ATTCTTCCGC GGGTCTTGGT AACGCCCTCG GCAACCTGGT CGCGGCCTAC CGCGGCCATA CGCCGCTGAT CGTCACGTCC GGTCAGCAGC ACCGCGAACT GGTGATCGGT GAGCCTTACC TCGGCAACCG CGACGCAACG AACCTACCCA GACCGTGGGT GAAGTGGGCC TACGAGCCGG CCCGCGCCGA GGATGTCCCC GAGGCTTTCA TGCGCGCCTA CGCGGTGGCG CTGCAGCCGC CGTCGGGCCC GGTGTATCTG TCCATCCCGC TCGACGACTG GAGCGTTCCC CTGGAAGGGC CCGCCGTTCT GCGCAGCGTG AGCACCACCT GCGCGCCTGA TATCGAGCGG CTGCGTGGCT TCGCCGAACG CCTCTCCGCG AGTCGGCGTC CTGCCCTGGT CTTCGGCCCC GAGGTGGATC GTAGCGGCGG CTGGCACGCT GCCGTCGCGC TGGCCGAGAA GCTGCGGGCA TCGGTGTACG GCGCGCCGCT GCCGGACCGA GCCTCGTTCC CCGAGAACCA CCGCCTCTAT CAGGGCCCGC TGGGTATGTC GCTCAAGGCC ATCAGCGACC GGCTGACCGG GCATGACCTC GTGACCATCA TCGGGGCCGA GGTGTTCCGC TACTACCCGT ACGTGCCCGG GGACATCCTG CCCGCCGGCA CCGAATTGCT CCACATTGCC GCGGACCCGG CGATGACCGG AGCGGCACGC GTTGGCGACA GCCTGCTTGG CGACCCCCGA CTGGCCATCG AACTGCTCAC CGACATGGTG AAGGATGGCG CTCGTACTCA GCCCGAGCCC ATGCCGCGGC CCCGCAAACT GCCCACGAAG CCGAGCAGCC CGCTGACGCC GGCGGAGGTC CACGCGACCG TCAGTACGGC CCGCCCACCG CACGCCGTCC TCGTCTACGA GTCGACTTCG AGCATGGCGG AGCAGGTCGA GTGGCTGCCG ACCATCGAGC CTGACTCGTT CTTCGCCACG GCCAGCGGTG GTATCGGCTG GGGTGTGCCT GCCGCGGTCG GCGTGGCTCT CGGGGACCGC GATCGCGGCG TAAGGCGCCC GGTGATCGGC CTGATCGGTG ATGGGTCGTT CCAGTACTCC GTGCAGGCCA TCTGGACTGC CGCGCAGCAC TCGCTGCCGA TCGTGTACGT CGTGCTGCGC AACAAGGAGT ACTCGATCCT CAAGTCCTTC GCCGAACTGG AAAGGACGCC CGGTGTGCCG GGGCTGGACC TGCCCGGGCT GGACATCGCG GCGGTGGCAC GGGGCTTCGG CTGCCGGGCA GTCGACGTCG AGACCACCGC TGCCCTGGAA AAAGAGTTCG CGGTCGCGCT GGAGGCCGGC ACCACCACTG TGATCGTGGT GCCGACCCAG CCGCAGAAGG CCGCGCTCTG A
|
Protein sequence | MVGEGSTVRT VTYDLLRSLG MTTVFGNPGS TEEPFLQKFP DDFTYVLGLQ EASVIAMADG FAQTTRRPAL VNVHSSAGLG NALGNLVAAY RGHTPLIVTS GQQHRELVIG EPYLGNRDAT NLPRPWVKWA YEPARAEDVP EAFMRAYAVA LQPPSGPVYL SIPLDDWSVP LEGPAVLRSV STTCAPDIER LRGFAERLSA SRRPALVFGP EVDRSGGWHA AVALAEKLRA SVYGAPLPDR ASFPENHRLY QGPLGMSLKA ISDRLTGHDL VTIIGAEVFR YYPYVPGDIL PAGTELLHIA ADPAMTGAAR VGDSLLGDPR LAIELLTDMV KDGARTQPEP MPRPRKLPTK PSSPLTPAEV HATVSTARPP HAVLVYESTS SMAEQVEWLP TIEPDSFFAT ASGGIGWGVP AAVGVALGDR DRGVRRPVIG LIGDGSFQYS VQAIWTAAQH SLPIVYVVLR NKEYSILKSF AELERTPGVP GLDLPGLDIA AVARGFGCRA VDVETTAALE KEFAVALEAG TTTVIVVPTQ PQKAAL
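The sequences above are displayed in space-separated blocks of ten characters. As a minimal sketch, the blocks can be joined and the GC fraction recomputed for comparison with the listed GC content. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example input is only the first two blocks of the gene sequence, not the full record.

```python
def clean_sequence(blocked: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C (0.0 for an empty string)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First two ten-base blocks of the gene sequence shown above.
example = "ATGGTGGGTG AAGGATCAAC"
seq = clean_sequence(example)
print(len(seq), gc_fraction(seq))
```

Passing the full gene sequence through the same two helpers gives a length and GC fraction that can be compared with the Gene Length and GC content fields.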