Gene Franean1_1424 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1424 
Symbol 
ID5675680 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp1717585 
End bp1719195 
Gene Length1611 bp 
Protein Length536 aa 
Translation table11 
GC content68% 
IMG OID641240345 
Productthiamine pyrophosphate binding domain-containing protein 
Protein accessionYP_001505772 
Protein GI158313264 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTGGGTG AAGGATCAAC GGTTCGCACG GTGACCTACG ACCTGCTGAG GTCGTTGGGG 
ATGACCACCG TCTTCGGTAA CCCCGGGTCT ACCGAAGAGC CGTTCCTCCA GAAGTTTCCG
GACGACTTCA CCTATGTTCT TGGTCTTCAG GAGGCGTCCG TTATCGCCAT GGCGGACGGT
TTCGCGCAGA CCACGCGGCG TCCGGCGCTG GTGAACGTGC ATTCTTCCGC GGGTCTTGGT
AACGCCCTCG GCAACCTGGT CGCGGCCTAC CGCGGCCATA CGCCGCTGAT CGTCACGTCC
GGTCAGCAGC ACCGCGAACT GGTGATCGGT GAGCCTTACC TCGGCAACCG CGACGCAACG
AACCTACCCA GACCGTGGGT GAAGTGGGCC TACGAGCCGG CCCGCGCCGA GGATGTCCCC
GAGGCTTTCA TGCGCGCCTA CGCGGTGGCG CTGCAGCCGC CGTCGGGCCC GGTGTATCTG
TCCATCCCGC TCGACGACTG GAGCGTTCCC CTGGAAGGGC CCGCCGTTCT GCGCAGCGTG
AGCACCACCT GCGCGCCTGA TATCGAGCGG CTGCGTGGCT TCGCCGAACG CCTCTCCGCG
AGTCGGCGTC CTGCCCTGGT CTTCGGCCCC GAGGTGGATC GTAGCGGCGG CTGGCACGCT
GCCGTCGCGC TGGCCGAGAA GCTGCGGGCA TCGGTGTACG GCGCGCCGCT GCCGGACCGA
GCCTCGTTCC CCGAGAACCA CCGCCTCTAT CAGGGCCCGC TGGGTATGTC GCTCAAGGCC
ATCAGCGACC GGCTGACCGG GCATGACCTC GTGACCATCA TCGGGGCCGA GGTGTTCCGC
TACTACCCGT ACGTGCCCGG GGACATCCTG CCCGCCGGCA CCGAATTGCT CCACATTGCC
GCGGACCCGG CGATGACCGG AGCGGCACGC GTTGGCGACA GCCTGCTTGG CGACCCCCGA
CTGGCCATCG AACTGCTCAC CGACATGGTG AAGGATGGCG CTCGTACTCA GCCCGAGCCC
ATGCCGCGGC CCCGCAAACT GCCCACGAAG CCGAGCAGCC CGCTGACGCC GGCGGAGGTC
CACGCGACCG TCAGTACGGC CCGCCCACCG CACGCCGTCC TCGTCTACGA GTCGACTTCG
AGCATGGCGG AGCAGGTCGA GTGGCTGCCG ACCATCGAGC CTGACTCGTT CTTCGCCACG
GCCAGCGGTG GTATCGGCTG GGGTGTGCCT GCCGCGGTCG GCGTGGCTCT CGGGGACCGC
GATCGCGGCG TAAGGCGCCC GGTGATCGGC CTGATCGGTG ATGGGTCGTT CCAGTACTCC
GTGCAGGCCA TCTGGACTGC CGCGCAGCAC TCGCTGCCGA TCGTGTACGT CGTGCTGCGC
AACAAGGAGT ACTCGATCCT CAAGTCCTTC GCCGAACTGG AAAGGACGCC CGGTGTGCCG
GGGCTGGACC TGCCCGGGCT GGACATCGCG GCGGTGGCAC GGGGCTTCGG CTGCCGGGCA
GTCGACGTCG AGACCACCGC TGCCCTGGAA AAAGAGTTCG CGGTCGCGCT GGAGGCCGGC
ACCACCACTG TGATCGTGGT GCCGACCCAG CCGCAGAAGG CCGCGCTCTG A
 
Protein sequence
MVGEGSTVRT VTYDLLRSLG MTTVFGNPGS TEEPFLQKFP DDFTYVLGLQ EASVIAMADG 
FAQTTRRPAL VNVHSSAGLG NALGNLVAAY RGHTPLIVTS GQQHRELVIG EPYLGNRDAT
NLPRPWVKWA YEPARAEDVP EAFMRAYAVA LQPPSGPVYL SIPLDDWSVP LEGPAVLRSV
STTCAPDIER LRGFAERLSA SRRPALVFGP EVDRSGGWHA AVALAEKLRA SVYGAPLPDR
ASFPENHRLY QGPLGMSLKA ISDRLTGHDL VTIIGAEVFR YYPYVPGDIL PAGTELLHIA
ADPAMTGAAR VGDSLLGDPR LAIELLTDMV KDGARTQPEP MPRPRKLPTK PSSPLTPAEV
HATVSTARPP HAVLVYESTS SMAEQVEWLP TIEPDSFFAT ASGGIGWGVP AAVGVALGDR
DRGVRRPVIG LIGDGSFQYS VQAIWTAAQH SLPIVYVVLR NKEYSILKSF AELERTPGVP
GLDLPGLDIA AVARGFGCRA VDVETTAALE KEFAVALEAG TTTVIVVPTQ PQKAAL