Gene Namu_3686 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_3686 
Symbol 
ID8449305 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp4043694 
End bp4045493 
Gene Length1800 bp 
Protein Length599 aa 
Translation table11 
GC content69% 
IMG OID645042750 
Productthiamine pyrophosphate protein TPP binding domain protein 
Protein accessionYP_003202986 
Protein GI258653830 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.0906801 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.0462514 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGAAT TGGTGGCGGA TGTGGTGTTG TCCCGGTTGC GAGAATGGGG GGTGCGGCAG 
GTCTTCGGGT ACCCGGGGGA CGGGATCAAC GGCCTGTTGG CGGCGTGGGG GAGGGCGAAG
GACGACCCGC AGTTCGTGCA GGCCCGGCAC GAGGAGATGG CCGCGTTCGC CGCCGTCGGC
TTCGCCAAGT TCAGCGGTCG GGTCGGGGTG TGCGTGGCGA CCAGCGGACC CGGCGCGATC
CACCTGCTGA ACGGGCTGTA CGACGCGAAA CTCGATCACG TCCCGGTCGT CGCGATCGTC
GGGCAGACGG CCCGCTCGGC CATGGGCGGC TCGTACCAGC AGGAGGTCGA CCTGCTCTCG
CTGTTCAAGG ACGTCTGCAG CGACTACGTG CAGATGTGTA CCGTCCCACA GCAGCTGCCG
AACCTGATCG ACCGGGCGAT CCGGATCGCT CAGACCGAGC ACGCCCCGAC CTGCGTGATC
GTGCCGTCCG ACGTGTTCGA CCTGGACTAC GAACCGCCGG GGCACGAGTT CAAGCAGGTC
CCGTCCAGCG TCGGCACCGC CTGGGCGACC GCCGCCCCGG ACCCGGACGC GGTCCGCGCC
GCGGCGGACC TGCTCAACGC CGGGGAGAAG GTCGCGTTGC TGGTCGGTCA GGGGGCCAGA
GGCTGCGAAG CCGAGCTGAC CGAAGTCGCG GACCTGCTGG GCGCCGGCGC CGCGAAGGCG
TTGCTGGGCA AGGACGTGCT GCCCGACACC CTGCCCTGGG TCACCGGTTC GATTGGCCTG
CTGGGCACCA CCGCCAGCTA CCGGCTGATG ATGGGCTGCG ACACGCTCCT GACCATCGGG
TCGAACTTCC CGTACACCCA GTTCATGCCG GACCTCGGGC AGGCCCGGGC CGTGCAGATC
GACCGGTCCG GCAAGTGGAT CGGCATGCGG TACCCCTACG AGATCAACCT CGTCGGGGAC
GCGAAGGCCA CTCTCAAGGC GCTGATCCCG CTGTTGAACC GGAAGGCCGA CCGGAGCTGG
CGGGACCGGG TGCAGGCCGA TGTCGCGGAC TGGTGGCAGA CCGCCGAGCG CCGCGCGTTG
ACCGCCGCCG ATCCGGTCAA CCCGATGCGG ATCTTCCATG AACTGTCGCA GCGGTTGCCC
GTCGACGCCA TCGTGGTCAG CGATTCGGGC AGCGCAGCGA ACTGGTACGC CCGGCACCTG
CGCTTTCACG GCGACATCCG CGGGTCACTG TCCGGGACGC TGGCCACGAT GGGTCCGGGG
GTGCCGTACG CGATCGGCGC GAAATGGGCG CACCCGGACC GACCGGTGAT CGCCCTGGTG
GGAGACGGAG CGATGCAGAT GAACGGACTG GCCGAGCTCA TCACCATCTC GCACTACTGG
TCGCAATGGG CCGACCCGCG GCTCATCGTC GCGGTGCTGC ACAACAACGA CCTCAACCAG
GTCACCTGGG AGATGCGGGC CATGTCGGGT GCCCCCAAGT TCGCCGAATC GCAGACTCTC
CCGGACGTCG ACTACGCCGG ATTCGCGACC GGTTTGGGTC TGTCCGGCGT CCGGATCGAC
GACCCCGATG CGCTGGGCCC GGCCTGGGCG ACCGCGTTGG CGGCGACCCG GCCCACCGTG
CTGGACGTGA TCTGCGACCC GGATGTGCCG CCGATCCCGC CGCACGCCAC CTTCGATCAG
GTCAAGTCCG TCGCCGGGGC GGTGCTGCAC GGTGACGAGG ACGCCTGGGG TTTCGTCAAA
CAGGGTGTGA AACAGAAGGT GCAGCAGTAT CTGCCCGGAA CCAAGGGGGG AACGTCATGA
 
Protein sequence
MAELVADVVL SRLREWGVRQ VFGYPGDGIN GLLAAWGRAK DDPQFVQARH EEMAAFAAVG 
FAKFSGRVGV CVATSGPGAI HLLNGLYDAK LDHVPVVAIV GQTARSAMGG SYQQEVDLLS
LFKDVCSDYV QMCTVPQQLP NLIDRAIRIA QTEHAPTCVI VPSDVFDLDY EPPGHEFKQV
PSSVGTAWAT AAPDPDAVRA AADLLNAGEK VALLVGQGAR GCEAELTEVA DLLGAGAAKA
LLGKDVLPDT LPWVTGSIGL LGTTASYRLM MGCDTLLTIG SNFPYTQFMP DLGQARAVQI
DRSGKWIGMR YPYEINLVGD AKATLKALIP LLNRKADRSW RDRVQADVAD WWQTAERRAL
TAADPVNPMR IFHELSQRLP VDAIVVSDSG SAANWYARHL RFHGDIRGSL SGTLATMGPG
VPYAIGAKWA HPDRPVIALV GDGAMQMNGL AELITISHYW SQWADPRLIV AVLHNNDLNQ
VTWEMRAMSG APKFAESQTL PDVDYAGFAT GLGLSGVRID DPDALGPAWA TALAATRPTV
LDVICDPDVP PIPPHATFDQ VKSVAGAVLH GDEDAWGFVK QGVKQKVQQY LPGTKGGTS