Gene Namu_1982 details

Gene Information

Locus tag: Namu_1982
Symbol:
ID: 8447591
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 2190520
End bp: 2192310
Gene Length: 1791 bp
Protein Length: 596 aa
Translation table: 11
GC content: 72%
IMG OID: 645041110
Product: thiamine pyrophosphate protein
Protein accession: YP_003201356
Protein GI: 258652200
COG category: [E] Amino acid transport and metabolism
              [H] Coenzyme transport and metabolism
COG ID: [COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase]
TIGRFAM ID:
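
The start, end, gene length, and protein length fields above are consistent with each other, which a few lines of arithmetic can confirm. This is a minimal sketch, assuming the coordinates are 1-based and inclusive at both ends and that the final codon is a stop codon that does not contribute a residue; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 2190520
end_bp = 2192310

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# One codon per residue, minus the terminal stop codon.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1791 596
```

Under these assumptions the computed values match the listed Gene Length (1791 bp) and Protein Length (596 aa).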


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.0200304
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.0807898
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTGGAGC GCACGGTCGC TGATCTGATC GTCGATCGCC TGCAAACGTG GGGTGTACGG 
CGAATTTTCG GCTACAGCGG GGACGGCATC AACGCGTTCA TGGGCGCATT GCGCCGGGCG
CAGGACCGGG TGGAGTTCGT CCAGGCCCGG CATGAGGAGA ATGCCGCGTT CATGGCCGTC
GGCCACGCCA AGTACTCCGG GGACGTCGGT GTGGTCACCT CGACCCAGGG TCCGGGGGCG
GTCCACCTGC TCAACGGGCT CTATGACGCC AAGCTCGACG GGGTGCCGGT GGTGGCGATC
ATCGGCCAGC AGAGCACCAG CGTGCTGGGC TCGGCCTACA TGCAGGAGAT CGACCTGCCG
GCGCTGGTCA AGGACGTCGC GCCGGCGTTC GCGCAGCAGG TCGCCGCGGC CGAGCAGCTG
CCGATGGTGC TCGACCGGGC CTTCCGCGCC GCCCTGACCG AGCGCGGGCC GGCGGTCGTG
ATCGTCCCGC ACGACGTGCA GAAGCAGCCG GCGCCCGACC TCGGGCAGGA GCACGGCATC
GTGGTGACCG CGCCCACCTG GGCCCGGCCC CGGATCCTGC CCGCCGAGGA CGATCTGGCG
GCCGCGGCGG AGCTGCTCAA CGCCGGCTCC CGGGTCGCCC TGCTGGTCGG TCAGGGGTCC
CGGCACGCCC GGGACGAGGT GCTGGCGGTG GCCCGGGCGC TGGGCGCCGG AATCACCACC
AGCCTGCTGG GCAAGCCGTA CGTCGACGAG ACGCTGCCGT TCGTCGTGGG CACCATGGGC
CACCTCGGCA CGACGGCCAG CGCGTTCCTG ATGAGCACCT GCGACACGCT GCTCATCGTC
GGGTCCAACG ACCCGTGGAC CGAGTTCTAC CCGCCGCCCG GCCAGGCCCG GGCCGTGCAG
ATCGACATCG ACGGGCGCAA CGTCGGCAAC CGTTACCCCA TCGAGGTGAG CCTGGTCGGG
GACGCCGCCG GCACCCTGGC CGAGCTGCTG CCGCGGCTGC GTCCTCGCCA GCCCGACACG
TGGTCTCGGG ACGTGGCCGG CGCGGTGGAC GGCTGGCGCC GCCTGTCCCG CGAGCGCGCC
CACACCCCGG CCGATCCGGT GAACCCGGAA CGGGTGGTCT TCGAACTCGA TTCCCGGCTG
CCGGCCGATG CCCAGGTCGC GATCGACGTG GGCAGCTGCG TGTACTGGTA CGCCCGGCAG
TTGCGGTTGC CGGTCGGGGT GCCGGCGCAC CTGTCCGGCA CGCTGGCCAG CATGGGCTGC
TCGATCCCGT ACGGCATCGC CGCCAAGCTG CTGCATCCCG ATCGACCCCT GGTCGCGCTG
ACCGGGGACG GGGCGATGCA GATGGCCGGG CTGTCCGAAC TGGTCACCGT GTCCCGGATG
TGGCCGCGCT GGTCCGACCC GCGATTCGTG GTCTGCGTGC TGCACAACGG CGATCTGGCC
GAGGTGTCTT GGGAGCAAAG GGAAATGGAG GGCGAGCCGC TGTTCCCGGA CAGCCAGGAC
CTGCCGGACG TCCCGTACGC CCGTTACGCC GAGTTGCTCG GTCTGCGCGG CATCCGCGTC
GACGACCCGG AGCGCCTGGG AGCGGCCTGG GACGAGGCGC TGGCCGCCGA CCGGCCGACC
GTCATCGAGG TGATCACCGA CCGGGCCGTG CCGCTGCTCG CCCCGTTCCC GGCCGGCGCC
GCCACGCTCG AGAAGATGCA TCAGGGCATC GCGGCGGAAG GTGACGCCGG CCGGCCGGCC
GCGGCGCTGC TGAACGTCTA CGCCGCGCAG GAGGGGTGGC GGGCCCCGTA G
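
The GC content field in the gene information can in principle be recomputed from the gene sequence. The sketch below shows one way to do that for sequence text laid out as above, in space-separated blocks of ten bases; `gc_content` is a hypothetical helper name, and only the first row of the sequence is used as sample input, so the printed figure is for that row alone and not for the whole gene.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in a DNA sequence.

    Whitespace (the 10-base grouping and line breaks) is ignored.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# First row of the gene sequence, copied from the record.
first_row = "ATGGTGGAGC GCACGGTCGC TGATCTGATC GTCGATCGCC TGCAAACGTG GGGTGTACGG"
print(round(gc_content(first_row), 1))
```

Passing the full 1791 bp sequence as one string gives the gene-wide percentage to compare against the listed value.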
 
Protein sequence
MVERTVADLI VDRLQTWGVR RIFGYSGDGI NAFMGALRRA QDRVEFVQAR HEENAAFMAV 
GHAKYSGDVG VVTSTQGPGA VHLLNGLYDA KLDGVPVVAI IGQQSTSVLG SAYMQEIDLP
ALVKDVAPAF AQQVAAAEQL PMVLDRAFRA ALTERGPAVV IVPHDVQKQP APDLGQEHGI
VVTAPTWARP RILPAEDDLA AAAELLNAGS RVALLVGQGS RHARDEVLAV ARALGAGITT
SLLGKPYVDE TLPFVVGTMG HLGTTASAFL MSTCDTLLIV GSNDPWTEFY PPPGQARAVQ
IDIDGRNVGN RYPIEVSLVG DAAGTLAELL PRLRPRQPDT WSRDVAGAVD GWRRLSRERA
HTPADPVNPE RVVFELDSRL PADAQVAIDV GSCVYWYARQ LRLPVGVPAH LSGTLASMGC
SIPYGIAAKL LHPDRPLVAL TGDGAMQMAG LSELVTVSRM WPRWSDPRFV VCVLHNGDLA
EVSWEQREME GEPLFPDSQD LPDVPYARYA ELLGLRGIRV DDPERLGAAW DEALAADRPT
VIEVITDRAV PLLAPFPAGA ATLEKMHQGI AAEGDAGRPA AALLNVYAAQ EGWRAP
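
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. The following is a minimal sketch of that translation, not the tool used to produce this record. It relies on the fact that table 11 uses the same codon-to-amino-acid assignments as the standard code, and it does not model the alternative start codons of table 11 (this gene begins with ATG, so that does not matter here). `CODON_TABLE` and `translate` are illustrative names.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate DNA to protein, stopping at the first stop codon.

    Whitespace in the input (10-base blocks, line breaks) is ignored.
    """
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record.
print(translate("ATGGTGGAGC GCACGGTCGC TGATCTGATC"))  # MVERTVADLI
```

The output matches the first ten residues of the protein sequence listed above.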