Gene Namu_4539 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_4539 
Symbol 
ID8450167 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp5047100 
End bp5049013 
Gene Length1914 bp 
Protein Length637 aa 
Translation table11 
GC content70% 
IMG OID645043580 
Productthiamine pyrophosphate protein central region 
Protein accessionYP_003203807 
Protein GI258654651 
COG category[E] Amino acid transport and metabolism 
COG ID[COG3962] Acetolactate synthase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.366125 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCAGCG CAGCAGCGTC GCACGCGGGA CAGACCGTCC GACTGACCGT CGGGCAGGCG 
GTCGTGAAGT TCCTGGGCAA TCAGTACAGC GAGCGGGACG GTGTGCGGCG CAAGCTGTTC
GCCGGCTGCT TCGGCATCTT CGGGCACGGC AACGTGGCCG GTCTGGGGCA GGCGCTGCTG
CAGGCCGAGC TGGCGGAGCC GGAGCTGCTG CCCTATCACC AGGGTCGCAA CGAGCAGGCG
ATGGTGCACA TTGCGGTCGG GTACGCCCGG CAGAAGGACC GGCTGGAGAC GTTCGCAGTG
ACCGCGTCGG TCGGCCCGGG CTCGTCGAAC ATGCTGACCG GGGCCGCGCT GGCCACCATC
AACCGGTTGC CGGTGCTGCT GCTGGCCAGC GACATCTTCG CCACCCGGGT GGCCTCGCCG
GTCCTGCAGG AGCTGGAACA GCCTTTCGGG TACGACGTCT CGGTCAACGA CGCGTTCCGG
CCGCTGAGCA AGTTCTTCGA CCGGGTGTGG CGGCCCGAGC AGTTGCCGGC GGCCCTGCTG
GGCGCGATGC GGGTGCTGAC CGATCCGGCG GAGACGGGGG CGGTGACGCT GGCCCTGCCG
GAGGACGTGC AGGCCGAGGC GTTCGACTGG CCGGTGGAGC TGTTCGCCGA GCGGGTCTGG
CGGATCGGCC GTCCGGTGCC CGAGCCCGAG GTGATCGCCG CCGCCGCGCA GATCATCCGC
AACGCCAAGG CGCCGCTGAT CGTGTCCGGG GGCGGGGTCA CCTACGCGGA GGCCAACGAC
GAGCTGCGGG CGTTCGTGGA GGCGACCGGC ATCCCGATCA GCGAGACCCA GGCCGGCAAG
GGATCGTTGC CGTTCGACCA CCCGCTGAAC CTGGGCGCGA TCGGCGCGAC CGGTTCCCCG
GCGGCCAACC ACTTCGCCCG CCGGGCCGAC GTGGTGATCG GGATCGGCAC CCGGTACAGC
GATTTCACCA CCGGGTCGAA GACGATCTGG CAGCACCCGG ATGTGCGGTT CGTGAACATC
AACGTGGCCG GGTTGGACGC GTTCAAGCTG GCCGGGTACC CGGTGGTGGC CGACGCGAAG
CGGGCGCTGC CGGCGTTGGC TCAGGCCCTG TCGGGGTACT CGTCCGGACC GGAGTTCCAG
GCGGAGGTCA CCGCCCGGGC CAAGGCGTGG GACGACGAGG TGGTGGCCTC TCACCACAGC
GGGTACGGCG AGACCCACGA GTTCCTGGCC CAGGCCGAGG TTCTCGGCGC GGTCGAGGCG
GCCATGGAGC CGACCGACGT GGTGGTCTGC GCGGCCGGTT CGCTGCCCGG CGATCTGCAC
GGCATGTGGC GCACCCGGGA ACGCAAGGGG TATCACGTCG AGTACGGGTA CTCGTGCATG
GGCTACGAGA TTCCGGGAGC GATCGGGATC AAGCTGGCCG CCCCGGAGCG GGACGTGTTC
GTCACCGTCG GCGACGGCTC GTACCTGATG ATGCCGACCG AGCTGGTGAC CGCGGTGCAG
GAGGGCATCA AGATCATCGT GGTGCTCTTG CAGAACCATG GGTACGCCTC GATCGGCGCG
CTGGCCCAGT CGCTGGGGGT GCAGCGGTTC GGCACCAAGT ACCGGTACCG CAACGCCGAA
TCGGGCCGGC TGGACGGCGG GAAGCTGCCG GTCGACCTGG CCCTGAACGC CGAGTCCATG
GGTGTCACGG TCTACCGGAC GAGGACCCTC GCCGAGCTCA AGGATGCGCT GGCCGCGGCG
AAGGCCGGCG ACGAGCCGTG CCTGGTCCAC GTGGACACCG ACCTGGAGTT CCACTCGCCC
AAGGGCGACG GCTGGTGGGA CGTGCCCGTC GCGCAGGTCT CCACACTGGA CTTCACCCAG
GACGCCCGGA TCCGCTACGA GAAGTCCCGC GCCGACCAGA AGCCCTACCT GTAG
 
Protein sequence
MTSAAASHAG QTVRLTVGQA VVKFLGNQYS ERDGVRRKLF AGCFGIFGHG NVAGLGQALL 
QAELAEPELL PYHQGRNEQA MVHIAVGYAR QKDRLETFAV TASVGPGSSN MLTGAALATI
NRLPVLLLAS DIFATRVASP VLQELEQPFG YDVSVNDAFR PLSKFFDRVW RPEQLPAALL
GAMRVLTDPA ETGAVTLALP EDVQAEAFDW PVELFAERVW RIGRPVPEPE VIAAAAQIIR
NAKAPLIVSG GGVTYAEAND ELRAFVEATG IPISETQAGK GSLPFDHPLN LGAIGATGSP
AANHFARRAD VVIGIGTRYS DFTTGSKTIW QHPDVRFVNI NVAGLDAFKL AGYPVVADAK
RALPALAQAL SGYSSGPEFQ AEVTARAKAW DDEVVASHHS GYGETHEFLA QAEVLGAVEA
AMEPTDVVVC AAGSLPGDLH GMWRTRERKG YHVEYGYSCM GYEIPGAIGI KLAAPERDVF
VTVGDGSYLM MPTELVTAVQ EGIKIIVVLL QNHGYASIGA LAQSLGVQRF GTKYRYRNAE
SGRLDGGKLP VDLALNAESM GVTVYRTRTL AELKDALAAA KAGDEPCLVH VDTDLEFHSP
KGDGWWDVPV AQVSTLDFTQ DARIRYEKSR ADQKPYL