Gene Namu_4952 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_4952 
Symbol 
ID8450583 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp5528567 
End bp5530270 
Gene Length1704 bp 
Protein Length567 aa 
Translation table11 
GC content71% 
IMG OID645043990 
Productdihydroxy-acid dehydratase 
Protein accessionYP_003204214 
Protein GI258655058 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism 
COG ID[COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase 
TIGRFAM ID[TIGR00110] dihydroxy-acid dehydratase 


Plasmid Coverage information

Num covering plasmid clones52 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGAGA CCGCCGCCGA CGCCGCAGGA AAGCAGCACC GCAAGCCGCA CAGCCACATC 
GTCACCGACG GCATCGAGCG GGCTCCGGCC CGCGGCATGC TGCGGGCGGT GGGCATGGGC
GACGACGACT GGCGCAAGCC GCAGATCGGC GTCGCCAGTT CCTGGAACGA GATCACCCCG
TGCAACATGT CCCTGGACCG GCTGGCCAAG GCGGCCAAGC AGGGCGTGCA CGAGGCCGAC
GGCTACCCGC TGGAGTTCGG CACCATCTCG GTGTCCGACG GCATCTCCAT GGGCCACGGC
GGCATGCACT ACTCGCTGGT CAGCCGTGAG GTCATCGCCG ACTCGGTGGA GACGGTCTTC
CGGGCCGAGC AGCTCGACGG CGGGGTGCTG CTGGCCGGCT GCGACAAGTC CGAGCCCGGC
ATGCTGATGG CCGCGGCCCG CCTGGACATC GCGGCCGTGT TCCTCTACGC CGGCTCGACC
CTGCCCGGTC AGCTCGACGG CGAGACCGTC ACCATCATCG ACGCCTTCGA AGGCGTCGGC
GCCTGCCTGG CCGGCAAGAT CAGCCGGGAG CGGCTGACCG AGATCGAGAA GGCCATCTGT
CCGGGCGAGG GCGCCTGCGG CGGCATGTAC ACGGCCAACA CGATGGCCAG CGTGGCCGAG
GCGCTGGGCA TGTCGCTGCC CGGCAGCGCC GCGCCGCCCG CCCCGGACGC GCGCCGGGAC
ACCTACGCGA TCCGCAGCGG GCAGGCCGTC GTCGCCCTCA TCGACGCCGG CATCACCTCG
CGCGACATCC TGACCAAGAA GGCGTTCGAG AACGCGATCA CGGTGCTGAT GGCGCTGGGC
GGGTCGACGA ACGCGGTGCT GCACCTGATG GCCATCGCGC ACGAGGCCAA GGTCGACCTG
GCCCTGGAGG ACTTCAACCG GATCGGCGAG CGGACCCCGC ACCTGGCCGA TGTCAAGCCG
TTCGGCCGCT ACGTGATGAC CGACGTGGAC CGCATCGGCG GGGTGCCGGT GGTGATGAAG
GCGCTGCTGG ACGCGGGCCT GCTGCACGGG GACACGCTGA CCGTCACCGG CAAGACGATG
GCCGAGAACC TGGCCGAGCT GAACCCGCCG GAGCTGGACG GCGACGTGCT GCGCAAGCTG
TCCAATCCGA TCCACACCAC CGGCGGGATC ACCATCCTGC ACGGCTCGCT GGCCCCGGAG
GGCGCGGTGA TCAAGAGCGC CGGGATCGAG TACGCCGAGT TCACCGGCCC GGCCCGGGTG
TTCGACGGCG AGGCCGGGGT GCTCGAGGCG GTCACCAACG GCACGCTGGG CAAGGGCGAC
GTCATCGTCA TCCGCTGGGA GGGCCCCAAG GGCGGCCCGG GGATGCGCGA GATGCTCGCC
GTCACCGGCG CGATCAAGGG TGCCGGCCTG GGCAAGGACG TCCTGCTGCT CACCGACGGC
CGGTTCTCCG GCGGCACCAC CGGCCCCTGC ATCGGGCACA TCGCCCCGGA GGCAGCGCAC
GGCGGGCCGA TCGCGCTGGT GCAGGAGGGT GATCAGATCC GGCTCGACCT GGCCGCCAAG
ACCCTGGACC TGCTGGTCGA CGAGGCCGAG CTGGAGCGAC GCCGGGCCGA TTGGGAGCCC
CGCGAGCAGA ACCTGAACTT CGGGGTCGCC GGCAAGTACG CCAAGCTGGT CGGCTCGGCC
GCCAAGGGCG CCGTCTGCTT CTGA
 
Protein sequence
MSETAADAAG KQHRKPHSHI VTDGIERAPA RGMLRAVGMG DDDWRKPQIG VASSWNEITP 
CNMSLDRLAK AAKQGVHEAD GYPLEFGTIS VSDGISMGHG GMHYSLVSRE VIADSVETVF
RAEQLDGGVL LAGCDKSEPG MLMAAARLDI AAVFLYAGST LPGQLDGETV TIIDAFEGVG
ACLAGKISRE RLTEIEKAIC PGEGACGGMY TANTMASVAE ALGMSLPGSA APPAPDARRD
TYAIRSGQAV VALIDAGITS RDILTKKAFE NAITVLMALG GSTNAVLHLM AIAHEAKVDL
ALEDFNRIGE RTPHLADVKP FGRYVMTDVD RIGGVPVVMK ALLDAGLLHG DTLTVTGKTM
AENLAELNPP ELDGDVLRKL SNPIHTTGGI TILHGSLAPE GAVIKSAGIE YAEFTGPARV
FDGEAGVLEA VTNGTLGKGD VIVIRWEGPK GGPGMREMLA VTGAIKGAGL GKDVLLLTDG
RFSGGTTGPC IGHIAPEAAH GGPIALVQEG DQIRLDLAAK TLDLLVDEAE LERRRADWEP
REQNLNFGVA GKYAKLVGSA AKGAVCF