Gene Information |
Locus tag | Namu_4952 |
Symbol | |
ID | 8450583 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | - |
Start bp | 5528567 |
End bp | 5530270 |
Gene Length | 1704 bp |
Protein Length | 567 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 645043990 |
Product | dihydroxy-acid dehydratase |
Protein accession | YP_003204214 |
Protein GI | 258655058 |
COG category | [E] Amino acid transport and metabolism [G] Carbohydrate transport and metabolism |
COG ID | [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase |
TIGRFAM ID | [TIGR00110] dihydroxy-acid dehydratase |
|
|
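The coordinate, length, and protein-length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS includes a single terminal stop codon (the gene sequence in this record ends in TGA). The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which is not translated.
    return gene_len_bp // 3 - 1


length = gene_length(5528567, 5530270)
print(length)                  # 1704
print(protein_length(length))  # 567
```

Both values agree with the Gene Length (1704 bp) and Protein Length (567 aa) fields reported in this record.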
Plasmid Coverage information |
Num covering plasmid clones | 52 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGAGA CCGCCGCCGA CGCCGCAGGA AAGCAGCACC GCAAGCCGCA CAGCCACATC GTCACCGACG GCATCGAGCG GGCTCCGGCC CGCGGCATGC TGCGGGCGGT GGGCATGGGC GACGACGACT GGCGCAAGCC GCAGATCGGC GTCGCCAGTT CCTGGAACGA GATCACCCCG TGCAACATGT CCCTGGACCG GCTGGCCAAG GCGGCCAAGC AGGGCGTGCA CGAGGCCGAC GGCTACCCGC TGGAGTTCGG CACCATCTCG GTGTCCGACG GCATCTCCAT GGGCCACGGC GGCATGCACT ACTCGCTGGT CAGCCGTGAG GTCATCGCCG ACTCGGTGGA GACGGTCTTC CGGGCCGAGC AGCTCGACGG CGGGGTGCTG CTGGCCGGCT GCGACAAGTC CGAGCCCGGC ATGCTGATGG CCGCGGCCCG CCTGGACATC GCGGCCGTGT TCCTCTACGC CGGCTCGACC CTGCCCGGTC AGCTCGACGG CGAGACCGTC ACCATCATCG ACGCCTTCGA AGGCGTCGGC GCCTGCCTGG CCGGCAAGAT CAGCCGGGAG CGGCTGACCG AGATCGAGAA GGCCATCTGT CCGGGCGAGG GCGCCTGCGG CGGCATGTAC ACGGCCAACA CGATGGCCAG CGTGGCCGAG GCGCTGGGCA TGTCGCTGCC CGGCAGCGCC GCGCCGCCCG CCCCGGACGC GCGCCGGGAC ACCTACGCGA TCCGCAGCGG GCAGGCCGTC GTCGCCCTCA TCGACGCCGG CATCACCTCG CGCGACATCC TGACCAAGAA GGCGTTCGAG AACGCGATCA CGGTGCTGAT GGCGCTGGGC GGGTCGACGA ACGCGGTGCT GCACCTGATG GCCATCGCGC ACGAGGCCAA GGTCGACCTG GCCCTGGAGG ACTTCAACCG GATCGGCGAG CGGACCCCGC ACCTGGCCGA TGTCAAGCCG TTCGGCCGCT ACGTGATGAC CGACGTGGAC CGCATCGGCG GGGTGCCGGT GGTGATGAAG GCGCTGCTGG ACGCGGGCCT GCTGCACGGG GACACGCTGA CCGTCACCGG CAAGACGATG GCCGAGAACC TGGCCGAGCT GAACCCGCCG GAGCTGGACG GCGACGTGCT GCGCAAGCTG TCCAATCCGA TCCACACCAC CGGCGGGATC ACCATCCTGC ACGGCTCGCT GGCCCCGGAG GGCGCGGTGA TCAAGAGCGC CGGGATCGAG TACGCCGAGT TCACCGGCCC GGCCCGGGTG TTCGACGGCG AGGCCGGGGT GCTCGAGGCG GTCACCAACG GCACGCTGGG CAAGGGCGAC GTCATCGTCA TCCGCTGGGA GGGCCCCAAG GGCGGCCCGG GGATGCGCGA GATGCTCGCC GTCACCGGCG CGATCAAGGG TGCCGGCCTG GGCAAGGACG TCCTGCTGCT CACCGACGGC CGGTTCTCCG GCGGCACCAC CGGCCCCTGC ATCGGGCACA TCGCCCCGGA GGCAGCGCAC GGCGGGCCGA TCGCGCTGGT GCAGGAGGGT GATCAGATCC GGCTCGACCT GGCCGCCAAG ACCCTGGACC TGCTGGTCGA CGAGGCCGAG CTGGAGCGAC GCCGGGCCGA TTGGGAGCCC CGCGAGCAGA ACCTGAACTT CGGGGTCGCC GGCAAGTACG CCAAGCTGGT CGGCTCGGCC GCCAAGGGCG CCGTCTGCTT CTGA
|
Protein sequence | MSETAADAAG KQHRKPHSHI VTDGIERAPA RGMLRAVGMG DDDWRKPQIG VASSWNEITP CNMSLDRLAK AAKQGVHEAD GYPLEFGTIS VSDGISMGHG GMHYSLVSRE VIADSVETVF RAEQLDGGVL LAGCDKSEPG MLMAAARLDI AAVFLYAGST LPGQLDGETV TIIDAFEGVG ACLAGKISRE RLTEIEKAIC PGEGACGGMY TANTMASVAE ALGMSLPGSA APPAPDARRD TYAIRSGQAV VALIDAGITS RDILTKKAFE NAITVLMALG GSTNAVLHLM AIAHEAKVDL ALEDFNRIGE RTPHLADVKP FGRYVMTDVD RIGGVPVVMK ALLDAGLLHG DTLTVTGKTM AENLAELNPP ELDGDVLRKL SNPIHTTGGI TILHGSLAPE GAVIKSAGIE YAEFTGPARV FDGEAGVLEA VTNGTLGKGD VIVIRWEGPK GGPGMREMLA VTGAIKGAGL GKDVLLLTDG RFSGGTTGPC IGHIAPEAAH GGPIALVQEG DQIRLDLAAK TLDLLVDEAE LERRRADWEP REQNLNFGVA GKYAKLVGSA AKGAVCF
|
| |
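The sequences above are displayed in space-separated blocks of ten characters, so they need to be joined before any analysis. The sketch below shows one way to strip the grouping and compute a GC fraction, which could then be compared against the GC content field of the record. The helper names are hypothetical, and the example uses only the first two blocks of the gene sequence rather than the full 1704 bp.

```python
def clean_sequence(raw: str) -> str:
    """Remove the whitespace used for display grouping and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two ten-base blocks of the gene sequence shown in this record.
example = clean_sequence("ATGAGCGAGA CCGCCGCCGA")
print(len(example))                    # 20
print(round(gc_fraction(example), 2))  # 0.7
```

Running the same two functions over the full gene sequence would give a value to compare with the reported 71% GC content.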