Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Namu_4268 |
Symbol | |
ID | 8449894 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | - |
Start bp | 4747048 |
End bp | 4748898 |
Gene Length | 1851 bp |
Protein Length | 616 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 645043316 |
Product | dihydroxy-acid dehydratase |
Protein accession | YP_003203545 |
Protein GI | 258654389 |
COG category | [E] Amino acid transport and metabolism [G] Carbohydrate transport and metabolism |
COG ID | [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase |
TIGRFAM ID | [TIGR00110] dihydroxy-acid dehydratase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.0784946 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGAACC TGCGCTCCCG AACGGTGACC CACGGACGGA ACATGGCCGG CGCCCGGGCC CTGCTGCGGG CGGCCGGCGT CGCCGGGTCC GACATCGGCA AGCCGATCGT CGCCGTGGCC AACAGCTTCA CCGAGTTCGT CCCCGGCCAC ACCCACCTGC AGCCGGTCGG CCGGATCGTG GCTGAGGCGA TCACCGCGGC CGGCGGCGTG CCCCGGGAGT TCAACACCAT CGCCGTCGAC GACGGGATCG CGATGGGCCA CGGCGGCATG CTGTACTCGC TGCCGTCCCG GGATCTGATC GCCGATTCCG TGGAGTACAT GGTCAACGCG CACTGCGCCG ACGCGCTGGT CTGCATCTCC AACTGCGACA AGATCACCCC CGGGATGCTG ATGGCGGCGC TCCGGCTGAA CATCCCCACG GTCTTCGTCT CCGGCGGGCC GATGGAGGGC GGCCGAGCGG TCCTGGTCGA CGGCACTGTC CGCACCGGGT TGAACCTGAT CACGGCCATC GCCGACTCGG CCAGCGCCTC GGTGTCCGAC GAGGATCTCG ACCGCATCGA GGAAAACGCC TGCCCCACCT GCGGATCCTG CTCGGGGATG TTCACGGCCA ACTCGATGAA CTGCCTGACC GAGGCGCTCG GGCTGGCCCT GCCCGGCAAC GGCACCACTC TGGCCACCCA CACCGCCCGG CGGGCCCTGT ACGAGGCGGC CGGTGCCACG GTGATGGACC TGGTTCGGTC CTGCTACGAG CTCGGCGACG ATGGGGTGCG GCCGCGCGCG ATCGCCGGCC GGGCGGCGTT CACCAACGCC ATGGCCATGG ACATCGCGAT GGGCGGGTCG ACCAACACGA TCCTGCATCT GCTGGCCGCC GCGCACGAGG CCGAGCTCGA CTTCACCCTG GCCGACATCG ACCGGATCTC CCGAAACACC CCGTGTCTGG CCAAGGTCGC ACCGAACGGG GCGTACCTGG TGCAGGACGT GCACCGGGCC GGTGGCATCC CGGCCATCCT GGGCGAGCTG GACCGGGCCG GGCTGCTACA CCGCGACGTC CGGTCGATCC ACAGCCCCGA CCTGCGCGGC TGGCTCGACG ACTGGGATGT TCGGGGCGGC CGAGCCACGG CCGCCGCGGT CGAGTTGTTC CACGCCGCCC CCGGCGGCCG TCGCTCGGCG ACCGCGTTCT CCCAATCCGA GCGCTGGGAA GCGCTCGACC TCGACAGCGA GAACGGGTGC ATCCGCGCGG TTTCGCACGC CTACAGCACC GATGGCGGGC TGGCGGTGCT GCGCGGCAAC CTGGCCCCGG ATGGCTGCGT GGTCAAGACC GCCGGGGTCG ACGAGTCGAT CCTGCGGTTC AACGGGCCGG CCGTGGTCCT CGAATCGCAG GAGGACGCCG TCGCAGCGAT CCTGGGCCGG CGGGTGCAGG CCGGGGACGT CGTCGTCATC CGTTACGAGG GCCCGCGCGG CGGCCCGGGG ATGCAGGAGA TGCTGTATCC GACGTCGTTC CTCAAGGGTC GTGGGTTGGG CGCCGCCTGT GCGTTGATCA CCGACGGCCG GTTCTCCGGC GGCACATCCG GCCTGTCGAT CGGGCACGTC TCGCCGGAGG CCGCCGCCGG CGGGACGATC GCACTGGTCC GCGACGGCGA CCTGATCAGC ATCGACATCC CCGCGCGTAG CGTCACCCTG GTCGTCGACG AGGCCGAGTT GGGCCGCCGG CGTGCGCTGC GGGAAGCGAC CGGGGGATAC CGGCCGGCCG CGCGGCAGCG GGCCGTCTCG ACCGCCCTGC GGGCCTATGC CGCGTTCGCC CAGTCGGCCG ACCGGGGCGC CGTCCGGGCG GTCCCGGACG AACGCCGCTG A
|
Protein sequence | MRNLRSRTVT HGRNMAGARA LLRAAGVAGS DIGKPIVAVA NSFTEFVPGH THLQPVGRIV AEAITAAGGV PREFNTIAVD DGIAMGHGGM LYSLPSRDLI ADSVEYMVNA HCADALVCIS NCDKITPGML MAALRLNIPT VFVSGGPMEG GRAVLVDGTV RTGLNLITAI ADSASASVSD EDLDRIEENA CPTCGSCSGM FTANSMNCLT EALGLALPGN GTTLATHTAR RALYEAAGAT VMDLVRSCYE LGDDGVRPRA IAGRAAFTNA MAMDIAMGGS TNTILHLLAA AHEAELDFTL ADIDRISRNT PCLAKVAPNG AYLVQDVHRA GGIPAILGEL DRAGLLHRDV RSIHSPDLRG WLDDWDVRGG RATAAAVELF HAAPGGRRSA TAFSQSERWE ALDLDSENGC IRAVSHAYST DGGLAVLRGN LAPDGCVVKT AGVDESILRF NGPAVVLESQ EDAVAAILGR RVQAGDVVVI RYEGPRGGPG MQEMLYPTSF LKGRGLGAAC ALITDGRFSG GTSGLSIGHV SPEAAAGGTI ALVRDGDLIS IDIPARSVTL VVDEAELGRR RALREATGGY RPAARQRAVS TALRAYAAFA QSADRGAVRA VPDERR
|
| |