Gene TM1040_2486 details


Gene Information

Locus tag: TM1040_2486
Symbol:
ID: 4076851
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 2625094
End bp: 2626938
Gene Length: 1845 bp
Protein Length: 614 aa
Translation table: 11
GC content: 61%
IMG OID: 638007810
Product: dihydroxy-acid dehydratase
Protein accession: YP_614480
Protein GI: 99082326
COG category: [E] Amino acid transport and metabolism; [G] Carbohydrate transport and metabolism
COG ID: [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase
TIGRFAM ID: [TIGR00110] dihydroxy-acid dehydratase
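
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes that the start and end coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. All variable names are hypothetical.

```python
# Values copied from the record above.
start_bp = 2625094
end_bp = 2626938
gene_length_bp = 1845
protein_length_aa = 614

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
total_codons = span_bp // 3
coding_codons = total_codons - 1

print(span_bp, total_codons, coding_codons)
```

Under these assumptions the span equals the listed gene length, the length is a multiple of three, and the number of coding codons equals the listed protein length.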


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.274808
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.116227
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGATGT ACCGATCAAG AACCTCCACC CACGGTCGCA ACATGGCCGG CGCTCGTGGC 
CTCTGGCGCG CCACTGGCAT GACCGAAAGC GACTTCGGCA AGCCGATCAT CGCCATCGTG
AACTCCTTCA CCCAATTCGT CCCCGGCCAC GTTCACCTCA AGGATCTGGG GCAGATGGTG
GCCCGTGAGG TTGAGTCCGC TGGCGGTGTG GCCAAGGAGT TCAACACCAT CGCTGTGGAC
GACGGCATCG CCATGGGCCA CGATGGCATG CTCTATTCGC TCCCCTCGCG CGAGGTGATC
GCGGATTCGG TTGAGTATAT GGTCAACGCC CATTGCGCCG ACGCGATGGT CTGTATCTCC
AACTGCGACA AGATCACCCC CGGCATGATG ATGGCCGCAA TGCGCCTCAA CATCCCGGTG
ATCTTTGTGT CCGGTGGCCC GATGGAAGCG GGCAAGATCG ACATCGACTC CCTTGATGCC
AAGAAAATCG ACCTCGTGGA CGCAATGGTT GCCGCCGCAA GCGACAAGCT GACCGACGAG
GAAGTCCAGC ACATCGAGGA AAACGCCTGC CCCACATGTG GCTCCTGCTC TGGCATGTTC
ACCGCAAACT CGATGAACTG TCTGGCCGAA GCCCTTGGGC TTGCGCTGCC CGGCAATGGC
TCCACCCTGG CCACCCACTC GGATCGCAAG CAATTGTTCC TCGAGGCGGG GCGCAAGATC
GTCGAGATCA CCAAGCGCCA CTATGAGCAG AACGAAAAAG GCCTGCTGCC GCGCGAAATC
GCCACCTTTG AGGCGTTTGA GAACGCCATG AGCCTTGATA TTGCCATGGG CGGCTCCACC
AATACCGTTT TGCACCTCCT CGCCATCGCG CATGAGGGCA AGGTAGACTT CACCATGCAA
GACATCGACC GGCTGAGCCG CAAGGTGCCC TGCCTGTGTA AAGTCGCGCC CAATATCGAG
AACGTCCATA TGGAGGACGT TCACCGCGCC GGTGGGATCT TCTCGATCCT CGGGGAGCTG
TCCCGTGCAG GCCTCCTGCA CAATGACGTC CCGACGGTCC ACAGCGACAG CATGGGTGAA
GCCATCGCCC ATTGGGACAT CGCTGTGGCG GACAACCAAG CGGCGAAGGA CCTCTTCAAG
GCCGCTCCCG GCGGAGTGCG TACGACGCAG GCCTTCAGCC AGTCCAACCG CTTCAAGGAT
CTCGACATCG ACCGCAAGGG CGGCGTTATC CGCTCCAAGG AACATGCCTT CAGCCAGGAC
GGCGGCCTCG CGGTGCTCTT TGGGAATATC GCACTGGATG GCTGCATCGT GAAAACGGCG
GGCGTGGACG CCTCGATCCT GAAGTTCACC GGCAAAGCCT ATGTCTGCGA GAGCCAGGAC
CAGGCCGTCA ATGATATCCT CACCGGCAAA GTTCAGGCGG GCGATGTAGT TGTCATCCGC
TATGAGGGGC CGCGCGGGGG ACCGGGCATG CAGGAAATGC TCTACCCAAC GTCCTACCTC
AAATCCAAGG GGCTCGGAAA AGACTGTGCT CTGTTGACCG ATGGGCGTTT CTCCGGCGGC
ACGTCGGGGC TTTCCATTGG CCACGTCTCG CCCGAAGCCG CCGAAGGTGG CACCATCGGC
CTCGTGGAGC ATGGCGACAC CATCGAGATC GATATCCCCA ACCGTTCCAT CCATCTGGCG
GTGGATGAGG CCACCCTTGC AGCGCGCCGC GCCGCGCAGG ATGAGAAAGG CTGGAAACCT
GTTGCCAAGC GTAAACGCAA GGTATCAACC GCGCTCAAGG CTTACGCCTT GCTGGCCACA
TCTGCAGCCA AAGGCGCTGT GCGCCAACTG CCAGACGACG AGTAA
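
The GC content reported in the gene information can in principle be recomputed from the sequence as printed above. The following is a minimal sketch, assuming GC content is defined as the fraction of G and C bases over all bases. The function name `gc_fraction` is hypothetical, and the whitespace handling is there only because the sequence is shown in space-separated blocks of ten.

```python
def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in a DNA string.

    Whitespace (spaces and line breaks between the 10-base blocks) is ignored.
    """
    bases = "".join(dna.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# Example on the first block row of the gene sequence; paste the full
# sequence to compare against the GC content listed in the record.
first_row = "ATGCCGATGT ACCGATCAAG AACCTCCACC CACGGTCGCA ACATGGCCGG CGCTCGTGGC"
print(f"{gc_fraction(first_row):.2%}")
```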
 
Protein sequence
MPMYRSRTST HGRNMAGARG LWRATGMTES DFGKPIIAIV NSFTQFVPGH VHLKDLGQMV 
AREVESAGGV AKEFNTIAVD DGIAMGHDGM LYSLPSREVI ADSVEYMVNA HCADAMVCIS
NCDKITPGMM MAAMRLNIPV IFVSGGPMEA GKIDIDSLDA KKIDLVDAMV AAASDKLTDE
EVQHIEENAC PTCGSCSGMF TANSMNCLAE ALGLALPGNG STLATHSDRK QLFLEAGRKI
VEITKRHYEQ NEKGLLPREI ATFEAFENAM SLDIAMGGST NTVLHLLAIA HEGKVDFTMQ
DIDRLSRKVP CLCKVAPNIE NVHMEDVHRA GGIFSILGEL SRAGLLHNDV PTVHSDSMGE
AIAHWDIAVA DNQAAKDLFK AAPGGVRTTQ AFSQSNRFKD LDIDRKGGVI RSKEHAFSQD
GGLAVLFGNI ALDGCIVKTA GVDASILKFT GKAYVCESQD QAVNDILTGK VQAGDVVVIR
YEGPRGGPGM QEMLYPTSYL KSKGLGKDCA LLTDGRFSGG TSGLSIGHVS PEAAEGGTIG
LVEHGDTIEI DIPNRSIHLA VDEATLAARR AAQDEKGWKP VAKRKRKVST ALKAYALLAT
SAAKGAVRQL PDDE
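
The relationship between the gene sequence and the protein sequence can be illustrated by translating a few codons. This is a sketch rather than the tool the database used. It assumes the codon-to-amino-acid assignments of NCBI translation table 11, which match the standard code, and it ignores table 11's alternative start codons because this gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; first base varies slowest).
# Assumption: amino-acid assignments of translation table 11 equal the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 0, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: not part of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 15 bases of the gene sequence shown above.
first_bases = "ATGCCGATGT ACCGATCAAG AACCTCCACC"
last_bases = "CCAGACGACG AGTAA"

print(translate(first_bases))  # MPMYRSRTST
print(translate(last_bases))   # PDDE
```

The two outputs match the first ten and last four residues of the protein sequence, and the final TAA codon is the stop.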