Gene GYMC61_2871 details


Gene Information

Locus tag: GYMC61_2871
Symbol:
ID: 8526748
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacillus sp. Y412MC61
Kingdom: Bacteria
Replicon accession: NC_013411
Strand:
Start bp: 2930438
End bp: 2932117
Gene Length: 1680 bp
Protein Length: 559 aa
Translation table: 11
GC content: 57%
IMG OID:
Product: dihydroxy-acid dehydratase
Protein accession: YP_003253929
Protein GI: 261420247
COG category:
COG ID:
TIGRFAM ID:
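
The start, end, gene length, and protein length fields above are mutually consistent, and the relationship can be checked with a small sketch. It assumes the coordinates are 1-based and inclusive (which matches the stated 1680 bp) and that the protein length excludes the stop codon. The function names are illustrative and not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(2930438, 2932117)
print(length)                     # 1680
print(protein_length_aa(length))  # 559
```

The subtraction of one codon applies only because the record lists the gene as an unspliced, non-pseudo CDS whose sequence ends in a stop codon.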


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAAGC GGCGCAGCGA CATGATCAAA AAAGGATTCG ACCGCGCCCC GCACCGGAGT 
TTGTTGCGGG CGGCCGGCGT GAAAGAGGAG GATTTTGACA AACCGTTCAT CGCGGTTGTC
AACTCGTACA TCGACATTAT TCCGGGGCAC GTTCACTTGC AAGAGTTTGG GAGAATCGTG
AAAGAAGCGA TTCGCGAAGC AGGCGGCGTG CCGTTTGAAA TGAACACGAT CGGCGTTGAT
GACGGCATCG CCATGGGGCA TATCGGGATG CGCTATTCGC TTCCGAGCCG GGAAATCATC
GCTGATTCGA TCGAAACGGT CATCTCGGCG CACTGGTTTG ACGGCATGGT ATGCATTCCG
AACTGCGACA AAATTACGCC AGGGATGATG ATGGCGGCGA TGCGGCTCAA CATCCCGACG
ATTTTCGTCA GCGGCGGGCC GATGAAAGCT GGTGTGACGA AAGACGGGCG GAAAATTTCG
CTCTCGTCCG TGTTTGAAGG GGTCGGGGCG TATTTAGGCG GAACGCTCGA TGAGAAAGGG
CTCGAAGAAC TCGAACGCTA CGGCTGTCCG ACGTGCGGAT CGTGTTCGGG CATGTTTACG
GCCAACTCCA TGAACTGTCT CGCTGAAGCG CTCGGGCTCG CTTTGCCAGG CAACGGCACC
ATTTTGGCGG TTGACCCAGC GCGCAAAGAG CTTGTCCGCC AATCGGCAAA GCAGCTGATG
TATTTGATCG AACATGACAT CAAACCGCGC GACATCGTAA CGGAAAAAGC GATCGACAAC
GCGTTCGCGC TCGATATGGC GCTCGGCGGC TCGACGAATA CGGTGCTGCA TACGCTTGCG
ATCGCCAACG AAGCCGGCAT CGACTACTCG CTTGAGCGCA TCAACGAAAT CGCCGCGCGG
GTGCCGCATT TGGCCAAACT CGCGCCGGCG TCGGATGTGC ATTACATTGA AGACTTGCAC
GAAGCCGGCG GCGTCTCGGC GGTGTTGAAC GAGCTGGCGA AAAAAGAAGG CACGCTTCAT
TTAGATACGC TGACCGTCAC CGGCAAAACA CTCGGTGAAA ACATCGCCGG CTGCGAAGTG
AAAAACTACG ATGTCATCCG CCCGATTGAC AACCCGTATT CGGAAACGGG CGGGCTCGCC
ATTTTGTTCG GCAACTTAGC GCCGGACGGC GCCGTCATCA AAACCGGCGC GGTCCAAGGC
GGCATCACGC GCCATGAAGG TCCGGCGATC GTGTTTGATT CGCAGGAAGA GGCGCTTGAA
GGCATCGCCA GCGGCAAAAT CAAGCCGGGT CATGTCGTCG TCATCCGCTA CGAAGGACCA
AAAGGCGGCC CAGGGATGCC GGAAATGCTT GCGCCAACGT CGCAAATCGT CGGCATGGGG
CTCGGTACGA AGGTAGCGCT TGTCACCGAT GGCCGCTTTT CCGGCGCCTC ACGCGGCTTG
TCCGTCGGCC ACGTTTCACC GGAAGCGGCG GAAGGCGGAC CGATTGCTTT CATCCAAGAC
GGCGATATCA TCGAGATCGA TACGGTGAAA CGAACGATCA ACGTCAAGCT GTCCGATGAA
GAGCTCGAAC GCCGGAAAGC GAACTGGAAA GGCTTTGAAC CAAAAGTGAA AACCGGGTAT
CTCGCCCGCT ACTCGAAACA CGTCACATCG GCGAGCACGG GGGGGATTAT GAAGATTTAG
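
The GC content reported for this gene can in principle be recomputed from the sequence above. The following is a minimal sketch, assuming the sequence is pasted as plain text with the space-separated blocks of ten bases used here; `gc_percent` is a hypothetical helper name, and the sample string is only the first three blocks of the gene, not the whole sequence.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First three blocks of the gene sequence, as an illustration only.
sample = "ATGAAAAAGC GGCGCAGCGA CATGATCAAA"
print(round(gc_percent(sample), 1))
```

Passing the full gene sequence instead of `sample` gives the value to compare against the rounded percentage in the Gene Information section.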
 
Protein sequence
MKKRRSDMIK KGFDRAPHRS LLRAAGVKEE DFDKPFIAVV NSYIDIIPGH VHLQEFGRIV 
KEAIREAGGV PFEMNTIGVD DGIAMGHIGM RYSLPSREII ADSIETVISA HWFDGMVCIP
NCDKITPGMM MAAMRLNIPT IFVSGGPMKA GVTKDGRKIS LSSVFEGVGA YLGGTLDEKG
LEELERYGCP TCGSCSGMFT ANSMNCLAEA LGLALPGNGT ILAVDPARKE LVRQSAKQLM
YLIEHDIKPR DIVTEKAIDN AFALDMALGG STNTVLHTLA IANEAGIDYS LERINEIAAR
VPHLAKLAPA SDVHYIEDLH EAGGVSAVLN ELAKKEGTLH LDTLTVTGKT LGENIAGCEV
KNYDVIRPID NPYSETGGLA ILFGNLAPDG AVIKTGAVQG GITRHEGPAI VFDSQEEALE
GIASGKIKPG HVVVIRYEGP KGGPGMPEML APTSQIVGMG LGTKVALVTD GRFSGASRGL
SVGHVSPEAA EGGPIAFIQD GDIIEIDTVK RTINVKLSDE ELERRKANWK GFEPKVKTGY
LARYSKHVTS ASTGGIMKI
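
The protein sequence corresponds to a translation of the gene sequence under translation table 11. A minimal sketch of that translation follows, assuming the standard amino-acid assignments (which table 11 shares with the standard code; the two differ in permitted start codons, which does not matter here because the gene starts with ATG). `translate` and `CODON_TABLE` are illustrative names, and only the first three blocks of the gene are used as input.

```python
from itertools import product

# Codon order follows the conventional T, C, A, G layout of the genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGAAAAAGC GGCGCAGCGA CATGATCAAA"))  # MKKRRSDMIK
```

The output matches the first block of the protein sequence; running the same function on the full gene sequence is one way to check the remaining blocks.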