Gene GYMC61_3041 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGYMC61_3041 
Symbol 
ID8526926 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. Y412MC61 
KingdomBacteria 
Replicon accessionNC_013411 
Strand
Start bp3098118 
End bp3099416 
Gene Length1299 bp 
Protein Length432 aa 
Translation table11 
GC content55% 
IMG OID 
Producthomoserine dehydrogenase 
Protein accessionYP_003254083 
Protein GI261420401 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAAAAC CAATTTTGGT CGGATTGTTA GGATTAGGAA CGGTCGGGAG CGGCGTGGTC 
AAAATTATTG AAAACCACCA AGAAAAATTG ATGCATCAGG TTGGCTGCCC GGTGAAGGTG
AAAAAAATCC TTGTCCGGGA TGTACAGAAA CCGCGTGATG TCGCCGTTGA CCCGTCGCTC
CTTACGACGA GTGCGGCTGA GGTGATTGAC GATCCGGACA TTGATGTCAT CATCGAAGTG
ATGGGCGGCA TTGAGGAGAC AAAAGAGCTG CTATTGCGGG CGCTGCGCCA AGGGAAGCAT
GTCGTGACCG CCAATAAAGA CTTAATGGCC GTCTACGGGT CGGAGCTGCT TCGGGTGGCG
GCGGAATACC GCTGCGATTT GTTTTATGAA GCGAGCGTCG CCGGCGGCAT TCCGATTTTG
CGCAGCTTGG TCGACGGCTT GGCGTCGGAC CGGATTACGA AGCTCATGGG CATTGTGAAT
GGGACGACGA ACTACATTTT GACGAAAATG TCGCAAAACG GCGCTTCCTA TGAGGACGTG
CTCGCCGAAG CGCAGGCGCT CGGGTTTGCG GAAGCCGATC CGACGTCAGA CGTCGAAGGG
CTGGATGCGG CGCGGAAAAT GGCGATTTTG GCCCGCCTTG GCTTTTCAAT GGACATCGAC
TTGGACGATG TGCAAGTGAA AGGCATCACC CAAGTGACGG AGGAAGACTT GAACTACGGG
AAGCGGCTCG GCTACACGAT GAAATTGATC GGCATCGCCC AGCGCGACGG GCAGAAGGTC
GAGGTGAGCG TCCAGCCGAC GTTTTTGCCG GATTCGCATC CGTTGGCGTC CGTGCACAAC
GAATACAATG CGGTGTACGT ATACGGCGAA GCGGTCGGAG AGACGATGTT TTACGGGCCG
GGGGCCGGGA GCTTGCCGAC GGCGACGGCG GTTGTCTCCG ACTTGGTCGC GGTGATGAAA
AATATGCGCC TTGGCGTCAA CGGCCGCTAT GCCGTCGCGC CGCAATATGA AAAGCAGTTG
AAGACGCCGG CGGAAATTTT CTCGAAATAC TTTTTGCGCA TTCACGTCAA AGACCAGGTC
GGCGCGTTTG CCAAAATTAC GACGCTGTTT TCGCAGCGCG GGGTGAGCTT TGAGAAAATT
TTGCAATTGC CGCTGAAAGA GGATGGCCTA GCGGAAATCG TCATCGTCAC GCATGACGCC
TCGCAGCAAG ACTACGAAGA CATTTTGCAG CAGCTCGGCG ATTTGGAAAT CGTCGAACGG
GTGCAAAGCT CGTATCGAGT GGAAGGAGAG AAACGGTAA
 
Protein sequence
MEKPILVGLL GLGTVGSGVV KIIENHQEKL MHQVGCPVKV KKILVRDVQK PRDVAVDPSL 
LTTSAAEVID DPDIDVIIEV MGGIEETKEL LLRALRQGKH VVTANKDLMA VYGSELLRVA
AEYRCDLFYE ASVAGGIPIL RSLVDGLASD RITKLMGIVN GTTNYILTKM SQNGASYEDV
LAEAQALGFA EADPTSDVEG LDAARKMAIL ARLGFSMDID LDDVQVKGIT QVTEEDLNYG
KRLGYTMKLI GIAQRDGQKV EVSVQPTFLP DSHPLASVHN EYNAVYVYGE AVGETMFYGP
GAGSLPTATA VVSDLVAVMK NMRLGVNGRY AVAPQYEKQL KTPAEIFSKY FLRIHVKDQV
GAFAKITTLF SQRGVSFEKI LQLPLKEDGL AEIVIVTHDA SQQDYEDILQ QLGDLEIVER
VQSSYRVEGE KR