Gene GYMC61_2159 details


Gene Information

Locus tag: GYMC61_2159
Symbol:
ID: 8526023
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacillus sp. Y412MC61
Kingdom: Bacteria
Replicon accession: NC_013411
Strand:
Start bp: 2179030
End bp: 2180685
Gene Length: 1656 bp
Protein Length: 551 aa
Translation table: 11
GC content: 59%
IMG OID:
Product: urocanate hydratase
Protein accession: YP_003253256
Protein GI: 261419574
COG category:
COG ID:
TIGRFAM ID:
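
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length.

```python
# Values copied from the Gene Information table.
START_BP = 2179030
END_BP = 2180685
GENE_LENGTH_BP = 1656
PROTEIN_LENGTH_AA = 551


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length: int) -> int:
    """Residue count for a CDS, assuming the last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Both checks pass for this record under the stated assumptions.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```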


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCAGAAA AACGGACCGT ACGCCCGTTT GCGGGAACAG AGCGGCGGGC GAAAGGATGG 
ATTCAAGAAG CGGCGTTGCG CATGTTAAAC AACAATTTGC ATCCCGATGT CGCCGAGCGG
CCGGATGAGT TGATCGTCTA CGGCGGCATC GGCAAGGCGG CGCGCAACTG GGAATGTTAC
GAGGCGATTG TGGACACCCT TCTTCGTTTA GAAAACGATG AAACGTTGCT CATTCAATCT
GGCAAGCCGG TGGCGGTGTT TCGCACGCAT CCGGACGCCC CTCGCGTGCT GATCGCCAAC
TCCAACCTCG TGCCCGCATG GGCGACGTGG GACCATTTTC ACGAACTTGA CAAAAAAGGG
TTGATCATGT ACGGACAAAT GACGGCCGGG AGCTGGATTT ACATCGGCAG CCAAGGAATC
GTCCAAGGGA CATATGAAAC GTTTGCCGAA GTGGCGCGCC AGCACTTTGG CGGCACGCTG
GCCGGGACGA TCACGCTAAC CGCCGGCCTT GGCGGCATGG GCGGGGCGCA GCCGCTCGCC
GTGACGATGA ACGGCGGCGT CTGCCTCGCC ATCGAAGTCG ATCCGGCCCG CATCCAGCGC
CGCATTGACA CGAATTACTT GGATACGATG ACCGACAGCC TAGACGCGGC GCTCGAGATG
GCGAAACAAG CGAAGGAAGA GAAAAAAGCG CTGTCGATCG GCCTTGTCGG CAATGCGGCT
GAAGTGTTGC CGCGTCTCGT CGAAACGGGC TTTGTTCCGG ATGTCTTGAC CGATCAAACG
TCCGCCCACG ATCCGTTAAA CGGCTACATC CCGGCTGGCC TTACGCTTGA TGAGGCCGCC
GAACTCAGGG CGCGCGATCC GAAGCAGTAC ATCGCCCGTG CGAAACAGTC GATCGCCGCG
CATGTTCGAG CGATGCTGGC GATGCAAAAG CAAGGGGCGG TGACGTTTGA TTACGGCAAC
AACATCCGCC AAGTGGCAAA AGACGAAGGG GTGGACGACG CCTTTTCCTT CCCAGGTTTT
GTGCCGGCCT ACATCCGTCC GCTCTTTTGC GAAGGAAAAG GGCCGTTCCG CTGGGTGGCA
TTATCCGGCG ACCCGGAAGA CATTTATAAA ACCGATGAAG TCATTTTGCG TGAATTCAGC
GACAATGAGC GTCTTTGCCA TTGGATTCGC ATGGCGCAAA AACGCATTAA GTTCCAAGGG
CTGCCGGCGC GCATTTGTTG GCTCGGCTAC GGCGAGCGGG CGAAATTTGG CAAAATCATC
AACGACATGG TGGCCAAAGG CGAGCTGAAA GCGCCGATCG TCATCGGCCG CGATCATTTG
GATTCGGGCT CCGTCGCTTC GCCGAACCGG GAGACGGAAG GAATGAAAGA CGGAAGCGAC
GCCATCGCCG ACTGGCCGAT TTTAAACGCG CTGTTGAATG CGGTTGGGGG CGCGAGCTGG
GTGTCGGTTC ACCACGGTGG CGGCGTCGGC ATGGGCTACT CGATTCACGC CGGCATGGTC
ATTGTCGCCG ACGGCACGAA AGAGGCGGAA AAACGGTTGG AACGGGTGTT GACGACCGAC
CCGGGGCTTG GTGTGGTCCG CCACGCCGAT GCCGGTTATG AGCTCGCCAT CCGGACGGCG
AAAGAAAAAG GCATTGATAT GCCGATGCTC AAGTAG
 
Protein sequence
MAEKRTVRPF AGTERRAKGW IQEAALRMLN NNLHPDVAER PDELIVYGGI GKAARNWECY 
EAIVDTLLRL ENDETLLIQS GKPVAVFRTH PDAPRVLIAN SNLVPAWATW DHFHELDKKG
LIMYGQMTAG SWIYIGSQGI VQGTYETFAE VARQHFGGTL AGTITLTAGL GGMGGAQPLA
VTMNGGVCLA IEVDPARIQR RIDTNYLDTM TDSLDAALEM AKQAKEEKKA LSIGLVGNAA
EVLPRLVETG FVPDVLTDQT SAHDPLNGYI PAGLTLDEAA ELRARDPKQY IARAKQSIAA
HVRAMLAMQK QGAVTFDYGN NIRQVAKDEG VDDAFSFPGF VPAYIRPLFC EGKGPFRWVA
LSGDPEDIYK TDEVILREFS DNERLCHWIR MAQKRIKFQG LPARICWLGY GERAKFGKII
NDMVAKGELK APIVIGRDHL DSGSVASPNR ETEGMKDGSD AIADWPILNA LLNAVGGASW
VSVHHGGGVG MGYSIHAGMV IVADGTKEAE KRLERVLTTD PGLGVVRHAD AGYELAIRTA
KEKGIDMPML K
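
The sequences above are printed in space-separated groups of ten characters, so they need to be cleaned before their length or GC content can be compared with the Gene Information values. The following is a minimal sketch of one way to do that; the helper names `clean_sequence` and `gc_percent` are hypothetical and not part of any database tool.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(block.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA sequence (whitespace ignored)."""
    seq = clean_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, copied from this record.
first_line = "ATGGCAGAAA AACGGACCGT ACGCCCGTTT GCGGGAACAG AGCGGCGGGC GAAAGGATGG"
print(len(clean_sequence(first_line)), round(gc_percent(first_line), 1))
```

Applied to the full gene and protein blocks, the cleaned lengths would be expected to match the listed 1656 bp and 551 aa, and the rounded GC percentage could be compared with the listed 59%.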