Gene GYMC61_3107 details


Gene Information

Locus tag: GYMC61_3107
Symbol:
ID: 8526992
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacillus sp. Y412MC61
Kingdom: Bacteria
Replicon accession: NC_013411
Strand:
Start bp: 3156770
End bp: 3158281
Gene Length: 1512 bp
Protein Length: 503 aa
Translation table: 11
GC content: 58%
IMG OID:
Product: 5-carboxymethyl-2-hydroxymuconate semialdehyde dehydrogenase
Protein accession: YP_003254148
Protein GI: 261420466
COG category:
COG ID:
TIGRFAM ID:
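
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch in Python, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon (neither is stated in the record itself); the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 3156770
end_bp = 3158281

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1512, matching "Gene Length"
print(protein_length)  # 503, matching "Protein Length"
```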


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCAGTGC AACCAAAAGT GCTTCATTAT ATCAATGGGC AGTTGATGGA AGGGGCAGCC 
GGTGCGTATT TTGACAACAT CAATCCGTTT ACGAACGAAC GGATCAACGA AGTGGCCGAA
GGGCGGAAAG AAGACATCGA CGCAGCGGTC CGGGCGGCAA AGGAGGCGTT TGATCACGGG
CCGTGGCGGA CAATGCCGGT CGAACGGCGT CTTCGTTACC TTTTCCGCAT TGCTGACTTG
ATTGAGCAGT ATGCGGACGA CATCGCCTAT TTAGAAGCGC TTGACACCGG CATTCCGATC
AGCCAGGCGA AAAAGCAAGC CGCCCGCGCG GCGGAAAACT TCCGCTTTTA CGCGGAAATG
GTGAAGACGC GCCTTGTCGG CGAGGCGTAC CATGTGAATG GACAGTTTTT AAACTATACC
GTTTATAAAC CAGTCGGCGT CGCCGGGCTC ATCACGCCGT GGAATACGCC ATTTATGCTG
GAAACGTGGA AGGTGGCTCC CGCGCTGGCG ACCGGCAACA CCGTCGTCTT GAAGCCGGCC
GAATGGTCGC CGTTGACGGC GAATAAACTG GCGGAAATCA TCGATGAAGC CGGGCTGCCT
CGCGGGGTGT TCAACGTCGT GCACGGGTTT GGCGAAACGG CGGGCGCCGC ATTGGTTGCC
CACCCGGATG TGCGCCTCAT CTCGTTTACC GGCGAGACGA CGACCGGCAT GGAAATCATC
CGCAACAGCG CTGCGACGTT GAAAAAAACA TCGATGGAGC TCGGCGGCAA GTCGCCGCTC
ATTGTGTTCG CCGATGCGGA TCTCGAACGG GCGCTCGATG CGGCGGTTTG GGGCGTGTTT
TCGCTCAATG GCGAACGGTG CACGGCCAAC TCGCGGCTTT TGCTTGAACA GTCGATTTAC
GACGAATTTG TCGCCCGGCT CAAAGAGCGC GTCGACCGCA TCGTCATCGG CGACCCGATG
AACCCGGCGA CTGAACTCGG TCCGCTCATT CACCGCGATC ATTGGGAGAG GGTGAACCGC
TATATTGACA TCGCCAAGCA AGAAGGGGCG GACGTCTATG CCCCCAGCGT TCCAACAGGA
TTGGAAAAAG GCAATTTTGT GCCGCCAACG TTGCTGCTTG GTTGCCATAA CGGCATGAGG
GTGGCGCAGG AAGAGATTTT CGGACCGGTC ATGGCGGTCA TGTCCTTTGC GGATGAAGAA
GAGGCGATAC GGCTGGCGAA CGATGTGAAA TACGGGCTGG CGGCATACGT CTGGACGAAC
GATATGAAGC GCGGCCACCG CGTCGCCCAA GCGATCGAAA GCGGGATGGC GTGGGTCAAC
TCGCCGAACG TCCGCGATTT GCGCATCCCG TTTGGCGGGA CGAAATACAG CGGCATCGGC
CGCGAAGGCG GGCATTACAG CTTTGATTTC TATACGGAAG TGCAAGTCGT CCACGTCGCC
GTCGGCGATC CGCCGATCCC CGCGTTCGGC AAGGGGGAGA AACCGACCGC CTTGTCTGCC
GAACAGGCAT AA
 
Protein sequence
MAVQPKVLHY INGQLMEGAA GAYFDNINPF TNERINEVAE GRKEDIDAAV RAAKEAFDHG 
PWRTMPVERR LRYLFRIADL IEQYADDIAY LEALDTGIPI SQAKKQAARA AENFRFYAEM
VKTRLVGEAY HVNGQFLNYT VYKPVGVAGL ITPWNTPFML ETWKVAPALA TGNTVVLKPA
EWSPLTANKL AEIIDEAGLP RGVFNVVHGF GETAGAALVA HPDVRLISFT GETTTGMEII
RNSAATLKKT SMELGGKSPL IVFADADLER ALDAAVWGVF SLNGERCTAN SRLLLEQSIY
DEFVARLKER VDRIVIGDPM NPATELGPLI HRDHWERVNR YIDIAKQEGA DVYAPSVPTG
LEKGNFVPPT LLLGCHNGMR VAQEEIFGPV MAVMSFADEE EAIRLANDVK YGLAAYVWTN
DMKRGHRVAQ AIESGMAWVN SPNVRDLRIP FGGTKYSGIG REGGHYSFDF YTEVQVVHVA
VGDPPIPAFG KGEKPTALSA EQA
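
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. This is a hedged example, not the database's own pipeline: it assumes the standard codon assignments (which translation table 11 shares, differing only in its permitted start codons), it does not handle alternative start codons such as GTG or TTG, and the helper names `clean`, `gc_content`, and `translate` are hypothetical. Only the first and last blocks of the gene sequence are used to keep the example short.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last blocks of the gene sequence listed above.
first_block = "ATGGCAGTGC AACCAAAAGT GCTTCATTAT ATCAATGGGC AGTTGATGGA AGGGGCAGCC"
last_block = "GAACAGGCAT AA"

print(translate(first_block))  # MAVQPKVLHYINGQLMEGAA
print(translate(last_block))   # EQA
```

The two outputs correspond to the first 20 and last 3 residues of the protein sequence above. Passing the full gene sequence to `gc_content` would give a value to compare against the GC content field of the record.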