Gene GYMC61_3040 details


Gene Information

Locus tag: GYMC61_3040
Symbol:
ID: 8526925
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacillus sp. Y412MC61
Kingdom: Bacteria
Replicon accession: NC_013411
Strand:
Start bp: 3097057
End bp: 3098118
Gene Length: 1062 bp
Protein Length: 353 aa
Translation table: 11
GC content: 62%
IMG OID:
Product: threonine synthase
Protein accession: YP_003254082
Protein GI: 261420400
COG category:
COG ID:
TIGRFAM ID:
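
The length fields in this record are consistent with the listed coordinates. A minimal Python sketch of that arithmetic follows. It assumes 1-based, inclusive coordinates and a CDS that ends in a single stop codon; the record states neither convention, and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(3097057, 3098118)
print(length)                     # 1062
print(protein_length_aa(length))  # 353
```

Both values match the Gene Length (1062 bp) and Protein Length (353 aa) fields of the record.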


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCATGGA AAGGGCTGCT TGACGCCTAC GGCGAGTTTT TGCCGCTCTC TGCAGCGACG 
CCGAGGCTGT CGCTTTGCGA AGGGAATACG CCGCTCATCC CGCTGCCGCG CTTGTCGGAG
GAGCTCGGCA TTGCGCTCTA TGTCAAGGTC GAAGGGGCGA ACCCGACCGG TTCGTTTAAA
GACCGCGGCA TGGTGATGGC GGTGGCCAAG GCGAAAGAGG AAGGAAGCCA TACGATCATC
TGCGCCTCAA CCGGCAATAC ATCGGCTTCA GCGGCCGCCT ATGCGGCGCG CGCCGGCATG
CGCTGCCTCG TCGTCGTTCC GAACGGCAAG ATCGCTCTCG GCAAGTTGGC GCAAGCCGCC
ATGTACGGCG CGGAAATTTT TGCGATTGAC GGCAACTTTG ACGAGGCGCT GAAGATGGTG
CGCCGGTTGA GCGAAACGGC GCCGATTACG CTCGTCAACT CGGTCAATCC GTACCGGATC
GAAGGGCAAA AAACGGCGGC GTTTGAAGTG TGCGACCAGC TTGGGCGCGC CCCGGACGTG
CTCGCCATTC CAGTCGGCAA CGCCGGCAAC ATCACCGCCT ATTGGAAAGG GTTTAAGGAG
TACCATGAAG CGAAAGGAAC GGGCTTGCCG CAAATGCGCG GCTTTGAAGC GGAAGGAGCG
GCGGCGATCG TCCGCAACCG GGTGATTGAA CAGCCGGAGA CGGTGGCGAC CGCGATCCGC
ATCGGCAATC CGGCGAGCTG GGACAAAGCG GTCGAGGCGG CGAGCGAGTC GCGCGGGAAA
ATTGATGAGG TGAGCGACGC AGAAATTTTG GCCGCCTACA AGCGGCTCGC CCGGACGGAA
GGCATTTTTG CCGAACCGGC GTCATGCGCG GCGATCGCCG GGGTGATCAA GCAGCGCGAA
CGGAACGAAA TCGAACGCGG CAGCCTCGTC GTGGCGGTTC TCACTGGCAA TGGATTGAAA
GACCCGGCCA TCGCCTTGGA GACCGCGGCG ATCGAACCGA TCGTGCTGCC GAACGATGAA
CAAGTTGTCT TGGAGCATTT GCAAGGGGTT GTCCGGACAT GA
 
Protein sequence
MAWKGLLDAY GEFLPLSAAT PRLSLCEGNT PLIPLPRLSE ELGIALYVKV EGANPTGSFK 
DRGMVMAVAK AKEEGSHTII CASTGNTSAS AAAYAARAGM RCLVVVPNGK IALGKLAQAA
MYGAEIFAID GNFDEALKMV RRLSETAPIT LVNSVNPYRI EGQKTAAFEV CDQLGRAPDV
LAIPVGNAGN ITAYWKGFKE YHEAKGTGLP QMRGFEAEGA AAIVRNRVIE QPETVATAIR
IGNPASWDKA VEAASESRGK IDEVSDAEIL AAYKRLARTE GIFAEPASCA AIAGVIKQRE
RNEIERGSLV VAVLTGNGLK DPAIALETAA IEPIVLPNDE QVVLEHLQGV VRT
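
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any computation. The sketch below shows one way to do that and to compute a GC fraction in Python. The helper names are hypothetical, and the example uses only the first printed line of the gene sequence, so its result is not the whole-gene GC content reported in the record.

```python
def normalize_sequence(text: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one string."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string; 0.0 for an empty string."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence, copied as printed in the record.
first_line = "ATGGCATGGA AAGGGCTGCT TGACGCCTAC GGCGAGTTTT TGCCGCTCTC TGCAGCGACG"
dna = normalize_sequence(first_line)
print(len(dna))                    # 60
print(round(gc_fraction(dna), 2))  # 0.6
```

Passing the full gene sequence through the same two helpers would give a value to compare against the GC content field.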