Gene GYMC61_3481 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGYMC61_3481 
Symbol 
ID8527369 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. Y412MC61 
KingdomBacteria 
Replicon accessionNC_013411 
Strand
Start bp3539654 
End bp3541066 
Gene Length1413 bp 
Protein Length470 aa 
Translation table11 
GC content50% 
IMG OID 
Productglycoside hydrolase family 1 
Protein accessionYP_003254511 
Protein GI261420829 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGTATA CTCAATTAAA ACCGTTTCCA ACGGGGTTTT TATGGGGCGG TTCGACGTCT 
GCTTACCAAG TCGAAGGCGC ATGGAACGAA GACGGAAAAG GGCCGTCGGT CATCGATATG
GCCAAACATC CGGAAGGAAC GACCGATTTC AAAGTCGCCA GCGACCATTA TCACCGGTAT
CAAGAAGATA TCGCTTTGCT CGCAGAAATG GGGTTTAAAG CGTATCGCTT TTCCATCGCT
TGGACGCGCA TTTATCCGAA CGGCGAAGGG GAAGTGAACC CAAAAGGATT GGAATTTTAC
AACAACTTGA TTAATGAGAT TGTCCGCCAT GGCATCGAAC CGATCGTGAC GATCTATCAT
TTCGATTTGC CGTACGCCTT GCAAACGAAA GGCGGATGGT CGAACCGTGC GACTATCGAT
GCGTTTGTCA ACTACTGCCG GACGCTGTTT GAACATTTTG GCGACCGTGT AAAGTATTGG
TTGACCATTA ATGAGCAAAA TATGATGATC CTTCACGGGG AAGCCATTGG CATTGTCGAT
CCCGACAGCG AAAACCCGAA AAAAGAGCTA TACCAGCAAA ACCACCATAT GTTTGTCGCC
CAAGCCAAAG CGATGGCGCT TTGCCACGAA ATGCTTCCTG ATGCAAAAAT CGGGCCGGCG
CCGAATATTG CGACGATTTA TCCGGCGAGC TCCAAGCCGG AAGATGTGCT CGCCGCCAAC
ACGTATTCAG CGATTCGCAA CTGGTTGTAC TTAGATATGG CCGTCTACGG CCGCTACAAT
CCGACAGCGT GGGCGTATTT AGAAGAAAAA GGCTATACCC CAACGATTGC AGACGGAGAT
ATGGACATCT TGCAAAACGC GAAACCGGAT TTCATCGCTT TTAACTACTA TACGTCACAA
ACAGTAGCCG CCAGCGTGGG GAATGAGAGC GATATCGGCC ATACGGGAGA CCAACATATT
ACAATTGGCG AACCGGGCGT ATACAAAGGC GCATCCAACC CGAACTTGCC GAAAAACGAC
TTCGGCTGGG AAATTGACCC GATCGGCTTC CGAACAACGC TTCGGGAAAT TTATGAGCGC
TACCGGTTGC CGCTCATCGT AACCGAAAAC GGGTTAGGAG CTTACGATCG ATTAGAAGAA
GGGGATATCG TGAACGACAC ATACCGGATC GACTTTTTGC GCAACCATAT TGAACAAATG
CGCCTCGCCA TCACGGACGG CGTCGACGTG TTCGGCTACT GCCCGTGGTC GGCGATCGAC
TTAGTCAGCA CCCACCAAGG CATCAGCAAA CGATACGGGT TCATTTACGT CAACCGCGAC
GAATTTGATT TGAAAGATTT GCGCCGTATC CGCAAACAAA GCTTTTATTG GTACCAACGG
GTCATCTCCT CGAACGGCGA ACAGCTCGAC TAA
 
Protein sequence
MKYTQLKPFP TGFLWGGSTS AYQVEGAWNE DGKGPSVIDM AKHPEGTTDF KVASDHYHRY 
QEDIALLAEM GFKAYRFSIA WTRIYPNGEG EVNPKGLEFY NNLINEIVRH GIEPIVTIYH
FDLPYALQTK GGWSNRATID AFVNYCRTLF EHFGDRVKYW LTINEQNMMI LHGEAIGIVD
PDSENPKKEL YQQNHHMFVA QAKAMALCHE MLPDAKIGPA PNIATIYPAS SKPEDVLAAN
TYSAIRNWLY LDMAVYGRYN PTAWAYLEEK GYTPTIADGD MDILQNAKPD FIAFNYYTSQ
TVAASVGNES DIGHTGDQHI TIGEPGVYKG ASNPNLPKND FGWEIDPIGF RTTLREIYER
YRLPLIVTEN GLGAYDRLEE GDIVNDTYRI DFLRNHIEQM RLAITDGVDV FGYCPWSAID
LVSTHQGISK RYGFIYVNRD EFDLKDLRRI RKQSFYWYQR VISSNGEQLD