Gene GYMC61_2553 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGYMC61_2553 
Symbol 
ID8526421 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. Y412MC61 
KingdomBacteria 
Replicon accessionNC_013411 
Strand
Start bp2587190 
End bp2588881 
Gene Length1692 bp 
Protein Length563 aa 
Translation table11 
GC content53% 
IMG OID 
Productalpha,alpha-phosphotrehalase 
Protein accessionYP_003253628 
Protein GI261419946 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCAACAA CCCCTTGGTG GAAAAAAGCG GTCGTTTATC AAATTTATCC GAAAAGCTTC 
TACGACACGA ATGGAGACGG CATCGGCGAT TTGCCGGGCG TGATCGAGAA GCTCGATTAT
CTACAAGAAC TGGGCGTCGA TGTCATTTGG CTGACGCCGA TCTACGCGTC GCCGCAGCGC
GACAACGGTT ATGACATCAG CGATTATTTC CGCATCCATC ACGAATACGG AACGATGGGC
GATTTTGACC GTCTGCTTCA CGAAGTGCAT GCGCGCGGCA TGAAGCTGGT GATGGACATG
GTCGTCAACC ATACCTCAAC CGACCATGAA TGGTTTAAGC AGGCGCGCTC CTCGAAAACG
AACCCATACC GCTCGTTTTA CATTTGGCGC GACCCGAAGC CGGACGGCGG AGCGCCGAAC
AACTGGCAAT CGAAATTCGG CGGCTCGGCG TGGGAATACG ACGAACAAAC CGGCCAATAT
TACTTGCATT TGTTCGATGT GACACAAGCG GACTTGGACT GGGAAAACGA AGAGCTGCGC
CGCCGCATTT ATGACATGAT GCATTTTTGG CTCCAAAAAG GAGTGGACGG CTTTCGGCTT
GACGTCATCA ACTTGTTGTC AAAAGATCAG CGCTTCCCGG ATGACGACGG TTCGGTGCCG
CCGGGGGACG GGCGCCGCTT TTATACGGAC GGACCGCGCA TTCATGAGTT TTTGCAGGAA
ATGAATCGAG AGGTGTTTTC AAAATACGAC ATCATGACAG TGGGGGAAAT GTCGTCGACG
ACGATCGATC ATTGCATCCG GTATACGAAC CCGGAAAACC ACGAGCTCAA TATGACGTTT
AACTTCCATC ACTTGAAGGT CGATTACCCG AATGGGGAAA AATGGGCGGT CTCCCCGTTT
GACTTTCTAG CTTTAAAACG CATTTTGTCC GAATGGCAAG TCCGGATGTA TGAAGGCGGG
GGCTGGAATG CGCTCTTTTG GTGCAACCAC GATCAGCCGC GCATCGTGTC GCGCTATGGC
GATGACGGGA CGTATTGGAA AGAATCAGCC AAAATGTTGG CGACGACGAT CCATTTAATG
CAAGGGACGC CATACATTTA CCAAGGCGAA GAAATCGGCA TGACCGACCC GAAATTCACC
GATATTCGCG ACTACCGCGA CGTTGAGTCG CTCAACATGT ACCGGATTTT GCGAGAACAA
GGCAAAAGCG AGCAAGAAGT GATGGAAATT TTGCGGCGAA AATCGCGCGA CAATTCCCGC
ACGCCGATGC AATGGGACGA CAGCCCGCAT GCCGGTTTCA CGTCTGGGAC GCCGTGGATT
CGCGTCGCCG ACAACTACCG GCGCATTAAC GTGAAACAGG CGCTCGCCGA CCGCGACTCG
ATTTTCTACC ATTATAAGCG GCTGATTGAG CTGCGCAAAC AGTATGATCT CATTACGACC
GGGCGCTATG AGCTGTTGCT TGCGGACGAT CCGCATATTT TCGCCTATAT GCGTCATGGC
GATGGAGAAA AGCTGCTTGT TGTCAACAAT TTTTATCCAG TCGAAACGAT GTTCACGCTG
CCGAAAGAAG CGGGAGCCGA TGGCTATACG GGAGAACTGC TGCTTGCCAA TTATTCGGAC
GCGCCGGCCG ATTTTCGCCG CATGCAACTG CGCCCATACG AATCGGTTGT TTATCTTTTG
CGCCGGCCGT GA
 
Protein sequence
MSTTPWWKKA VVYQIYPKSF YDTNGDGIGD LPGVIEKLDY LQELGVDVIW LTPIYASPQR 
DNGYDISDYF RIHHEYGTMG DFDRLLHEVH ARGMKLVMDM VVNHTSTDHE WFKQARSSKT
NPYRSFYIWR DPKPDGGAPN NWQSKFGGSA WEYDEQTGQY YLHLFDVTQA DLDWENEELR
RRIYDMMHFW LQKGVDGFRL DVINLLSKDQ RFPDDDGSVP PGDGRRFYTD GPRIHEFLQE
MNREVFSKYD IMTVGEMSST TIDHCIRYTN PENHELNMTF NFHHLKVDYP NGEKWAVSPF
DFLALKRILS EWQVRMYEGG GWNALFWCNH DQPRIVSRYG DDGTYWKESA KMLATTIHLM
QGTPYIYQGE EIGMTDPKFT DIRDYRDVES LNMYRILREQ GKSEQEVMEI LRRKSRDNSR
TPMQWDDSPH AGFTSGTPWI RVADNYRRIN VKQALADRDS IFYHYKRLIE LRKQYDLITT
GRYELLLADD PHIFAYMRHG DGEKLLVVNN FYPVETMFTL PKEAGADGYT GELLLANYSD
APADFRRMQL RPYESVVYLL RRP