Gene GYMC61_2731 details


Gene Information

Locus tag: GYMC61_2731
Symbol:
ID: 8526608
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacillus sp. Y412MC61
Kingdom: Bacteria
Replicon accession: NC_013411
Strand:
Start bp: 2781619
End bp: 2782983
Gene Length: 1365 bp
Protein Length: 454 aa
Translation table: 11
GC content: 42%
IMG OID:
Product: extracellular solute-binding protein family 1
Protein accession: YP_003253795
Protein GI: 261420113
COG category:
COG ID:
TIGRFAM ID:
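
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the gene length counts the terminal stop codon while the protein length does not. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2781619
end_bp = 2782983
gene_length_bp = 1365
protein_length_aa = 454

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the gene length includes the stop codon, which is not translated.
codons = gene_length_bp // 3
expected_protein_aa = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", gene_length_bp % 3 == 0)
print("protein length matches:", expected_protein_aa == protein_length_aa)
```

Under these assumptions all three checks agree with the listed values.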


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATGAGGA CGAAAAAAAT GAAAACGCTT TTTCTTGCTC TAGCGTTTGC CGTGATTTCT 
GCTTTAGTAG GGTGTTCGGA CAAAAACGCA TCTGTCGGCA ATGGCAATGG GGAAAAAATA
GAACTGACGT TTATGTTTAG GGGACAGCCC CAAGAGCAAA CAGCTTATAA GAACGTAGTG
AAAAAATTCG AAGAAAAACA TCCGAATGTA AAAGTAAATA TTGTTGTTAC GTCGCCAGAT
CAATATGCTA CGAAACTACG GGCTGCGATC GCGGGAAGAA AAATACCTGA TGTGTTTTAT
TTCAACCCCG GAGAACTTCG CGCTTATGTG AATTCCAATG TACTATTAGA CATCACAAAA
TATGTAGAAA ACTCAAAAGG TGTTAATCTC CAAGATATTT GGGAAAAAGG GGTAAATAAA
TATCGATTTG ATGGAGAAAA GGTCGGTCAG GGAAATCTTT ATGGGCTGCC GAAAGATTTA
GGACCGTTTG CACTCGGGTA CAATAAAACA ATGTTTGAAA AAGCAGGGAT TCCTCTTCCA
GATAAAGATA AACCATATAC ATGGCAAGAA TTTATTGATG TTTGTAAGAA ACTAACCAAA
GACACGAATG GCGATGGGAA GCTCGACCAA TGGGGAACAG GTTTAAATGC CACATGGACG
TTGCAAGCGT TTGTTTGGAG CAATGGTGCC GATTGGATTG ATGAAAGCAA AACGAAAGTT
ACCGTTGACG ATCCGAAATT TATAGAAGCC CTCCAATTCT TTGCTGACAT GCAGAATAAA
TATAAGGTCA CCCCATCGAT TGCGGAGGCG CAGACATTGG ATACGTATCA ACGCTGGTTG
AGAGGGCAAC TTGGCTTTTT CCCTGTAGGT CCTTGGGATT TAGCTGCTTT TGACCAACAA
ATCAAATTTG AGTATGATTT GATTCCATGG CCTGCAGGTT CGACTGGCAA ACCGGCTACT
TGGGTTGGGT CGCTTGGAAT CGGGGTGTCA AGCATGACCA AGCATCCAAA AGAGGCAGTA
GAGTTAGCAT TATATTTGTC CGCTGATCCA GAGGGGCAGA AAGCGCTTGT TGACCAGCGT
GTACAGTTGC CGAACTCTGT GAAAGTAGCT GAAGAGTGGG CAAAAGATCC TTCCATTAAG
CCGGCAAACA AGCAGGAATT TTTGGATATC ATTAATGATT ATGGGCGTTC ATTTCCGACA
GAATATACGT ACAACGGTGA ATGGTACGAC GAGTTTTATC GCAATCTGCA ACCAGTTTTA
GATGGAAAAA TGTCCGCTGA AGAGTACGTA AAGAAAGCAA AGCCGAAAAT GCAAAAGCTG
TTGGATCAGG CAATCGAACA AGAAAAACAA GCAAGCAAAA AATGA
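
The gene sequence above is printed in blocks of 10 bases, so the spaces and line breaks must be removed before computing anything from it. The following is a minimal sketch of that cleanup together with a GC-content calculation. The function names `clean_sequence` and `gc_percent` are hypothetical helpers, and only the first row is used here as a short demonstration.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a sequence printed in 10-base blocks."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row of the gene sequence above, used as a small demonstration.
first_row = "ATGATGAGGA CGAAAAAAAT GAAAACGCTT TTTCTTGCTC TAGCGTTTGC CGTGATTTCT"
seq = clean_sequence(first_row)
print(len(seq), "bases in the first row")
print(round(gc_percent(seq), 1), "% GC in the first row")
```

Passing the full gene sequence through the same two functions gives a figure that can be compared with the GC content field of the record.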
 
Protein sequence
MMRTKKMKTL FLALAFAVIS ALVGCSDKNA SVGNGNGEKI ELTFMFRGQP QEQTAYKNVV 
KKFEEKHPNV KVNIVVTSPD QYATKLRAAI AGRKIPDVFY FNPGELRAYV NSNVLLDITK
YVENSKGVNL QDIWEKGVNK YRFDGEKVGQ GNLYGLPKDL GPFALGYNKT MFEKAGIPLP
DKDKPYTWQE FIDVCKKLTK DTNGDGKLDQ WGTGLNATWT LQAFVWSNGA DWIDESKTKV
TVDDPKFIEA LQFFADMQNK YKVTPSIAEA QTLDTYQRWL RGQLGFFPVG PWDLAAFDQQ
IKFEYDLIPW PAGSTGKPAT WVGSLGIGVS SMTKHPKEAV ELALYLSADP EGQKALVDQR
VQLPNSVKVA EEWAKDPSIK PANKQEFLDI INDYGRSFPT EYTYNGEWYD EFYRNLQPVL
DGKMSAEEYV KKAKPKMQKL LDQAIEQEKQ ASKK
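
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The following is a minimal sketch, assuming that translation table 11 assigns the same amino acids to codons as the standard genetic code and differs only in which start codons it permits; since this gene begins with ATG, the sketch ignores start-codon handling. `CODON_TABLE` and `translate` are illustrative names, and a codon containing a character other than A, C, G or T would raise a `KeyError`.

```python
from itertools import product

# Standard codon-to-amino-acid mapping, with bases ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(blocked_dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    seq = "".join(blocked_dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: translation ends here
            break
        protein.append(aa)
    return "".join(protein)


# First and last rows of the gene sequence above. Each full row holds 60 bases,
# so every row starts in frame.
first_row = "ATGATGAGGA CGAAAAAAAT GAAAACGCTT TTTCTTGCTC TAGCGTTTGC CGTGATTTCT"
last_row = "TTGGATCAGG CAATCGAACA AGAAAAACAA GCAAGCAAAA AATGA"
print(translate(first_row))
print(translate(last_row))
```

The two printed fragments correspond to the beginning and end of the protein sequence listed above.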