Gene GYMC61_2716 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGYMC61_2716 
Symbol 
ID8526593 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. Y412MC61 
KingdomBacteria 
Replicon accessionNC_013411 
Strand
Start bp2761338 
End bp2762663 
Gene Length1326 bp 
Protein Length441 aa 
Translation table11 
GC content44% 
IMG OID 
Productextracellular solute-binding protein family 1 
Protein accessionYP_003253780 
Protein GI261420098 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAGA AAGCGTTCAC ATGGTTGGTT TTGTTGATGT TGATGGCGGG TCTTGTGCTG 
TCTGGTTGTT CTTCTTCATC TAGCGATTCT GCGAGCGAAG GGAAATCCGG GGACGGAAAA
ATTCATTTGA AGTTTATGCA CGATTGGCCA AAAGGCAGTT CAACTGCTTA TTACAATCTT
GTTGAAGAGA TTATTAAAGA CTATGAAGCG AAACACCCGA ATGTTGTCAT TGATGTAGAG
GTGCTGAATC CGGATCAATA CCGAGATAAA TTGAAAGTAT TGGCCGCTTC GAATGAATTA
CCGGATGTAG GGTTAACGTG GTCAAACGGT TTTGCCGAAC CATACGCCAC TGGGGGACAA
TTTGCGCCGC TAAATGATAT TATCGAAAAA GAATTTAAAG ACCAGTTTGT ACCTGGAACG
GTAGAAGCGT ATACCTTTAA CGGCAAATCG TATGCGCTTC CAATGGAGAT GAACATTACG
TATATTTTCT ACAACAAAGA GATTTTCAAA AAATATGATC TTCAGGAACC AAAAACCTTT
GAAGATTTGA AAAATATTGG GAAAACGTTG ATCAAGCATG GGGTGATTCC AGCAACGGTC
GGTTCAAAAG ACGGATGGCC GGCATCGATG TGGTTTATGT ACCTCGCTGA CCGGATTGGC
GGCCCAACGA TTTTGACCGA TGTCATTCAA GGAAAAGTGA AAATGTCTGA TCCAGCCATT
GTAAAAGCGG CGAAAGAAGT TCAAAATCTT GTGGATATGG GGTTCTTTGT CAAAGGGAAC
ACTGCTTTCT CGAATGATGA TGCTAAAGGT TATTTCCTGA ACGAAAAGGC AGCCATGTTC
TTAACGGCCA CATGGGAGTT GCCGAACTTT ACAACCAGCC CGGATGTGCC GCAAGAGTTT
AAAGAGAAGG TGGGTTACTT CAAATTCCCA TTGTATGAAG GCGGCAAAGG AACGGACATC
AACAGTTATG TCGGCGGTCC TGGGTTAGGC GCATTTGTCG CGGAAAACTC AAAGCATAAA
GAACAAGCGA AAGACTTCGC GGCGTACCTT GTCAAAGAAT GGGGCAAACG GTCAGTGGAA
GGTGCAGGCA TTTTACCAGC TACAAAGGTG AATACAGAAG GGTTGAACGT ACCGAAGATG
TATCTCGATG TCTTGCATGA CATCAATAAC GCGACAAACA TTACGACTTG GTTTGACACC
CAAGCAAGCC CGAATGTTTC TGAGCTGCAC CATGACTTAA TGACGGCCCT GTTTGGGAAA
CAAATCACTC CAGAGGAATT TGCGAAACAA CATGATGACG CGTTGGCGGA GGAAGCCAAC
AAATAA
 
Protein sequence
MKKKAFTWLV LLMLMAGLVL SGCSSSSSDS ASEGKSGDGK IHLKFMHDWP KGSSTAYYNL 
VEEIIKDYEA KHPNVVIDVE VLNPDQYRDK LKVLAASNEL PDVGLTWSNG FAEPYATGGQ
FAPLNDIIEK EFKDQFVPGT VEAYTFNGKS YALPMEMNIT YIFYNKEIFK KYDLQEPKTF
EDLKNIGKTL IKHGVIPATV GSKDGWPASM WFMYLADRIG GPTILTDVIQ GKVKMSDPAI
VKAAKEVQNL VDMGFFVKGN TAFSNDDAKG YFLNEKAAMF LTATWELPNF TTSPDVPQEF
KEKVGYFKFP LYEGGKGTDI NSYVGGPGLG AFVAENSKHK EQAKDFAAYL VKEWGKRSVE
GAGILPATKV NTEGLNVPKM YLDVLHDINN ATNITTWFDT QASPNVSELH HDLMTALFGK
QITPEEFAKQ HDDALAEEAN K