Gene GYMC61_1727 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGYMC61_1727 
Symbol 
ID8525591 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. Y412MC61 
KingdomBacteria 
Replicon accessionNC_013411 
Strand
Start bp1751213 
End bp1752481 
Gene Length1269 bp 
Protein Length422 aa 
Translation table11 
GC content48% 
IMG OID 
Productextracellular solute-binding protein family 1 
Protein accessionYP_003252836 
Protein GI261419154 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGCAAAA CAAAGTGGAT GGCAGCGCTA GGCATCACAA CGATGCTGTT TGGCAGCCTT 
TTGGCCGGTT GCGGCGGTGG TGAGAAAAGT GAGAAGGCAA ACGGGGGGAG TAAGCAAGGC
GAGAAAGTAG AAGTCACGTT AGCTGGCTGG GGCGGCAACC CAACGGAGCA AAAGTTGTTG
AAACAAACGC TTGACGATTT TGAGAAAAAA CACCCTAATA TTAAGGTCAA GTATGAAGTC
ATTGCCGACC AGTATATGGA TGTCATCAAA ACCCGTTTAG CGGGTGGGCA AGGGCCGGAT
GTGTTTTACC TTGATGCATT TGAAGCCCCA GCTCTGATTG AAACAGGGGC GCTTGAGCCG
CTTGACAAAT ATGTAACGGA CGATTTTGAC ATTAACGATT TTGAAAAGCC GATGCTTGAT
GCGTTTAAAG GGAAAGACGG GAAAATTTAC GGATTCCCGA AAGACTATTC GACGCTAGCA
CTGTTTTACA ATAAAAAGAT GTTCGAAGAA GCAGGCGTTG AAGTCCCAAA AACTTGGGAT
GAACTGCGGG AAGTGGCGAA AAAGCTGACA AAAGGGAAGC AAGTATACGG ATTTGGCGTT
GCACCGGAAC TGGCTCGCTT ATACTACATT GCTGAATCCA AAGGCGGCAA AGTTGTGACG
GATAATAAAG CGAGCTTTGC CGATCCGAAA GTCGTCGAGG CGCTCCAGCC GATCGTTGAT
ATGCACTTAA AAGATAAGTC GGCGGCCCAA CCGAATGAAG TTGGGGCGAC ATGGGGCGGC
GAGATGTTCG GGCAAGGCAA AGCTGCTATG GTGATTGAAG GGAACTGGGC GATTCCATTT
TTACAAGACA CGTTCCCGAA TTTAGAATTC GGTACAGCGG AAGTTCCAAT GATCAATGGC
AAAAAGGCGA CGATGGCGTA CACAGTGGCT TATGTCATGA ACAAAGACTC GAAAAAGAAA
GAAGCGGCTT GGGAGCTCAT CTCGTATTTG ACTGGCAAAG AAGGCATGAA AACATGGACG
AGCAAGGGGT ATGCTTTGCC GACGCGGAAA TCGGTCGCTG CTGAATTGGG ATTTGACAAA
GATCCGTTGC GGGCGCCATT AGTCGCTGGA GCATCGTATG CAACTGTATG GCAAAACGGA
ACGAACTTGC CGATTATTAT GAACAACTTC AATAACCAAT TTGTCAGCGC TTTCCTCGGT
GAACGTCCGC TTGCTGAGGC ATTGAAAGAA GCGCAAAAAA CGGCGAATAG CGAAATCGAG
AGCAAATAA
 
Protein sequence
MGKTKWMAAL GITTMLFGSL LAGCGGGEKS EKANGGSKQG EKVEVTLAGW GGNPTEQKLL 
KQTLDDFEKK HPNIKVKYEV IADQYMDVIK TRLAGGQGPD VFYLDAFEAP ALIETGALEP
LDKYVTDDFD INDFEKPMLD AFKGKDGKIY GFPKDYSTLA LFYNKKMFEE AGVEVPKTWD
ELREVAKKLT KGKQVYGFGV APELARLYYI AESKGGKVVT DNKASFADPK VVEALQPIVD
MHLKDKSAAQ PNEVGATWGG EMFGQGKAAM VIEGNWAIPF LQDTFPNLEF GTAEVPMING
KKATMAYTVA YVMNKDSKKK EAAWELISYL TGKEGMKTWT SKGYALPTRK SVAAELGFDK
DPLRAPLVAG ASYATVWQNG TNLPIIMNNF NNQFVSAFLG ERPLAEALKE AQKTANSEIE
SK