Gene GYMC61_2845 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGYMC61_2845 
Symbol 
ID8526722 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. Y412MC61 
KingdomBacteria 
Replicon accessionNC_013411 
Strand
Start bp2904164 
End bp2905483 
Gene Length1320 bp 
Protein Length439 aa 
Translation table11 
GC content50% 
IMG OID 
Productextracellular solute-binding protein family 1 
Protein accessionYP_003253903 
Protein GI261420221 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAAAGG TTTTGGCTGG TTTCTTGTTC TTAGCGCTTG TTGGGCTTGC TGCGGGATGT 
TCAAGTGAAG ATGCTGGACA AGCGGCGGGA GGCAAAACCG AAGTTGTATT TTGGCACTCG
ATGAGCGGGG ATTTGGAGCC GGTGTTGAAT GATCTTGTGG CGGATTTTAA CCAAACCCAT
CCGGATATTG AGGTAAAGCC GGTGTTTCAA GGAACATATG AAGAAGCGCT GACGAAATGG
AATGCGGTAG CGGGGACCAA AGATGCGCCG ACGATCATGC AGACGTTTGA AGTCGGGACA
AAGCATATGA TCGACAGTGG AAAGATTGTT CCGGTGCAAA CGTGGATCGA TAAAGACAAG
TATGATGTTT CGCAATGGGA GAAAAACATT GTCAATTATT ATACCGTGAA CGGGCGAATT
TACTCGATGC CATTTAACTC GTCAACCCCT GTGTTAATTT ATAATAAAGA TGCGTTCCGC
GAAGCCGGGC TTGATCCGGA AAAGCCGCCG CTGACCTACA GCGAGTTGAA AGAAGCGGCG
AAAAAGCTGA CAAAGAAAAA AGGGAAGGAA ACCGAACGGT ACGGATTCTC GATTTTGAAC
TACGGCTGGT TTTTTGAAGA AATGGTGGCC GTACAAGACG GGCTATATGT GAACAACAAC
AATGGCCGGA GCGGTAATGC AACGAAAGCA GTATTTAATG GAGAGGAAGG GAAACGTGTA
TTTGAGTTGA TCCGCGACAT GTATCGAGAC GGCACGTTTT ACAACGTCGG CCAAAATTGG
GACGATATGC GCGCTGCCTT CCAAGCGGGA AAAATCGCCA TGTATTTGGA TTCGTCCGCT
GGCGTAAAAA CGTTGATCGA CAACTCGCCG TTTGACGTTG GCGTTTCGTA TTTGCCTGTT
CCGGATGGCG TAGAGCGCCA AGGCGTCGTG ATCGGCGGCG CTTCTCTTTG GATGATGAAA
GGAAGCAGCG AAGAGGAACA AAAAGCGGCG TGGGAGTTCA TGAAATACTT GACGACTGCT
CCCGTCCAAG CCGAGTGGCA TGTGCGCACA GGCTATTTCG CCATCAACCC AGCTGCGTAC
GATGAGCCGC TGGTCAAAGA GGAATGGACG AAATACCCTC AATTAAAAGT GACGGTGGAC
CAGCTGCATG AAACAAAATC AACCCCTGCC ACCCAAGGAG CGCTCATCAC CGTCTTCCCT
GAATCTCGGC AACATGTCGT GAAAGCGATG GAACGGTTGT ATGAAGGCAT CGATCCGCAA
GAAGCGCTCA ATCAAGCAGC GGAAGAAACG AACCAGGCGT TGCAGGGGGC GGCAAATTAG
 
Protein sequence
MRKVLAGFLF LALVGLAAGC SSEDAGQAAG GKTEVVFWHS MSGDLEPVLN DLVADFNQTH 
PDIEVKPVFQ GTYEEALTKW NAVAGTKDAP TIMQTFEVGT KHMIDSGKIV PVQTWIDKDK
YDVSQWEKNI VNYYTVNGRI YSMPFNSSTP VLIYNKDAFR EAGLDPEKPP LTYSELKEAA
KKLTKKKGKE TERYGFSILN YGWFFEEMVA VQDGLYVNNN NGRSGNATKA VFNGEEGKRV
FELIRDMYRD GTFYNVGQNW DDMRAAFQAG KIAMYLDSSA GVKTLIDNSP FDVGVSYLPV
PDGVERQGVV IGGASLWMMK GSSEEEQKAA WEFMKYLTTA PVQAEWHVRT GYFAINPAAY
DEPLVKEEWT KYPQLKVTVD QLHETKSTPA TQGALITVFP ESRQHVVKAM ERLYEGIDPQ
EALNQAAEET NQALQGAAN