Gene GYMC61_3304 details


Gene Information

Locus tag: GYMC61_3304
Symbol:
ID: 8527192
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacillus sp. Y412MC61
Kingdom: Bacteria
Replicon accession: NC_013411
Strand:
Start bp: 3365049
End bp: 3366344
Gene Length: 1296 bp
Protein Length: 431 aa
Translation table: 11
GC content: 53%
IMG OID:
Product: extracellular solute-binding protein family 1
Protein accession: YP_003254339
Protein GI: 261420657
COG category:
COG ID:
TIGRFAM ID:
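
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and that the stop codon is counted in the gene length but not in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, includes_stop: bool = True) -> int:
    """Number of residues encoded by a CDS of the given length.

    Assumes the CDS is a whole number of codons; when includes_stop is True,
    the final codon is a stop codon and contributes no residue.
    """
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1 if includes_stop else codons


length = gene_length(3365049, 3366344)
print(length)                  # 1296
print(protein_length(length))  # 431
```

Under these assumptions the computed values agree with the Gene Length (1296 bp) and Protein Length (431 aa) fields of this record.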


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGAAAGA AAGCATCCGC ATGGCTGGCG CTGGCGCTTG GAATTGGGAT GGCGCTGTCG 
GGCTGCAGCA GCTCGAACTC TTCATCCGAT AACGCCAAAC AAGGCAGCCA AAAAGCGGGA
GAGAAGCTCG AGATTTTCAG CTGGTGGACC GGAGCCGGTG AGGAAGACGG CTTAAAAGCG
CTCATCAAAC TGTTCCAAGA AAAATATCCG GATATTGAAG TCGAAAACGC CGCGGTCGCC
GGCGGGGCCG GAACGAACGC CAAGGCGGTG TTGGCGAGCC GCATGCAAGG GAACGACCCG
CCATCAACGT TCCAAGTGCA CGGCGGAGCG GAGCTGAATG AAGGCTGGGT GGCGGCTGGC
AAAATGGAGC CGCTCAACGA TTTGTATGAA AAAGAAGGCT GGATGGACAA ATTCCCGAAA
GCGCTTATCG ATATGGTAAG CAAAGACGGC AACATTTACT CCGTGCCTGT CAACATTCAC
CGCGGCAACG TGCTGTGGTA CAACAAAAAA ATCTTCGCTG ACAACGGACT CCAGCCGCCG
AAAACGTTCG ACGAGTTTTT CCAAGTGGCT GACAAGCTGA AAGCAAAAGG CATTACGCCG
CTCGCCTTGG GCGACAAAGA GCCGTGGGCT GCGACGCACC TGTTTGAAAA TGTGCTGCTT
GGCACGCTCG GAACGGAAAA CTATAAGAAG CTTTGGACGG GCGAATTGTC GTTTAACGAC
CCACAAGTGA AACAAGCCGT CGAGACGTTT AAGAAAATGC TTGGCTATAT TAACGAAGAC
CATAGCTCGC GCAACTGGCA AGACGCCGCC CAGCTCGTCG CTGAGGGGAA AGCGGCCATG
TACGTGATGG GCGATTGGGT GAAAGGCTAT TTTGTCAACG ATTTGAAATT GAAGGTGAAC
CAAGACTTCG GCTATGTGCC AGTGCCGAAT ACGGAAGGCA AGTTTATGGT CATTACTGAT
ACGTTCGGCC TGCCGAAAGG CGTGAAAAAC CCGGATGATG TGAAGAAATT TTTAGCGGTG
CTTGGTTCGG TGGAAGGGCA AGATGCGTTT AACCCGCTGA AAGGCTCCAT CCCGGCCCGC
ATCGACGCGG ATCCGTCCAA GTACGATGAA TACGGCAAAC AAACGATGCA AGACTTCAAA
ACGGCGGAGC TGGCGCCGAG CTTAGCGCAC GGTTCAGCAG CGCCGGAAGG GTTTGTCACG
AAGGTGAATC AGGCCGTCAA CATTTTCGTG ACGCAAAAAG ATGTGAAGAC GTTCATCGAC
ACGTTGGCAT CGGCCGCCGC AGAACTGAAG AAGTAA
 
Protein sequence
MRKKASAWLA LALGIGMALS GCSSSNSSSD NAKQGSQKAG EKLEIFSWWT GAGEEDGLKA 
LIKLFQEKYP DIEVENAAVA GGAGTNAKAV LASRMQGNDP PSTFQVHGGA ELNEGWVAAG
KMEPLNDLYE KEGWMDKFPK ALIDMVSKDG NIYSVPVNIH RGNVLWYNKK IFADNGLQPP
KTFDEFFQVA DKLKAKGITP LALGDKEPWA ATHLFENVLL GTLGTENYKK LWTGELSFND
PQVKQAVETF KKMLGYINED HSSRNWQDAA QLVAEGKAAM YVMGDWVKGY FVNDLKLKVN
QDFGYVPVPN TEGKFMVITD TFGLPKGVKN PDDVKKFLAV LGSVEGQDAF NPLKGSIPAR
IDADPSKYDE YGKQTMQDFK TAELAPSLAH GSAAPEGFVT KVNQAVNIFV TQKDVKTFID
TLASAAAELK K
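
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a minimal sketch, not the tool that produced the record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons). The function name `translate` and the whitespace handling are assumptions made for this example.

```python
from itertools import product

# Codon table in the conventional TCAG ordering. Translation table 11 uses the
# same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace (the 10-base grouping and line breaks used in this record)
    is removed before translation.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence in this record
first_line = ("ATGAGAAAGA AAGCATCCGC ATGGCTGGCG "
              "CTGGCGCTTG GAATTGGGAT GGCGCTGTCG")
print(translate(first_line))  # MRKKASAWLALALGIGMALS
```

The printed residues match the first two 10-residue groups of the protein sequence above (MRKKASAWLA LALGIGMALS). Passing the full gene sequence to the same function would be one way to compare the translation against the listed protein.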