Gene GYMC61_3366 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGYMC61_3366 
Symbol 
ID8527254 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. Y412MC61 
KingdomBacteria 
Replicon accessionNC_013411 
Strand
Start bp3429385 
End bp3430710 
Gene Length1326 bp 
Protein Length441 aa 
Translation table11 
GC content51% 
IMG OID 
Productextracellular solute-binding protein family 1 
Protein accessionYP_003254397 
Protein GI261420715 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAGC GCAAGTGGTT CAAACTGGCT TCGTTGTTGC TTGCCGCTGC GGTGGCGGGA 
ACAGGCTGTC AAGGACAGAC CGGGAATGAA CAAGGGCAGA AGAAGGAAGG ATCAACCGTA
CAAATTGAGT TTTGGGCGGC GCCGAATCCG ACGCAGCAGG CATTCTGGAA GAAGATGGCG
GATCGCTATA TGGAAGAGCA TCAAAACGTG AAAATCAAAG TATCGCCGAT GCCGGAAAGC
CCTTCTTCGG AGGCGGGCAT TCAGTCGGCG ATTGCCGCGG GAAATGCGCC GGTCATTTCC
GAAAACATCT CCCGCGGGTT TGCCGCACAG CTGGCGGACA GCCGGGCGAT TGTGCCGCTG
GATCAATTTG AAGGGTTTGA CGACTTAATC GCCAAGCGGC AGATGAAAAG CACGATTTCA
ACATGGAAAT TTGCCGACGG CCATCAATAT GTGCTGCCGA TTTACTCCAA CGCCATGCTG
TTCGGTTGGC GAATCGATAT TCTGAAAGAG TTGGGGTATA ACGCTCCGCC GAAGACGTAC
AGCGAAGTCA TGGAAGTCGG CAAAAAGCTG AAAGAAAAAT ATCCGAACAA GTTTCTATGG
GCGAGAGCCG ATTTGGTTAA ACCGACGTGG TGGGCGAGAT GGTTTGACTT CTTCATGATT
TACAATGCGG CGTCGAATGG GACGCATTTC ATCGATGGGA ACAAGTTGAG TGCGGATCGT
GACGCGGGTG TGAAGACGCT CCAGTTCTTC GCTGATTTAA GCAAAAATCA GCTCGTGTTG
ACGAAAGAGA CGAAAGACCC GTTTGAAAGC GGTACATCCG TGATGTCTGA TTTAGGGCCG
TGGACGTTCC CGTATTGGGC TGAAAAGTTT CCGGAAATGA AGTTTAACGA GAAATACGTG
TTGTCGATGC CGCCTGTGCC TGATGGCATG GATCCGTCGC AGGCGAAAAC GTTCGCCGAT
ACAAAAGGGC TGGTTATTTA CGCTTCGGCG ACGAAAGAGC AGCAACAAGC CGCGTTCGAT
TTCGTCAAAT GGGTGTATTC CGATCCGCAA AACGATTTGG AATGGCTGAA AGAAACGAAC
TTGCCGCCGG CGCGGGACGA CTTGTCGACG AACGAGGCGT TTGTCTCCTA TTTTGAACAA
AATCCGCAGC TGAAACTGTA TGCGGAAAAC ATTCCGAACG CCATACCGCC TATGGATAAC
GCCAAAATGG TTGAGCTTCA AGAGCTGATC GGCAAAGAGG CGTTGAATCC GGTCGTCAAA
GGCGAGAAAA CCCCAGAAAA AGCGTGGGAG GACATGGAAA AGGCGATCCA TGGGGTGTTA
AAATAA
 
Protein sequence
MKKRKWFKLA SLLLAAAVAG TGCQGQTGNE QGQKKEGSTV QIEFWAAPNP TQQAFWKKMA 
DRYMEEHQNV KIKVSPMPES PSSEAGIQSA IAAGNAPVIS ENISRGFAAQ LADSRAIVPL
DQFEGFDDLI AKRQMKSTIS TWKFADGHQY VLPIYSNAML FGWRIDILKE LGYNAPPKTY
SEVMEVGKKL KEKYPNKFLW ARADLVKPTW WARWFDFFMI YNAASNGTHF IDGNKLSADR
DAGVKTLQFF ADLSKNQLVL TKETKDPFES GTSVMSDLGP WTFPYWAEKF PEMKFNEKYV
LSMPPVPDGM DPSQAKTFAD TKGLVIYASA TKEQQQAAFD FVKWVYSDPQ NDLEWLKETN
LPPARDDLST NEAFVSYFEQ NPQLKLYAEN IPNAIPPMDN AKMVELQELI GKEALNPVVK
GEKTPEKAWE DMEKAIHGVL K