Gene GYMC61_0531 details


Gene Information

Locus tag: GYMC61_0531
Symbol:
ID: 8524337
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacillus sp. Y412MC61
Kingdom: Bacteria
Replicon accession: NC_013411
Strand:
Start bp: 530312
End bp: 531601
Gene Length: 1290 bp
Protein Length: 429 aa
Translation table: 11
GC content: 37%
IMG OID:
Product: extracellular solute-binding protein family 1
Protein accession: YP_003251695
Protein GI: 261418013
COG category:
COG ID:
TIGRFAM ID:
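
The coordinates and lengths listed above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming inclusive 1-based coordinates and a protein length that excludes the terminal stop codon. Both are assumptions (the record does not state its conventions), although they agree with the values shown. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(530312, 531601)
print(length, protein_length(length))  # 1290 429
```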


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGAAAAAGA TTTTGACACT ATTTTTAGTA AGTATTCTTC TATTAACTGC TTGTTCTTCA 
AAAACGAAAA ACGGTGGAGC TGCAGATAGC ACAAAAAGTA CGAATGAAAT AACAGTGTGG
ACGTGGGATC CAAATTTCAA CGTAAAAGCG ATGAAAATTG CAGAAGAGTT TTATAAGAAA
GAGAATCCTA ATTTTAAATT AAAAATTATT GAAAATGCTC AAGAGGATAT CGTTCAAAAG
CTTAATACAA GTTTAAGTTC TGGCACAACA AAAGGCTTGC CAAATATCGT ATTAATTGAA
GACTATCGTG CACAGAGCTT CCTTAAAGCG TATCCTGATG CATTCTATGA CCTATCAAAA
TATTTCAAAG CGGAAGATTT TGCTGAATAT AAGCTTGCTC CTACTAGCTT AAATGGTAAA
CATTATGGAC TCCCATTTGA TACAGGGGTT GCTGGATTAT ACGTGAGAAC AGATTATTTA
CAAAAAGCAG GATATACTGT TGATGATTTA CAAGGATTGG ACTGGAACAA ATATATTGAA
ATCGGGAAAA AGGTAAAACA AGCAACGGGT AAATATATGC TTGCACTCGA TCCTAAAAAT
CTGGGGATTA TTCGTGAGAT GATTCAAACT AGTGGTTCAT GGTATGTGAA AGAAGATGGT
GTAACCCCGA ATTTGGCAGA CAATGAGCCG CTGAAGGAAG CATTTAGAAC TTACAAAGAG
ATTGTAAACG CAGGGATATC GAAACCTATT TCCGACTGGA GTCAATTTGT TGCCTCTTTT
AATAGTGGAG CAGTTGCATC GATTCCAACG GGAAACTGGA TTACGGCTTC TGTTAAAGCC
GAAGCTTCTC AAGCCGGTAA GTGGGCTGTT GTGCCTTTTC CAAAACAATC GGGCATTCCT
AACTCTGTTA ATGCTACGAA CCTAGGTGGA AGTTCTTGGT ATGTACTTAA CATTCCTGGA
AAAGAAAAGG CAGCAGAATT TCTAGCTAAA ACGTTTGGAT CAAATGTTGA GTTTTATGAA
GCACTAAACA AAGAGATTGG TGCAATTGGC ACATACAAAC CAGCTGCTAA TAGTGAGGCC
TATAAGGCTG CAGATGAGTT TTTTGGAGGC CAAAAAGTAA CAGCTGATTT CTCCAAATGG
ATGAAAGAAA TTCCTCAAGT AAACTACGGT GCACATACGT ATGTGATCGA AGATATTTTA
GCAGCTGCGT TGCAAGATTA CTTAAAAGGC AAAGATCTTG ATAAAGTGTT GGAAGATGCA
CAAAAACAGG CAGAACAACA AGTTAAATAA
 
Protein sequence
MKKILTLFLV SILLLTACSS KTKNGGAADS TKSTNEITVW TWDPNFNVKA MKIAEEFYKK 
ENPNFKLKII ENAQEDIVQK LNTSLSSGTT KGLPNIVLIE DYRAQSFLKA YPDAFYDLSK
YFKAEDFAEY KLAPTSLNGK HYGLPFDTGV AGLYVRTDYL QKAGYTVDDL QGLDWNKYIE
IGKKVKQATG KYMLALDPKN LGIIREMIQT SGSWYVKEDG VTPNLADNEP LKEAFRTYKE
IVNAGISKPI SDWSQFVASF NSGAVASIPT GNWITASVKA EASQAGKWAV VPFPKQSGIP
NSVNATNLGG SSWYVLNIPG KEKAAEFLAK TFGSNVEFYE ALNKEIGAIG TYKPAANSEA
YKAADEFFGG QKVTADFSKW MKEIPQVNYG AHTYVIEDIL AAALQDYLKG KDLDKVLEDA
QKQAEQQVK
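
The protein sequence can be checked against the gene sequence by translating it with translation table 11, as listed in the gene information. The sketch below is one way to do that in plain Python, not the method used to produce this record. It assumes the sequence is given on the coding strand, and the helper names (`clean`, `translate_cds`, `gc_percent`) are hypothetical. Table 11 uses the standard codon assignments but accepts alternative start codons such as TTG, which is read as M at the first position.

```python
BASES = "TCAG"
# Standard amino-acid assignments, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODONS = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Start codons of NCBI translation table 11 (bacterial, archaeal, plant plastid).
STARTS_TABLE_11 = {"ATG", "TTG", "CTG", "ATT", "ATC", "ATA", "GTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str, has_start: bool = True) -> str:
    """Translate a coding-strand sequence up to the first stop codon."""
    seq = clean(seq)
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    residues = [CODONS[c] for c in codons]
    # An alternative start codon at position 1 is translated as methionine.
    if has_start and codons and codons[0] in STARTS_TABLE_11:
        residues[0] = "M"
    return "".join(residues).split("*")[0]


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(seq.count(b) for b in "GC") / len(seq)


# First and last lines of the gene sequence shown above.
head = "TTGAAAAAGA TTTTGACACT ATTTTTAGTA AGTATTCTTC TATTAACTGC TTGTTCTTCA"
tail = "CAAAAACAGG CAGAACAACA AGTTAAATAA"
print(translate_cds(head))                   # MKKILTLFLVSILLLTACSS
print(translate_cds(tail, has_start=False))  # QKQAEQQVK
```

Passing the full gene sequence to `gc_percent` would give a value to compare against the GC content field.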