Gene GYMC61_1511 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGYMC61_1511 
Symbol 
ID8525374 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. Y412MC61 
KingdomBacteria 
Replicon accessionNC_013411 
Strand
Start bp1541713 
End bp1542987 
Gene Length1275 bp 
Protein Length424 aa 
Translation table11 
GC content50% 
IMG OID 
Productextracellular solute-binding protein family 1 
Protein accessionYP_003252631 
Protein GI261418949 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAAAAG CGTTATCTTT ATTTCTCATG GCTGTTTTGT TGATTGGCGT GCTGGGTGCC 
TGCGGTCCTA AGCGCGATGT CCAGCAGCCG AAAGGAAACG ACCAAAAGAC GGCAGAACAG
GACAAAAAGC CAGAGAAACT CGTCGTATGG GTGAACGATG ATGAAAAACA AAAACAAGCG
TTAAAGGATA TTTTTCAAAA GTACACAGAA AAAACGGGAA TCAAAATCGA AATGGCAGCT
GTCAGCATGC TCGACCAAAC GAAAAAACTG GCGCTCGACG GTCCGGCTGG CAAAGGGCCG
GACGTGTTCT ATCAGCCGCA TGACCGCATC GGCGACATTG TGTTGCAGGG GCTGGCCGAT
CCGGTCGACC TTGGCGATGC AAAAGGCGAG TACAGCCCGA CAGCCATCAA CGCGGTAACA
TATGATGGGA AAACGTATGG CGTGCCGATG GTCGTCGAAA CGTACGGCGT GTTCTACAAT
AAAAATTTAG TGCCGGAAGC GCCGAAAACG ATGGATGACT TGATGAAGAT CGCCAAGGAA
AAAACGAATG CGGCGAAAGA TCAATACGGC TTTTTAATGG AAGCGACGAA CTTCTATTTC
GTTTATCCGT TCTTTGCTGG CTACGGCGGC TATGTGTTTA AAAATGAAAA CGGCAAATAC
GACACAAGCG ACATCGGGCT GGCTAATGAT GGAGCCGTCA AAGGAGCCGA GCTCGTCCAA
TCGTGGTTTA AAAACGGCTA CATTCCGAAA GAAATCACCG GCGATATTAT GAACGGGCTG
TTTACGAAAG GGAATGTGGC GGTTGCCATC ACCGGACCAT GGAACATCGC GTCGTATAAA
GAAGCGTTGG GCGACAAATT AGCGACCGCG CCGCTGCCGG TCTTAGACAA CGGCGAGCAT
CCGAAATCGT TCGTCGGCGT GAAAACATGG ATGTTATCAG CCTATTCACA AAACAAAGAA
TGGGCGGTCG ATTTCATGAA ATTTGTGACG AACGAAGAAA ATTCGCTCCA TTACTATGAA
GTGGCAGGTG AAATGCCGGC GAATGAAAAA GCGTTGACAA ATGAGAAAAT TACGAATGAT
CCGTTGATCG CCGGCTTTGC GGAGCAAATC CAATACGGCG AGCCGATGCC GAACGTGCCG
GAAATGTCGC AAGTGTGGGA GCCGATGGGC AATGCCTTGC AATTTATCGC GAAAGGCGAC
AACCCGAAAG CGGTGCTCGG TGAGGCGGTC AAAACGATTC AAGATAAAAT CGCCGCCAGC
GGCGCTGGAA AATAA
 
Protein sequence
MRKALSLFLM AVLLIGVLGA CGPKRDVQQP KGNDQKTAEQ DKKPEKLVVW VNDDEKQKQA 
LKDIFQKYTE KTGIKIEMAA VSMLDQTKKL ALDGPAGKGP DVFYQPHDRI GDIVLQGLAD
PVDLGDAKGE YSPTAINAVT YDGKTYGVPM VVETYGVFYN KNLVPEAPKT MDDLMKIAKE
KTNAAKDQYG FLMEATNFYF VYPFFAGYGG YVFKNENGKY DTSDIGLAND GAVKGAELVQ
SWFKNGYIPK EITGDIMNGL FTKGNVAVAI TGPWNIASYK EALGDKLATA PLPVLDNGEH
PKSFVGVKTW MLSAYSQNKE WAVDFMKFVT NEENSLHYYE VAGEMPANEK ALTNEKITND
PLIAGFAEQI QYGEPMPNVP EMSQVWEPMG NALQFIAKGD NPKAVLGEAV KTIQDKIAAS
GAGK