Gene GYMC61_2959 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGYMC61_2959 
Symbol 
ID8526836 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. Y412MC61 
KingdomBacteria 
Replicon accessionNC_013411 
Strand
Start bp3015187 
End bp3016500 
Gene Length1314 bp 
Protein Length437 aa 
Translation table11 
GC content50% 
IMG OID 
Productextracellular solute-binding protein family 1 
Protein accessionYP_003254005 
Protein GI261420323 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAGA CGTTAAAAAC ATCGGCTGTT TTGTTAGCGC TTACGATGGG GCTCGCGGCA 
TGCTCCGGCG GCCAAAGCGA GAGCCAGACG GGCAAAGACA GCAAAAAAGA AGAAGGGAAA
ACGAGCCAGT CTGAAAAAGT GAAAATTGTC TATGCCCGCG GCCAAGACTC CACGAAAGCC
ACCGAAAAAA TCATCGAAGC GTTCGAGAAA ACGCACCCGA ACATCGATGT GGAATTTCGC
GAAATGCCGG CTGACACCGG GAAGCAGCAT GATGCGTATG TCACGATGTT AAATGCGCAG
TCGTCGGAAA TCGATGTCAT GGATCTCGAC GTCATTTGGC CGGCTGAGTT TGCTCAAGCG
GGTTATACGC TGCCGCTCGA CCGCTTTATT GAGAAAGACG GCATTGACCT CGGCAAATAC
AACCAAGGGG CGCTTGCGGC CGGCAACTTC AATGGCAAGC AATGGGCGAT GCCGAAATTC
ATTGATGCCG GCATGCTGTT CTATCGCACC GACCTGGTGC CAAAGGACAA AGTGCCGAAA
ACATGGGATG AGCTCTTAAA AGAGGCGAAA GAACTGAAAG GAAAAGGCGG CACGAAATTC
GGCTATTTGA TGCAAGCCAA ACAATATGAA GGATTGGTGT GCAACGCGGT TGAATTTATC
GCCTCTTATG GCGGGCAAAT CGTCGATAAA AACGGGAATG TCGTGATCAA CAGCCCAGAA
ACGATCAAAG GGCTGAAGAA AATGGTGGAA ATCGTCAAAT CCGATGTTGT GCCGAGCAAC
ATCACCACCT TCACTGAACC AGAATCGCAT ACCGCGTTCA TTGAAGGACA ATCGCCGTTT
ATCCGCAACT GGCCTTATCA ATACGCGCTA GCGAATGACA AAGAACAATC GAAAATCGTC
GGCAAAGTCG GCGTGGCTCC GCTTCCGGCC GGAGACAAAG GTTCGGCTGC CGCGCTGGGC
GGCTGGATGA CCGCGATCAA CAAATATTCG AAACATCCGA AAGAAGCATG GGAATTTGTC
AAATTTATGA CGGGGCCGGA GGGGCAAAAA ATTTCTGCGA TTTACGGCGG TTTAGCGCCA
ACGCTTCCGG AACTGTTTAA AGATCCAGAT GTCTTAAAAG CCAATCCGTT CTTTGCGGAA
GAAGGATTTG TCAATGCGTT GAACGCCGCT GTGCCGCGCC CGGTCGTCCC GAACTACCCG
GAAATTTCGG AAATCATCCA AATTAACGTA TCCAAAGCGC TGGCCGGGGA GCTGACGGTC
GAACAAGCTG TGGCCAATAT GGAAAAAGAA ATGAAAGCAG CCTTGAATAA GTAA
 
Protein sequence
MKKTLKTSAV LLALTMGLAA CSGGQSESQT GKDSKKEEGK TSQSEKVKIV YARGQDSTKA 
TEKIIEAFEK THPNIDVEFR EMPADTGKQH DAYVTMLNAQ SSEIDVMDLD VIWPAEFAQA
GYTLPLDRFI EKDGIDLGKY NQGALAAGNF NGKQWAMPKF IDAGMLFYRT DLVPKDKVPK
TWDELLKEAK ELKGKGGTKF GYLMQAKQYE GLVCNAVEFI ASYGGQIVDK NGNVVINSPE
TIKGLKKMVE IVKSDVVPSN ITTFTEPESH TAFIEGQSPF IRNWPYQYAL ANDKEQSKIV
GKVGVAPLPA GDKGSAAALG GWMTAINKYS KHPKEAWEFV KFMTGPEGQK ISAIYGGLAP
TLPELFKDPD VLKANPFFAE EGFVNALNAA VPRPVVPNYP EISEIIQINV SKALAGELTV
EQAVANMEKE MKAALNK