Gene GYMC61_1612 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGYMC61_1612 
Symbol 
ID8525475 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. Y412MC61 
KingdomBacteria 
Replicon accessionNC_013411 
Strand
Start bp1639436 
End bp1641085 
Gene Length1650 bp 
Protein Length549 aa 
Translation table11 
GC content50% 
IMG OID 
Productextracellular solute-binding protein family 5 
Protein accessionYP_003252727 
Protein GI261419045 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAAGAAGC GATTTTCTTT CTTCCTCGTC CTGCTTCTCG CGCTGTCAAC GTTCCTCGCG 
GCTTGCGGCG GGGGCAAGGA CAACAACGCT CAAGGCGGCC AGGGAGGCGA AAAGCCGGCG
GAGAAAAAGG AGCAGGTGCT GAACTTGCTT GATTCTTCGG AAATCCCGTC GCTTGACTCG
GCGCTCGCCA CCGACCAAGT ATCGTTTATC GTGTTGAACA ACGTTATGGA AGGGCTGTAC
CGTTTAGGGA AAGACAACAA ACCGGTGCCG GGGATCGCGG AAAGCTATGA AGTGAGCCAA
GACGGCAAAA CGTACACGTT CAAGCTTCGC AAAGACGCCA AATGGTCGAA CGGCGACCCG
GTGACGGCGC ACGACTTCGT GTTTGCTTGG AAACGGGTGC TTGACCCGAA AACGAAAGCG
GAGTATGCGT ACATTATGTA CGACATCAAA AACGCGGAAG AAGTCAACAC CGGCAAACTG
CCAGTTGACC AGCTGGGCGT CAAAGCGCTT GACGACTACA CGCTCCAAGT CGAGCTGAAA
AACCCGATTC CGTATTTCAT CAGCCTGACG GTATTCGGCA CGTTCATGCC GCAAAACGAG
AAATTCGTGA AAGCGCAAGG CGACAAATAC GGTCTCGAAG CTAATACGAC GCTTTACAAC
GGCCCGTTCG TCTTAAGCGA ATGGAAGCAT GAACAAGGCT GGACATATGA GAAAAACCCG
AATTACTGGG ACAAAGATAC AGTTAAGCTT GAAAAAATCA ACGTCAAAGT TGTGAAAGAC
ACGGCGACAG CGGTCAACTT GTATGAAACG AAACAGGCCG ACCGCGTCGG TTTGACGGCC
GAATTTGTTG ATAAGTATAA AAATGATAAA AACTTCAAAA CGGAATTGGA TCCATCCCTC
TACTGGCTGC GCATGAACAC GAAAAAAGAA CCGTTGAACA ACGTCAACGC CCGCAAAGCG
ATTGCGATGG CAATCGACAC GCAAGCGATG GTCGATACGC TCTTGAACAA CGGTTCGATT
CCGGCGAAGT TCACCGTTCC GAAAGACTTC GTCACCGGTC CGGATGGCAA AGACTTCCGC
GATGTGAATG GCGATTTGGT CAATTACAAT CCGGATGAAG CGAAAAAACT GTGGGAGCAA
GCGAAAAAAG AGCTGGGCAA AGACAAATTT ACGCTTGAAC TGCTGAACTA TGACAGCGAC
AGCGCGAAGA AAATCGGCGA GTATGTGAAA GCGCAGCTTG AACAAAACTT GCCGGGCTTG
ACGGTCAACA TTAAACAACA ACCGTTCGCA CAAAAGCTCG AGCTTGAAAG CAAAATGCAA
TACGACCTGT CGTTCTCCGG CTGGGGTCCG GACTATCAAG ACCCGATGAC GTTCCTTGAC
TTGTGGACGA CAACTAACCC GCACAACCAA ACGGGTTGGT CGAACGCTGA ATACGACAAA
TTGATTAAAG ACGCGAAAAC AACGTTGCTT GGCGATTTGC AAGCCCGTTG GGATGCGATG
CTGAAAGCAG AAAAGATCGT CTTTGAAGAA ATGCCGATTG CCCCGCTCTA TCAGCGCGGT
GTCGCCTACT TGCAACGCGA GTATGTCAAA GATATCGTTT CGCACCCGTT TGGCGGCGAT
TACAGCTACA AATGGGCATA TATTGAGTAA
 
Protein sequence
MKKRFSFFLV LLLALSTFLA ACGGGKDNNA QGGQGGEKPA EKKEQVLNLL DSSEIPSLDS 
ALATDQVSFI VLNNVMEGLY RLGKDNKPVP GIAESYEVSQ DGKTYTFKLR KDAKWSNGDP
VTAHDFVFAW KRVLDPKTKA EYAYIMYDIK NAEEVNTGKL PVDQLGVKAL DDYTLQVELK
NPIPYFISLT VFGTFMPQNE KFVKAQGDKY GLEANTTLYN GPFVLSEWKH EQGWTYEKNP
NYWDKDTVKL EKINVKVVKD TATAVNLYET KQADRVGLTA EFVDKYKNDK NFKTELDPSL
YWLRMNTKKE PLNNVNARKA IAMAIDTQAM VDTLLNNGSI PAKFTVPKDF VTGPDGKDFR
DVNGDLVNYN PDEAKKLWEQ AKKELGKDKF TLELLNYDSD SAKKIGEYVK AQLEQNLPGL
TVNIKQQPFA QKLELESKMQ YDLSFSGWGP DYQDPMTFLD LWTTTNPHNQ TGWSNAEYDK
LIKDAKTTLL GDLQARWDAM LKAEKIVFEE MPIAPLYQRG VAYLQREYVK DIVSHPFGGD
YSYKWAYIE