Gene GYMC61_1352 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGYMC61_1352 
Symbol 
ID8525191 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. Y412MC61 
KingdomBacteria 
Replicon accessionNC_013411 
Strand
Start bp1373391 
End bp1374995 
Gene Length1605 bp 
Protein Length534 aa 
Translation table11 
GC content51% 
IMG OID 
Productextracellular solute-binding protein family 5 
Protein accessionYP_003252480 
Protein GI261418798 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGGAAGA AAACATGGTT TACGTGGCTT GCCATGATGC TGGCAGTCAT GCTTGTTTTA 
GCTGGCTGCG GCAAGTCGAA GCAAACCGCA GGGGGTGGCA GTGACAAACC GTCGACCCAG
GATACGCTGG TGTACGGGCG CGGCGGTGAT TCGGTGTCGC TCGACCCGGC GACTGTGACG
GATGGGGAGT CGCTGAAAGT AACGAAAAAC ATTTTTGATA CGCTCCTTGA TTACAATGAC
AACGATACGT CCGTCAAGCC GGCGTTGGCG ACGGAATGGA CGATTTCGAA GGATGGACTG
ACCTATACGT TTAAATTGCG CCAAGGTGTC AAGTTTCATG ACGGGACAGA GTTTAACGCC
GATGCAGTGG TCTTTAACTT CGAGCGTTGG GCGAACGGCA ATGCCGACAA GTTTCCGTAT
TACGGATCGA TGTTTGGCGG TTACAAGCAG GATGACAGCC ATGTGATTAA AGAAGTAAAG
GCGCTCGACA AGTACACGGT GCAATTTGTG CTGAAACGGC CGCAAGCTCC ATTTTTGAAA
AATCTCGCCA TGACGCCGTT TGCCATCGCC AGCCCAGAAG CCGTGAAAAA ATACGGCGAC
AAGTTTGGCG AACACCCAGT CGGGACCGGC CCGTTCGTCT TTAAAGAATG GAAGCGCAAC
GAACGGATCG TACTTGAAAA AAATAAAGAC TATTGGGAAA AAGGCTATCC AAAGCTGAAC
CAGCTCATCT TCGTGTCCAT TCCGGACAAC TCGGCGCGTC TCAATGCGCT CTTAAAAGGC
GAAATCGACA TCATGGAAGA CTTGAATCCG ACGGACTTAA AACAAGTGGA GGGAAACAAA
GAGTTTCAAA TTTTCAAGCG CCCGTCGATG AACGTCGCCT ATGTCGGACT GACGGCGACG
AGAGGGCCGC TGAAAAACAA GTTGGTTCGC CAAGCGTTGA ACTACGCGGT TGATAAGAAA
GCGATCATCG ATGCGTTTTA CGCCGGCCAG GCGGAACCGG CGAAAAACCC GATGCCGCCC
AGCATCCCGG GATACAACGA TGCGATTCAA GACTATCCGT TTGATTTGAA TAAAGCGAAA
GAGCTGCTGG CGAAAGCGGG TTATCCGAAT GGCTTTGAAA TCGAACTGTG GGCGATGCCG
GTGCCGCGTC CGTATATGCC GGACGGGCAA AAAATCGCTG AGGCCATTCA AGCGAATTTT
GCCAAAATCG GCGTGAAAGC GAAAATCGTG ACGTATGAAT GGGCGACCTA TTTAGACAAG
CTCGCCAAAG GGGAAGCGGA CGCCTTCCTG CTCGGCTGGA CGGGCGACAA CGGCGACGCG
GATAACTTCT TGTATGCGCT CCTTGACAAA GACAGCATTG GCAGCAACAA CTACACCTAT
TTCTCGAATG ATGAGCTGCA TAAAATTTTG GTCGAAGCGC AAACGGTGAG CGATGAAAAC
AAACGGAACG AGCTGTATAA AAAAGCGCAA GAGATCATTA AAGAAGAAGC GCCATGGATT
CCGCTCGTCC ATTCAACTCC GCTGTTGGCC GGCAAGGCGA ATATCCAAGG CTTTAACCCG
CACCCGACCG GTTCGGATAA GTTTACGAAA GTCGAGTTTA AATAA
 
Protein sequence
MRKKTWFTWL AMMLAVMLVL AGCGKSKQTA GGGSDKPSTQ DTLVYGRGGD SVSLDPATVT 
DGESLKVTKN IFDTLLDYND NDTSVKPALA TEWTISKDGL TYTFKLRQGV KFHDGTEFNA
DAVVFNFERW ANGNADKFPY YGSMFGGYKQ DDSHVIKEVK ALDKYTVQFV LKRPQAPFLK
NLAMTPFAIA SPEAVKKYGD KFGEHPVGTG PFVFKEWKRN ERIVLEKNKD YWEKGYPKLN
QLIFVSIPDN SARLNALLKG EIDIMEDLNP TDLKQVEGNK EFQIFKRPSM NVAYVGLTAT
RGPLKNKLVR QALNYAVDKK AIIDAFYAGQ AEPAKNPMPP SIPGYNDAIQ DYPFDLNKAK
ELLAKAGYPN GFEIELWAMP VPRPYMPDGQ KIAEAIQANF AKIGVKAKIV TYEWATYLDK
LAKGEADAFL LGWTGDNGDA DNFLYALLDK DSIGSNNYTY FSNDELHKIL VEAQTVSDEN
KRNELYKKAQ EIIKEEAPWI PLVHSTPLLA GKANIQGFNP HPTGSDKFTK VEFK