Gene GYMC61_2742 details

Gene Information

Locus tag: GYMC61_2742
Symbol:
ID: 8526619
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacillus sp. Y412MC61
Kingdom: Bacteria
Replicon accession: NC_013411
Strand:
Start bp: 2794719
End bp: 2796344
Gene Length: 1626 bp
Protein Length: 541 aa
Translation table: 11
GC content: 42%
IMG OID:
Product: extracellular solute-binding protein family 5
Protein accession: YP_003253806
Protein GI: 261420124
COG category:
COG ID:
TIGRFAM ID:
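
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon is a stop codon not counted in the protein length.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values copied from the record for GYMC61_2742.
gene_length = span_length(2794719, 2796344)
print(gene_length)                           # compare with "Gene Length"
print(expected_protein_length(gene_length))  # compare with "Protein Length"
```

Under these assumptions the computed values agree with the 1626 bp and 541 aa reported in the record.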


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAGAA GTCTTCTAAC TGTATTGGTG TTGTTGCTAT TCGTTTCAAC GGCGCTTGTC 
GCTTGCAGCA ATGAAACGCC GTCAAATGAA GGAAGTGAGG CGGCCAAGAA TATTAAGCAG
GAAATTACGT TGAATGCAAA GACTGAGCCG CCTAGCTTAG ACCCTGCAAT CGCATCGGAT
ACGACGTCGG GATGGGTGAT CGACCACCTA TTTGAAGGAT TGTATACGAA AAATCAAAAA
GGAGAGCCTG TATTAGGAGC CGCAAGTGAT GTAAAAATAT CAGAAGATGG AAAAACGTAT
ACGTTTACGA TTCGTGAGGA TGCAAAATGG TCTGACGGTG ATCCAGTCAC GGCCTATGAC
TTTGAATACG CTTGGAAACG TGTGCTTGAT CCGAAGACAG GGAGCCCGTT CGCTTTTTAT
ATGTATTACA TCAAGGGGGC CGAAGAATAT AACAAAGGAA AGGGAAGCGC GGATCAGGTA
GGGATTAAGG CGTTGGATGA CAAAACATTC CAAGTGGAAT TAAAAGCACC GTTGGGATAT
TTTGATAAAT TGCTGACCAT GTGGACATTT TATCCAGTGA AAAAATCTCT CGTTGAATCG
AACCCGAAAT GGGCAGCGGA CGCAAAAGGA TATGTAAGCA ACGGGGCTTA TCGTTTGACA
GAATGGAAGC ATAATAGTGA AGTTGTCATC GAAAAGAATG AACATTATTG GAACAAAGAT
CAAATTAATA TGCAAAAGGT AACATGGAAG ATGGTCAATG ATGCGACGAC ATACTATCAA
ATGTATAAAA CAGGGGAGCT TGACTTAATT GACACCTTGC CGACTGACGT CATTGACCAA
GAAAAAAATA ATAAGGAGTT TAAAATCACT CCATACTTTG GTACGTATAT GTTTATGCTG
AATGTAGACA AACCACCGTT TACGAACGCA AAAATTCGCC GCGCTTTTGC CATGGCCATT
GATCGGGAGG CAATTGTCAA AAATATTACC AAATCTGGTG AAAAACCGGC TTATGCCTTC
GTACCATACG GTGTCAATAC TCCGAAAGGC GATTTCCGAG AAGTGGGCGG TTCTTATTTT
GAAGAGAACG TCAAAGAAGC GAAACAGTTA TTGGAAGAAG GTATGAAGGA AGAAGGATGG
ACAAAGCTTC CAGAAGTCAC GCTAATGTAT AATACCGCCG AGAACCATAA AAAAATTGCT
GAAGCTGTTC AAGAAATGTT GAAAACGAAC CTTGGCGTGA AAGTGAAACT GGCCAACCAA
GAATGGAAAA CATACTTGGA AACGACACAG CAATCCAATT TCCAAATGGC CCGCATGGGT
TGGATCGGTG TGTTTGTTGA TCCGACAGTG ATTTTGGATT ACTACTTAGG CGACAGCCCG
AACAACCGCA CGAACTGGGT AAACAAGCGA TTTGATGATT TGATGGCGAA AGCGAAAGTG
GAACAAGATG ACCAAAAACG ATATGAACTC CTCCATGAAG CGGAAAAAGT ACTAATGACG
GATCTGCCGT TTATCCCTGT TTATTTCTAT TCGCAAAATT ATTTAACATC GCCGAATTTT
AAAGATATTG TCTATCCCGT CAACCGTTAT CCGGACGTGC GCTGGGCGAA AAAAGTAGCA
GAGTAG
 
Protein sequence
MKRSLLTVLV LLLFVSTALV ACSNETPSNE GSEAAKNIKQ EITLNAKTEP PSLDPAIASD 
TTSGWVIDHL FEGLYTKNQK GEPVLGAASD VKISEDGKTY TFTIREDAKW SDGDPVTAYD
FEYAWKRVLD PKTGSPFAFY MYYIKGAEEY NKGKGSADQV GIKALDDKTF QVELKAPLGY
FDKLLTMWTF YPVKKSLVES NPKWAADAKG YVSNGAYRLT EWKHNSEVVI EKNEHYWNKD
QINMQKVTWK MVNDATTYYQ MYKTGELDLI DTLPTDVIDQ EKNNKEFKIT PYFGTYMFML
NVDKPPFTNA KIRRAFAMAI DREAIVKNIT KSGEKPAYAF VPYGVNTPKG DFREVGGSYF
EENVKEAKQL LEEGMKEEGW TKLPEVTLMY NTAENHKKIA EAVQEMLKTN LGVKVKLANQ
EWKTYLETTQ QSNFQMARMG WIGVFVDPTV ILDYYLGDSP NNRTNWVNKR FDDLMAKAKV
EQDDQKRYEL LHEAEKVLMT DLPFIPVYFY SQNYLTSPNF KDIVYPVNRY PDVRWAKKVA
E
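
The relationship between the gene sequence and the protein sequence above can be reproduced with a short script. This is a minimal sketch, not the database's own pipeline: the helper names are hypothetical, the codon table is built by hand rather than taken from a library, and it relies on translation table 11 assigning the same amino acids to codons as the standard code (the tables differ in which codons may act as start codons, which does not matter here because the gene begins with ATG). Only the first line of the gene sequence is used as input.

```python
# Codon table in the conventional TCAG ordering; "*" marks stop codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def join_blocks(text: str) -> str:
    """Remove the spaces and line breaks used to group residues in tens."""
    return "".join(text.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon.

    Raises KeyError for codons containing characters other than A, C, G, T.
    """
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of bases that are G or C."""
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence, copied from the record.
first_line = "ATGAAAAGAA GTCTTCTAAC TGTATTGGTG TTGTTGCTAT TCGTTTCAAC GGCGCTTGTC"
dna = join_blocks(first_line)
print(translate(dna))  # MKRSLLTVLVLLLFVSTALV
```

The printed residues match the first two blocks of the protein sequence. Passing the full gene sequence through `join_blocks` and `gc_percent` would allow the GC content field to be checked in the same way.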