Gene GYMC61_0888 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGYMC61_0888 
Symbol 
ID8524711 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. Y412MC61 
KingdomBacteria 
Replicon accessionNC_013411 
Strand
Start bp890077 
End bp891738 
Gene Length1662 bp 
Protein Length553 aa 
Translation table11 
GC content54% 
IMG OID 
Producttype II secretion system protein E 
Protein accessionYP_003252037 
Protein GI261418355 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCAAAA AGCAGGAACG GAAACGGCTT GGTGATTTGC TTGTCGAAGC GGGATTGATC 
ACCGAAAGTC AGCTGGCCGA GGCGCTGCGC GAGAAGGCGC CCGGGCAAAA GCTCGGCGAC
GCCTTGTTGC AGCGCGGCTA CATCACGGAA CAGCAGCTGA TTGAGGCGCT CGAGTTTCAG
CTCGGCATTC CGCACGTCAG TTTGTACCGG TACCCGATCG ATCCGAAAGC GACAAACCTC
GTGCCGAAAG AGTTTGCCCG CCGCCATATG GTCATGCCGC TCAAGGTGGA AGGCGATCGG
CTGCTCGTTG CCATGGCCGA TCCGATGGAT TTTTTTGCCA TCGACGATTT GCGCCTGTCG
ACCGGGTTTC AAATTGAAAC GGCGATCGCT TCAAAAGACG ATATTTTGCG GGCGATCAAT
AAATATTACG ACATCGACGA AGCATTTGAG GATTTTTTGC AAACGCCGCC TGAGGTGCGC
GACGACGAGC GGGCTGCTGA GGACGATTCT CCCATCGTTC GTTTGGTGAA CCAAATTTTG
CAGCTTGCTG TTGAACAGCG GGCGAGCGAC ATTCATATCG ATCCACAGGA GACGAAAGTG
CTCATCCGCT ATCGGATCGA CGGCCTGCTT CGCACCGAGC GCGCGCTGCC GAAACATATG
CAAAGCATGT TGACGGCGAG AATTAAAATT TTGGCCAATA TGGACATCAC CGAACACCGC
GTGCCGCAAG ACGGACGGAT CAAAATGGAC ATCGATTTTC ATCCGGTCGA TTTGCGCGTT
TCAACGCTGC CGACCGTATA CGGTGAGAAA ATCGTCATGC GCGTCCTCGA CTTGGGCGCA
GCTTTAAACG ATATTCATAA ACTTGGCTTC AATCCGGTTA ATTTAGATCG ATTCATCCGC
TTGATCGAGC GGCCGAACGG CATCGTCTTG ATCACCGGAC CGACCGGTTC GGGGAAATCG
TCGACGCTCT ATGCGGCGCT CAACCATTTA AACAGCGAGC ACGTGAACAT TATTACGATT
GAAGATCCAG TCGAATATCA GATCGAGGGC GTCAACCAAA TTCAAGTCAA CCCGAATGTC
GGCTTGACGT TCGCCCAAGG GCTCCGCTCG ATTTTGCGTC AAGACCCGAA CATCATCATG
GTTGGAGAGA TTCGCGACCG CGAGACGGCG GAAGTGGCGA TCCGCGCTTC GCTCACCGGT
CATTTAGTGT TGAGCACGCT CCATACGAAC GATGCATTAA GCACGATCAC CCGCCTGATC
GATATGGGGA TTGAGCCGTT TTTAGTGGCC ACATCGCTCG CCGGCGTTGT CTCGCAGCGG
CTTGTGCGCC GCGTCTGCCG CGACTGCCAA GAGGTGTATG AGCCGACGAA GCGGGAGCTG
GACATTTTCG CCCGCCGCGG CATCGAGGTT CATCAACTTG TCCGCGGCCG CGGCTGCCCG
ATGTGCAACA TGACCGGTTA CCGCGGACGG CTGGCGATTC ACGAGTTGCT TGTTGTCACC
GATGAGATGC GGCGCGTCAT TTTAAACAAC GAGCCGTTTT CGAAATTGCG CGAGCTTGCC
CTGCAAAACA AAATGATTTT TTTGCTGGAT GATGGGCTGT TGAAAGTGAA GCAAGGGTTG
ACCACGCTTG AAGAAGTGCT GAAAGTGGCC ATTTTGCATT GA
 
Protein sequence
MSKKQERKRL GDLLVEAGLI TESQLAEALR EKAPGQKLGD ALLQRGYITE QQLIEALEFQ 
LGIPHVSLYR YPIDPKATNL VPKEFARRHM VMPLKVEGDR LLVAMADPMD FFAIDDLRLS
TGFQIETAIA SKDDILRAIN KYYDIDEAFE DFLQTPPEVR DDERAAEDDS PIVRLVNQIL
QLAVEQRASD IHIDPQETKV LIRYRIDGLL RTERALPKHM QSMLTARIKI LANMDITEHR
VPQDGRIKMD IDFHPVDLRV STLPTVYGEK IVMRVLDLGA ALNDIHKLGF NPVNLDRFIR
LIERPNGIVL ITGPTGSGKS STLYAALNHL NSEHVNIITI EDPVEYQIEG VNQIQVNPNV
GLTFAQGLRS ILRQDPNIIM VGEIRDRETA EVAIRASLTG HLVLSTLHTN DALSTITRLI
DMGIEPFLVA TSLAGVVSQR LVRRVCRDCQ EVYEPTKREL DIFARRGIEV HQLVRGRGCP
MCNMTGYRGR LAIHELLVVT DEMRRVILNN EPFSKLRELA LQNKMIFLLD DGLLKVKQGL
TTLEEVLKVA ILH