Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GYMC61_0888 |
Symbol | |
ID | 8524711 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. Y412MC61 |
Kingdom | Bacteria |
Replicon accession | NC_013411 |
Strand | + |
Start bp | 890077 |
End bp | 891738 |
Gene Length | 1662 bp |
Protein Length | 553 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | |
Product | type II secretion system protein E |
Protein accession | YP_003252037 |
Protein GI | 261418355 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCAAAA AGCAGGAACG GAAACGGCTT GGTGATTTGC TTGTCGAAGC GGGATTGATC ACCGAAAGTC AGCTGGCCGA GGCGCTGCGC GAGAAGGCGC CCGGGCAAAA GCTCGGCGAC GCCTTGTTGC AGCGCGGCTA CATCACGGAA CAGCAGCTGA TTGAGGCGCT CGAGTTTCAG CTCGGCATTC CGCACGTCAG TTTGTACCGG TACCCGATCG ATCCGAAAGC GACAAACCTC GTGCCGAAAG AGTTTGCCCG CCGCCATATG GTCATGCCGC TCAAGGTGGA AGGCGATCGG CTGCTCGTTG CCATGGCCGA TCCGATGGAT TTTTTTGCCA TCGACGATTT GCGCCTGTCG ACCGGGTTTC AAATTGAAAC GGCGATCGCT TCAAAAGACG ATATTTTGCG GGCGATCAAT AAATATTACG ACATCGACGA AGCATTTGAG GATTTTTTGC AAACGCCGCC TGAGGTGCGC GACGACGAGC GGGCTGCTGA GGACGATTCT CCCATCGTTC GTTTGGTGAA CCAAATTTTG CAGCTTGCTG TTGAACAGCG GGCGAGCGAC ATTCATATCG ATCCACAGGA GACGAAAGTG CTCATCCGCT ATCGGATCGA CGGCCTGCTT CGCACCGAGC GCGCGCTGCC GAAACATATG CAAAGCATGT TGACGGCGAG AATTAAAATT TTGGCCAATA TGGACATCAC CGAACACCGC GTGCCGCAAG ACGGACGGAT CAAAATGGAC ATCGATTTTC ATCCGGTCGA TTTGCGCGTT TCAACGCTGC CGACCGTATA CGGTGAGAAA ATCGTCATGC GCGTCCTCGA CTTGGGCGCA GCTTTAAACG ATATTCATAA ACTTGGCTTC AATCCGGTTA ATTTAGATCG ATTCATCCGC TTGATCGAGC GGCCGAACGG CATCGTCTTG ATCACCGGAC CGACCGGTTC GGGGAAATCG TCGACGCTCT ATGCGGCGCT CAACCATTTA AACAGCGAGC ACGTGAACAT TATTACGATT GAAGATCCAG TCGAATATCA GATCGAGGGC GTCAACCAAA TTCAAGTCAA CCCGAATGTC GGCTTGACGT TCGCCCAAGG GCTCCGCTCG ATTTTGCGTC AAGACCCGAA CATCATCATG GTTGGAGAGA TTCGCGACCG CGAGACGGCG GAAGTGGCGA TCCGCGCTTC GCTCACCGGT CATTTAGTGT TGAGCACGCT CCATACGAAC GATGCATTAA GCACGATCAC CCGCCTGATC GATATGGGGA TTGAGCCGTT TTTAGTGGCC ACATCGCTCG CCGGCGTTGT CTCGCAGCGG CTTGTGCGCC GCGTCTGCCG CGACTGCCAA GAGGTGTATG AGCCGACGAA GCGGGAGCTG GACATTTTCG CCCGCCGCGG CATCGAGGTT CATCAACTTG TCCGCGGCCG CGGCTGCCCG ATGTGCAACA TGACCGGTTA CCGCGGACGG CTGGCGATTC ACGAGTTGCT TGTTGTCACC GATGAGATGC GGCGCGTCAT TTTAAACAAC GAGCCGTTTT CGAAATTGCG CGAGCTTGCC CTGCAAAACA AAATGATTTT TTTGCTGGAT GATGGGCTGT TGAAAGTGAA GCAAGGGTTG ACCACGCTTG AAGAAGTGCT GAAAGTGGCC ATTTTGCATT GA
|
Protein sequence | MSKKQERKRL GDLLVEAGLI TESQLAEALR EKAPGQKLGD ALLQRGYITE QQLIEALEFQ LGIPHVSLYR YPIDPKATNL VPKEFARRHM VMPLKVEGDR LLVAMADPMD FFAIDDLRLS TGFQIETAIA SKDDILRAIN KYYDIDEAFE DFLQTPPEVR DDERAAEDDS PIVRLVNQIL QLAVEQRASD IHIDPQETKV LIRYRIDGLL RTERALPKHM QSMLTARIKI LANMDITEHR VPQDGRIKMD IDFHPVDLRV STLPTVYGEK IVMRVLDLGA ALNDIHKLGF NPVNLDRFIR LIERPNGIVL ITGPTGSGKS STLYAALNHL NSEHVNIITI EDPVEYQIEG VNQIQVNPNV GLTFAQGLRS ILRQDPNIIM VGEIRDRETA EVAIRASLTG HLVLSTLHTN DALSTITRLI DMGIEPFLVA TSLAGVVSQR LVRRVCRDCQ EVYEPTKREL DIFARRGIEV HQLVRGRGCP MCNMTGYRGR LAIHELLVVT DEMRRVILNN EPFSKLRELA LQNKMIFLLD DGLLKVKQGL TTLEEVLKVA ILH
|
| |