Gene Information |
Locus tag | Shewmr4_0157 |
Symbol | |
ID | 4250879 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella sp. MR-4 |
Kingdom | Bacteria |
Replicon accession | NC_008321 |
Strand | + |
Start bp | 179790 |
End bp | 181355 |
Gene Length | 1566 bp |
Protein Length | 521 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 638116700 |
Product | type II secretion system protein E (GspE) |
Protein accession | YP_732295 |
Protein GI | 113968502 |
COG category | [N] Cell motility; [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG2804] Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB |
TIGRFAM ID | [TIGR02533] general secretory pathway protein E |
|
|
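The coordinate and length fields in the table above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based inclusive coordinates and that the CDS ends in a single stop codon; the function names are hypothetical and chosen for illustration only.

```python
# Values copied from the Gene Information table.
start_bp = 179790
end_bp = 181355
gene_length_bp = 1566
protein_length_aa = 521


def span_length(start, end):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = expected_protein_length(computed_length)

print(computed_length, computed_protein)  # prints: 1566 521
```

Both computed values agree with the Gene Length and Protein Length fields listed in the record.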
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTGAAA TACAAGTAAC CCAAGTCGAG GATTTAAGCC TAGTTTCTAA CGAACTAGGG CTTGAAGCCG AAGGTGATGA GGTATTTCGC TCCAGCAGCA AAGAGCGTTT ACCCTTTGCC TTTGCCCACC GCCATGAGGT GATTTTAGCC TTTGGCGAGG ATGGTGCCTT GAGCCTGTTT TATACCGCCA AAACCCCGTT AACGGCCATG CTCGAAGCGC GCCGTTACTC GGGTGTGGAT TTGCCGCTGG TATTGCTCGA ACCGGGCAAG TTTGAGGCGA AATTAACTCA GGCCTATCAA GCCAACTCCT CCGAAGCGCA GCAGCTGATG GAAGATATTG GCAACGAGAT GGATTTATTC ACCCTCGCCG AAGAGCTGCC GCAGACCGAA GATCTGCTCG AAGGCGATGA TGATGCGCCT ATTATCAAGC TGATTAACGC CTTGTTATCT GAAGCGATTA AAGAAGAAGC CTCGGATATC CATATCGAAA CCTACGAGAA GCAGTTGGTC GTGCGTTTCC GTATCGACGG TGTGCTGAAG GAAGTGCTTA AGCCGAATCG TAAGTTGTCA TCCTTGTTGG TGTCGCGGAT CAAGGTGATG GCGCGTCTCG ATATCGCCGA GAAACGTGTG CCACAAGACG GCCGTATTTC GCTGCGGATT GCGGGTCGTG CGGTGGATGT GCGGGTATCG ACCATGCCAT CGAGCCATGG CGAGCGTGTG GTGCTGCGTC TGTTAGATAA AAACGCCGGT AATCTGGATT TAAAACAATT GGGTATGACC GACAGTATCC GTGCCAAGTT CGACGAGCTT ATTCGTCGTC CCCACGGCAT TATTCTGGTC ACAGGCCCTA CGGGTTCCGG TAAGAGTACT ACGCTGTACG CCGGGCTTAC CGAAATTAAC TCCAAAGATA CCAACATCTT AACCGTTGAA GATCCGATCG AATACGAGTT AGAAGGCATA GGTCAAACCC AAGTGAACAC TAAGGCTGAC ATGACCTTCG CCCGTGGTCT GCGTGCGATT CTGCGTCAGG ACCCGGATGT GGTGATGATC GGCGAAATCC GTGACCTAGA AACCGCCCAA ATTGCGGTGC AGGCATCCTT GACCGGTCAC TTAGTGATTT CAACCCTGCA TACCAACACG GCTTCTGGTG CAATTACCCG TTTGCAAGAC ATGGGCGTAG AGCCATTCCT AGTCTCATCG AGTTTGCTTG GGGTATTAGC CCAGCGTTTA ATTCGTACTC TCTGTCCAAA ATGTAAGACA GAGCATGTGC CCGATGCCCG TGAGCGTGAG CTGCTGGGAA TGGCGGCAGA CGATCATCGT CATATTTATC GCGCCAATGG CTGTAAGGCC TGTGGTAGCA GTGGCTACCG TGGTCGTACG GGTATCCATG AGCTATTGCT GGTCGATGAT AATGTGCGCG AGCTGATCCA CGGTGGACGT GGCGAACTGG CGATTGAGAA ATATATCCGC CAGTCGGTGC CAAGCATTCG CCACGATGGC ATGAGCAAGG TGCTCTCAGG TATCACCACT CTGGAAGAAG TCCTGAGGGT GACCCGCGAG GAGTAA
|
Protein sequence | MSEIQVTQVE DLSLVSNELG LEAEGDEVFR SSSKERLPFA FAHRHEVILA FGEDGALSLF YTAKTPLTAM LEARRYSGVD LPLVLLEPGK FEAKLTQAYQ ANSSEAQQLM EDIGNEMDLF TLAEELPQTE DLLEGDDDAP IIKLINALLS EAIKEEASDI HIETYEKQLV VRFRIDGVLK EVLKPNRKLS SLLVSRIKVM ARLDIAEKRV PQDGRISLRI AGRAVDVRVS TMPSSHGERV VLRLLDKNAG NLDLKQLGMT DSIRAKFDEL IRRPHGIILV TGPTGSGKST TLYAGLTEIN SKDTNILTVE DPIEYELEGI GQTQVNTKAD MTFARGLRAI LRQDPDVVMI GEIRDLETAQ IAVQASLTGH LVISTLHTNT ASGAITRLQD MGVEPFLVSS SLLGVLAQRL IRTLCPKCKT EHVPDARERE LLGMAADDHR HIYRANGCKA CGSSGYRGRT GIHELLLVDD NVRELIHGGR GELAIEKYIR QSVPSIRHDG MSKVLSGITT LEEVLRVTRE E
|
| |
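The gene and protein sequences above are printed in space-separated blocks of ten. The sketch below shows one way to strip that spacing, compute a GC fraction, and translate the nucleotide sequence. It is an illustration, not the database's own pipeline: the helper names are hypothetical, and the codon table is written out by hand. Translation table 11 assigns the same amino acids to codons as the standard code (it differs in the permitted start codons), so the standard assignments are used here. Only the first three blocks of each sequence are included to keep the example short.

```python
import itertools

# Codon-to-amino-acid assignments in the conventional T, C, A, G ordering.
# Translation table 11 uses the same assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(itertools.product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the block spacing used in the record and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence and first block of the protein sequence.
gene_prefix = "ATGAGTGAAA TACAAGTAAC CCAAGTCGAG"
protein_prefix = "MSEIQVTQVE"

print(translate(gene_prefix) == protein_prefix)  # prints: True
```

Pasting the full gene sequence into `gene_prefix` lets the result of `translate` be compared with the listed protein sequence, and `gc_fraction` with the GC content field.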