Gene Shewmr4_0157 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShewmr4_0157 
Symbol 
ID4250879 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sp. MR-4 
KingdomBacteria 
Replicon accessionNC_008321 
Strand
Start bp179790 
End bp181355 
Gene Length1566 bp 
Protein Length521 aa 
Translation table11 
GC content52% 
IMG OID638116700 
Producttype II secretion system protein E (GspE) 
Protein accessionYP_732295 
Protein GI113968502 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG2804] Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB 
TIGRFAM ID[TIGR02533] general secretory pathway protein E 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGAAA TACAAGTAAC CCAAGTCGAG GATTTAAGCC TAGTTTCTAA CGAACTAGGG 
CTTGAAGCCG AAGGTGATGA GGTATTTCGC TCCAGCAGCA AAGAGCGTTT ACCCTTTGCC
TTTGCCCACC GCCATGAGGT GATTTTAGCC TTTGGCGAGG ATGGTGCCTT GAGCCTGTTT
TATACCGCCA AAACCCCGTT AACGGCCATG CTCGAAGCGC GCCGTTACTC GGGTGTGGAT
TTGCCGCTGG TATTGCTCGA ACCGGGCAAG TTTGAGGCGA AATTAACTCA GGCCTATCAA
GCCAACTCCT CCGAAGCGCA GCAGCTGATG GAAGATATTG GCAACGAGAT GGATTTATTC
ACCCTCGCCG AAGAGCTGCC GCAGACCGAA GATCTGCTCG AAGGCGATGA TGATGCGCCT
ATTATCAAGC TGATTAACGC CTTGTTATCT GAAGCGATTA AAGAAGAAGC CTCGGATATC
CATATCGAAA CCTACGAGAA GCAGTTGGTC GTGCGTTTCC GTATCGACGG TGTGCTGAAG
GAAGTGCTTA AGCCGAATCG TAAGTTGTCA TCCTTGTTGG TGTCGCGGAT CAAGGTGATG
GCGCGTCTCG ATATCGCCGA GAAACGTGTG CCACAAGACG GCCGTATTTC GCTGCGGATT
GCGGGTCGTG CGGTGGATGT GCGGGTATCG ACCATGCCAT CGAGCCATGG CGAGCGTGTG
GTGCTGCGTC TGTTAGATAA AAACGCCGGT AATCTGGATT TAAAACAATT GGGTATGACC
GACAGTATCC GTGCCAAGTT CGACGAGCTT ATTCGTCGTC CCCACGGCAT TATTCTGGTC
ACAGGCCCTA CGGGTTCCGG TAAGAGTACT ACGCTGTACG CCGGGCTTAC CGAAATTAAC
TCCAAAGATA CCAACATCTT AACCGTTGAA GATCCGATCG AATACGAGTT AGAAGGCATA
GGTCAAACCC AAGTGAACAC TAAGGCTGAC ATGACCTTCG CCCGTGGTCT GCGTGCGATT
CTGCGTCAGG ACCCGGATGT GGTGATGATC GGCGAAATCC GTGACCTAGA AACCGCCCAA
ATTGCGGTGC AGGCATCCTT GACCGGTCAC TTAGTGATTT CAACCCTGCA TACCAACACG
GCTTCTGGTG CAATTACCCG TTTGCAAGAC ATGGGCGTAG AGCCATTCCT AGTCTCATCG
AGTTTGCTTG GGGTATTAGC CCAGCGTTTA ATTCGTACTC TCTGTCCAAA ATGTAAGACA
GAGCATGTGC CCGATGCCCG TGAGCGTGAG CTGCTGGGAA TGGCGGCAGA CGATCATCGT
CATATTTATC GCGCCAATGG CTGTAAGGCC TGTGGTAGCA GTGGCTACCG TGGTCGTACG
GGTATCCATG AGCTATTGCT GGTCGATGAT AATGTGCGCG AGCTGATCCA CGGTGGACGT
GGCGAACTGG CGATTGAGAA ATATATCCGC CAGTCGGTGC CAAGCATTCG CCACGATGGC
ATGAGCAAGG TGCTCTCAGG TATCACCACT CTGGAAGAAG TCCTGAGGGT GACCCGCGAG
GAGTAA
 
Protein sequence
MSEIQVTQVE DLSLVSNELG LEAEGDEVFR SSSKERLPFA FAHRHEVILA FGEDGALSLF 
YTAKTPLTAM LEARRYSGVD LPLVLLEPGK FEAKLTQAYQ ANSSEAQQLM EDIGNEMDLF
TLAEELPQTE DLLEGDDDAP IIKLINALLS EAIKEEASDI HIETYEKQLV VRFRIDGVLK
EVLKPNRKLS SLLVSRIKVM ARLDIAEKRV PQDGRISLRI AGRAVDVRVS TMPSSHGERV
VLRLLDKNAG NLDLKQLGMT DSIRAKFDEL IRRPHGIILV TGPTGSGKST TLYAGLTEIN
SKDTNILTVE DPIEYELEGI GQTQVNTKAD MTFARGLRAI LRQDPDVVMI GEIRDLETAQ
IAVQASLTGH LVISTLHTNT ASGAITRLQD MGVEPFLVSS SLLGVLAQRL IRTLCPKCKT
EHVPDARERE LLGMAADDHR HIYRANGCKA CGSSGYRGRT GIHELLLVDD NVRELIHGGR
GELAIEKYIR QSVPSIRHDG MSKVLSGITT LEEVLRVTRE E