Gene Shewana3_0152 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShewana3_0152 
Symbol 
ID4478289 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sp. ANA-3 
KingdomBacteria 
Replicon accessionNC_008577 
Strand
Start bp177108 
End bp178673 
Gene Length1566 bp 
Protein Length521 aa 
Translation table11 
GC content52% 
IMG OID639724672 
Producttype II secretion system protein E (GspE) 
Protein accessionYP_867802 
Protein GI117918610 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG2804] Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB 
TIGRFAM ID[TIGR02533] general secretory pathway protein E 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.203719 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGAAA TACAAGTAAC CCAAGTCGAG GATTTAAGCC TAGTTTCTAA CGAACTAGGG 
CTTGAAGCCG AAGGTGATGA GGTATTTCGC TCCAGCAGCA AAGAGCGTTT ACCCTTTGCC
TTTGCCCATC GCCATGAGGT GATTTTAGCC TATGGCGATG ATGGTGCCTT GAACCTGTTT
TATACCGCCA AAACCCCATT AGCAGCCATG CTCGAAGCGC GCCGCTACTC GGGCGCGGAT
TTGCCGCTGG TCTTGCTCGA GCCGGGCAAG TTTGAGGCGA AATTAACCCA GGCCTATCAA
GCCAACTCCT CCGAAGCGCA GCAGCTGATG GAAGATATTG GCAACGAGAT GGATTTATTC
ACCCTCGCCG AAGAGCTGCC GCAAACCGAA GATCTGCTCG AAGGCGATGA TGATGCGCCT
ATTATCAAGC TGATTAACGC CTTGTTATCT GAAGCGATTA AAGAAGAAGC CTCGGATATC
CATATCGAAA CCTACGAGAA GCAGTTGGTC GTGCGTTTCC GTATCGACGG TGTGCTGAAG
GAAGTGCTTA AACCGAATCG TAAGCTGTCA TCCCTTTTGG TGTCGCGGAT TAAGGTCATG
GCGCGTCTCG ATATCGCCGA GAAACGTGTA CCACAGGACG GCCGTATTTC GCTGCGGATT
GCGGGTCGTG CGGTGGACGT GCGGGTATCG ACCATGCCAT CGAGCCATGG CGAGCGTGTG
GTGCTGCGTC TGTTAGATAA AAACGCTGGC AATTTGGACT TAAAACAACT GGGGATGACC
GACAGTATCC GTGCCCAGTT TGACGAGATT ATCCGTCGTC CACACGGCAT TATTCTGGTG
ACAGGCCCTA CGGGTTCGGG TAAGAGTACC ACGCTGTACG CGGGTCTTAC CGAGATTAAC
TCTAAAGATA CCAATATCTT AACCGTTGAA GACCCGATCG AATACGAGTT AGAAGGTATA
GGTCAAACTC AAGTGAACAC TAAGGCGGAC ATGACCTTCG CCCGTGGTCT GCGTGCGATT
CTGCGTCAAG ACCCGGATGT GGTGATGATC GGGGAAATCC GTGACCTAGA AACCGCCCAA
ATTGCGGTGC AGGCATCTTT GACTGGTCAC TTAGTGATTT CAACCCTGCA TACCAACACG
GCCTCTGGCG CGATTACCCG TTTGCAGGAT ATGGGCGTAG AGCCTTTCTT AGTCTCATCG
AGTTTGCTCG GGGTACTAGC CCAGCGCTTA ATTCGTACTC TCTGTCCAAA ATGTAAGACA
GAGCATGTGC CCGATGAGCG TGAGCGTGAG CTGCTGGGAA TGGCAGCAGA CGATCATCGC
CATATTTACC GCGCCAATGG CTGTAAGGCC TGTGGTAGCA GTGGCTACCG TGGTCGTACG
GGTATCCATG AGCTGTTGCT GGTCGATGAT AATGTGCGCG AGCTGATCCA CGGTGGACGT
GGCGAACTGG CGATTGAGAA ATATATCCGC CAGTCGGTGC CAAGCATTCG CCACGATGGC
ATGAGCAAGG TGCTCTCAGG TATCACCACT CTAGAAGAAG TCCTGAGGGT GACCCGCGAG
GAGTAA
 
Protein sequence
MSEIQVTQVE DLSLVSNELG LEAEGDEVFR SSSKERLPFA FAHRHEVILA YGDDGALNLF 
YTAKTPLAAM LEARRYSGAD LPLVLLEPGK FEAKLTQAYQ ANSSEAQQLM EDIGNEMDLF
TLAEELPQTE DLLEGDDDAP IIKLINALLS EAIKEEASDI HIETYEKQLV VRFRIDGVLK
EVLKPNRKLS SLLVSRIKVM ARLDIAEKRV PQDGRISLRI AGRAVDVRVS TMPSSHGERV
VLRLLDKNAG NLDLKQLGMT DSIRAQFDEI IRRPHGIILV TGPTGSGKST TLYAGLTEIN
SKDTNILTVE DPIEYELEGI GQTQVNTKAD MTFARGLRAI LRQDPDVVMI GEIRDLETAQ
IAVQASLTGH LVISTLHTNT ASGAITRLQD MGVEPFLVSS SLLGVLAQRL IRTLCPKCKT
EHVPDERERE LLGMAADDHR HIYRANGCKA CGSSGYRGRT GIHELLLVDD NVRELIHGGR
GELAIEKYIR QSVPSIRHDG MSKVLSGITT LEEVLRVTRE E