Gene EcSMS35_3092 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3092 
Symbol 
ID6145936 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3180993 
End bp3182018 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content55% 
IMG OID641617960 
Producttwitching motility family protein 
Protein accessionYP_001745111 
Protein GI170680393 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG2805] Tfp pilus assembly protein, pilus retraction ATPase PilT 
TIGRFAM ID[TIGR01420] pilus retraction protein PilT 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.0000184946 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGCCAGGTT ATGCGCAATA TCGTTCATTT TCCGAGGATC TTAGTATGAA TATGGAAGAA 
ATTGTGGCCC TTAGTGTAAA GCATAACGTC TCGGATCTAC ACCTGTGCAG CGCCTGGCCC
GCACGATGGC GCATTCGCGG CAGAATGGAA GCTGCGCCGT TTGACGCGCC GGACGTCGAA
GAGCTACTGC GGGAGTGGCT GGATGACGAT CAGCGGGCAA TATTGCTGGA GAATGGTCAG
CTGGATTTTG CTGTGTCGCT GGCGGAAAAC CAGCGGTTGC GTGGCAGTGC GTTCGCGCAA
CGGCAAGGTA TTTCTCTGGC ATTACGGTTG TTACCTTCGC ACTGTCCACA GCTCGAACAG
CTTGGTGCGC CACCGGTATT GCCGGAATTA CTCAAGAGCG AGAATGGCCT GATTCTGGTG
ACGGGGGCGA CGGGGAGCGG CAAATCTACC ACGCTGGCGG CGATGGTTGG CTATCTCAAT
CAACATGCCG ATGCGCATAT TCTGACGCTG GAAGATCCTG TTGAATATCT CTATGCCAGC
CAGCGATGTT TGATCCAGCA GCGGGAAATT GGTTTGCACT GTATGACGTT CGCATCGGGA
TTGCGGGCCG CATTGCGGGA AGATCCCGAT GTGATTTTGC TCGGAGAGCT GCGTGACAGC
GAGACAATCC GTCTGGCGCT GACGGCAGCA GAAACCGGAC ACCTGGTGCT GGCAACTTTA
CATACGCGTG GTGCGGCGCA GGCAGTTGAG CGGCTGGTGG ATTCATTTCC GGCGCAGGAA
AAAGATCCCG TGCGTAATCA ACTGGCAGGT AGTTTACGGG CGGTGTTGTC ACAAAAGCTG
GAAGTGGATA AACAGGAAGG ACGCGTGGCG CTGTTTGAAT TGCTGGTTAA CACACCCGCG
GTGGGGAATT TGATTCGCGA AGGGAAAACC CACCAGTTAC TGCATGTTAT TCAAACCGGG
CAGCAGGTGG GGATGTTAAC GTTTCAGCAG AGTTATCAGC AGCGGGTGGG GGAAGGACGT
TTGTGA
 
Protein sequence
MPGYAQYRSF SEDLSMNMEE IVALSVKHNV SDLHLCSAWP ARWRIRGRME AAPFDAPDVE 
ELLREWLDDD QRAILLENGQ LDFAVSLAEN QRLRGSAFAQ RQGISLALRL LPSHCPQLEQ
LGAPPVLPEL LKSENGLILV TGATGSGKST TLAAMVGYLN QHADAHILTL EDPVEYLYAS
QRCLIQQREI GLHCMTFASG LRAALREDPD VILLGELRDS ETIRLALTAA ETGHLVLATL
HTRGAAQAVE RLVDSFPAQE KDPVRNQLAG SLRAVLSQKL EVDKQEGRVA LFELLVNTPA
VGNLIREGKT HQLLHVIQTG QQVGMLTFQQ SYQQRVGEGR L