Gene EcSMS35_0150 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0150 
SymbolhtrE 
ID6143082 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp163230 
End bp165830 
Gene Length2601 bp 
Protein Length866 aa 
Translation table11 
GC content44% 
IMG OID641615051 
Productputative outer membrane usher protein 
Protein accessionYP_001742267 
Protein GI170679612 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3188] P pilus assembly protein, porin PapC 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0557898 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones59 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCCCGAA AAATCATAAA AACGCATAAT AACAAACTTA CATATATAGC CTCGTTTTGC 
GCATTGCTAT TAAGCCCCTG TGCCATAAGC GCTGAACACG TTGAATATGA TAATACCTTT
TTGATGGGGC AAGATGCTTT TAACATCGAC CTTAGTCGGT ATACCGAAGG TAACCCTACC
TTGCCCGGCG TGTATGACGT CAGTGTTTAT ATCAATGATC AACCGGTTAT CAACCAAAGC
ATTTCCTTTA TTACCCTCGA AGGTAAGAAG AATGCGCAGG CTTGTATCAC CCTGAAGAAT
TTATTGCAGT TTCACATTAA TAAACCAGAT ATAAATGCCG AGAATTCCAT CCTTCTACAG
CGCGAAGGCG AACTCGGAGA TTGTCTTGAT TTAGCAAATA TCATTCCTCA GGCCTCGGTT
CATTATGACG TCAACGATCA ACGACTGGAC ATCAATGTTC CTCAAGCCTG GGTAATGAAG
AATTACCAAA ACTATGTTGA CTCTTCACTG TGGGAAAACG GTATTAACGC CGCAATGCTA
GCCTACAACG TCAACGCATA TCACAGCGAA ATTCCGGACA GAAAAAACGA CAGTGTTTAC
GCCGCATTTA ACGGCGGTAT AAATCTGGGG GCATGGCGAC TTCGCGCAAC CGGCAACTAC
AACTGGATGA CCAATGTAGG CAGTGATTAC GATTTTCAGA ATCGCTATTT GCAGCGCGAC
CTGGCCTCTT TGCGTTCACA GTTAATAGTG GGTGAGTCAT ACACAACTGG GGAAACCTTT
GATGCTGTCA GTATTCGCGG TATTCGTTTA TACAGCGACA GCCGAATGCT ACCGCCTGCG
TTAGCTAGTT TTGCGCCTAT TATTCATGGT GTCGCTAACA CCAACGCAAA AGTCACCATT
ACCCAGGGCG GGTATAAAAT ATATGAAACT ACTGTACCGC CGGGAGCATT TGTTATTGAT
GATTTAAGCC CATCAGGCTA CGGCAGCGAT CTCATTATCA CAATCGAAGA ATCTGATGGC
ATAAAACGAA CTTTTTCCCA GCCATTTTCT TCGGTAATTC AGATGCAACG CCCTGGTGTT
GGAAGATGGG ACATCAGTGC TGGTCAGGTA TTAAAAGACG ATATTCAAAA TGAGCCAAAT
TTGTTCCAGG CCAGCTACTA CTATGGTCTG AACAACTATC TTACAGGTTA TACCGGTATT
CAAATTACCG ATAATAACTA TACTGCCGGG CTGTTGGGCC TTGGCCTGAA TACCGCGTAC
GGTGCGTTTT CAGTCGATGT AACCCACTCG GATGTGCAAA TTCCGGACGA TAAAACCTAC
CGCGGGCAAA GTTATCGCAT CTCCTGGAAT AAGTTATTTG AAGATACCAG AACCTCGCTC
AATATCGCCG CTTACCGTTA TTCAACCCAG AATTATCTGG GGCTTAACGA TGCACTGACA
TTGATCGATG AAGTAAAACA TCCGGAACAG GACCTGGAAC CTAAAAATAT GCGTAACTAT
TCACGCATGA AAAATCAGGT TACGGTTAGT ATTAACCAAC CCCTCAAGTT TGAAAAGAAA
GACTATGGCT CATTCTATCT TGCCGGAAGT TGGTCTGACT ATTGGGCGGA CGGACAAAAT
AATACTAACT ACTCCATTGG TTATAGTAAC AGCGCATCGT GGGGAAGCTA CAGCATTAGT
GCCCAACGGT CCTGGAGCCA GGACGGCGGT AATGAAGACA GCATCTATCT AAGCTTTAGC
ATTCCTATCG AAAAGTTACT GGGTTCAGAA CATCGCGACT CTGGTTTTCA GAGCATTGAT
ACTCAGTTAA ATAGTGATTT CAACGGCAGC AATCAACTCA GTATCAGCAG TAGTGGTTAT
AGCACCGAAA ACCATATCAG TTATAGCGTC AATACAGGCT ATTCGATGAT GAAATCCAGT
GATGATTTGG GCTATATCGG CGGATATGCA AGCTATGAAT CTCCCTGGGG AACTCTTTCC
AGTTCAGTTT CCGCAAGCAG TGATAATAGC CGTCAAATCT CATTTAATAC CGACGGCGGA
TTTGTTTTAC ATAGCGGCGG CCTGACCTTC AGTAATGACA GCTTTAGCGA CTCAGACACT
CTGGCCGTTG TACAAGCGCC AGGAGCTAAA GGGGCGCGTA TTAACTATGG TAATAGTACG
ATAGATCGTT GGGGATATGG TGTTACTAAC GCGCTCTCCC CTTATCATGA AAACCGAATT
GCACTCGATA TTAACGGTCT GGAAAATGAC GTTGAATTGA AAAGTACCAG TGCTATTACT
GTCCCCCGCC AAGGGTCTGT TGTCTTTGCT GGTTTTGAAA CAGTTCAGGG ACAATCAGCG
ATTATGAACA TCAAGCGAAC TGACGGTAAA AACATTCCGT TTGCCGCAGA TATTTATGAC
GAGAACGGAA ATATTATTGG CAATGTAGGG CAAGGTGGTC AGGCCTTTGT TCGAGGTATA
GAACAACAAG GTAACATACG TATCAACTGG CTTGACGACG GTAAACCTGT CACTTGCCTT
GCTCATTACC AGCAGAGCGC AGCACCAGAA AAAATAGCGC AAACTATTAT TCTGAATGGA
ATTAGTTGTC AGATTCAGTA A
 
Protein sequence
MSRKIIKTHN NKLTYIASFC ALLLSPCAIS AEHVEYDNTF LMGQDAFNID LSRYTEGNPT 
LPGVYDVSVY INDQPVINQS ISFITLEGKK NAQACITLKN LLQFHINKPD INAENSILLQ
REGELGDCLD LANIIPQASV HYDVNDQRLD INVPQAWVMK NYQNYVDSSL WENGINAAML
AYNVNAYHSE IPDRKNDSVY AAFNGGINLG AWRLRATGNY NWMTNVGSDY DFQNRYLQRD
LASLRSQLIV GESYTTGETF DAVSIRGIRL YSDSRMLPPA LASFAPIIHG VANTNAKVTI
TQGGYKIYET TVPPGAFVID DLSPSGYGSD LIITIEESDG IKRTFSQPFS SVIQMQRPGV
GRWDISAGQV LKDDIQNEPN LFQASYYYGL NNYLTGYTGI QITDNNYTAG LLGLGLNTAY
GAFSVDVTHS DVQIPDDKTY RGQSYRISWN KLFEDTRTSL NIAAYRYSTQ NYLGLNDALT
LIDEVKHPEQ DLEPKNMRNY SRMKNQVTVS INQPLKFEKK DYGSFYLAGS WSDYWADGQN
NTNYSIGYSN SASWGSYSIS AQRSWSQDGG NEDSIYLSFS IPIEKLLGSE HRDSGFQSID
TQLNSDFNGS NQLSISSSGY STENHISYSV NTGYSMMKSS DDLGYIGGYA SYESPWGTLS
SSVSASSDNS RQISFNTDGG FVLHSGGLTF SNDSFSDSDT LAVVQAPGAK GARINYGNST
IDRWGYGVTN ALSPYHENRI ALDINGLEND VELKSTSAIT VPRQGSVVFA GFETVQGQSA
IMNIKRTDGK NIPFAADIYD ENGNIIGNVG QGGQAFVRGI EQQGNIRINW LDDGKPVTCL
AHYQQSAAPE KIAQTIILNG ISCQIQ