Gene EcSMS35_3926 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3926 
Symbol 
ID6143802 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4000504 
End bp4002861 
Gene Length2358 bp 
Protein Length785 aa 
Translation table11 
GC content39% 
IMG OID641618752 
Productputative fimbrial usher protein FanD 
Protein accessionYP_001745891 
Protein GI170681449 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3188] P pilus assembly protein, porin PapC 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value0.540204 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAAG AAATATTTAT TGCCGCTATT ATTTTTCATT TACTATCTAA AGGTGCTCTT 
GCCGAAGAGT TCAACTACAG CTTTATTCGT GGAGGGAGTA AGGATATTCC TGATGTTTTA
AACAGCAATA AAGAAAACGT ACCGGGTAAA TATGTTGTTG ATGTTGTTTT CAATGGTTCT
AAAATCGCGT CATCTACTGA GATGAGCATC GCAAAAGAGG ATGCTGAGGG GATATGTCTT
TCTGATGAAT GGCTAACTGA AAACGGCATT ATAATTAATA AAGATTTTTA TAAAAATGTT
TATAATTCTG CACGCCAGTG CTATTTGCTG GGTAATGAGG CGAACAGTAA AGTCGCGTTT
GATCAATCGT TGCAAGAAGT ATCTATTGAT TTGCCACAGG CAGGCTTCCA GGACGCAGCA
AAAGATGGTG GTGTGTGGGA CTATGGTAGT AACGGTTTTA AAATAGCTTA TGACGTTAAT
ACTGCAAAAA ATAGCAACCA GGAAAGAACC ACCTACAGTA GTATTGATGG CCAGGTGAAT
CTGGGCGAAT GGGTGTTATT GGGTAGGGGG TATGCTTACC AGGGCGAAAA TTTCGATACT
AATAACCTGT TACTGACGCG GGCAATCAAA TCACTGAAAT CCGATTTGCA ACTGGGCAAG
ACACAGCTTT ATAACTCATT GAATAATGGT TTTACCTTTT ACGGTGCTCA GCTTAAATCG
AATCAGGATA TGTATCCGTG GAACTCTCGT GCCTATTCGC CGGTAATTAA TGGTATTGCG
CGAACCCATG CCAGGGTAAC CATTGAGCAG AGTGGTTATA CCTTAAAATC TATTGTCGTG
CCTCCGGGAC CATTTGTGAT CAACGATCTT AACGGCGTTT ACTCTGGCGA TTTGATCATG
AAAATCTATG AAGAGGATGG CTCCGTTCGT GAACAGCGTT TCCCTGTCGC AGTGTTGCCT
AATTTGTTAA GACCGGGAAC GTATAACTAC GCCCTCGCCA TGGGTAGCAA AGTTAACCAA
GATAATGGCG AACGGGATAA AGAAAGTCTG TTTGCCCAAA TGAGTTATGA CTATGGATTC
GAGCCTTTTA CACTCAATAG CTCTTTATTG CTGGATAAAA ATTATAACAA TATTGGTTTA
GGACTAATTC GCTCGTTTGG ATGGTTTGGA GCCATGTCTT TTAGTGGCAA CTTATCACAA
GCAAAATACC ATAATGGTAA GAATCTAAAA GGCTATAGCA CGTCATTAAA ATATGCAAAA
GCGCTTGGTG ATAATGCAAA TTTACAGCTT ATTGGTTACC GTTTTAATTC AGAGGATTAT
ATTGACTATG CTGATTTTAC TTATAATTCA TATAGTTTTA TAAGAAATAG GCCAAAGCAA
CGCTATGAGT CAATCGTTAC CTACCAGCTA CCAGAAAAAG GTATGTTTTT AAACTTTTCT
GCGTGGAAGG AAGATTACTG GGATAATTAT AACGAGGTCG GTGCTAACTT AAGTCTGACA
AAAAGTTTTG ATCAAATAAC AATGACACTG AACGGTGGTT ATTCCAGACT GCAAAATATG
GATGCTGACT ATAATGTTGG TTTGTCATTA AGTGTGCCGT TAAGTCTGTT TGATAAAACA
CATTATAGTT TTTCGAATGT TAACTATGAT CGACGTACGG GGACCAGTAT GAATACTGGG
ATCTCAGGAA TGATTAATCA AAGGCTGAGT TATAACGCAT CAGTTAACCA GACGCGTGAT
ACTATCGGTG GTACACTTTC TGCCTCCTAT TTATTCGACT GGATGCAGAC GTCAGCAACG
TATTCACAAA CTGGGAAAAA TTCATCTACT TCACTCCAGT TGGGTGGTAG TGTAATTGGT
GTGCCAGAAG GTGGCATAAT ATTTACTCCA GTTAAAAATG ATCAACTGGC TATCGTGCAA
ATGAAAGATG TCCCCGGTGT TATGTTTAAC GGTTCTTTAC CGGGAGATAA ATATGGTCGG
GCGGTAATCC CACTTACTGC TTATAACAAT AATACGATTT CTGTAAATGC AGAAAAATTA
CCGAAGAATA TTGAGCTTAC AGATAATGCG ATTAATGTGA CCCCAACGGG AAATGCTATC
ATTTATAAAA ATGTGAAGTT TAAGAAAATT AATACCTATG TGGTTAAACT TTATGGGAAA
AATGGTTATG TCGTTCCTAT GGGTAGCATC GCAAAAAATA CACAGGGTAA AGAAGTGGGT
TATGTAAATA ACGGTGGTAT TTTGCTGATG AACCTTGAAA CACACGATGA AGGTGTTATT
TCGCTTGACC AGTGTCAATT TAATACGCAG TCACTGAAGA AAAACTATGA CCAAACTCAG
GAGATTCACT GTGAGTAA
 
Protein sequence
MKKEIFIAAI IFHLLSKGAL AEEFNYSFIR GGSKDIPDVL NSNKENVPGK YVVDVVFNGS 
KIASSTEMSI AKEDAEGICL SDEWLTENGI IINKDFYKNV YNSARQCYLL GNEANSKVAF
DQSLQEVSID LPQAGFQDAA KDGGVWDYGS NGFKIAYDVN TAKNSNQERT TYSSIDGQVN
LGEWVLLGRG YAYQGENFDT NNLLLTRAIK SLKSDLQLGK TQLYNSLNNG FTFYGAQLKS
NQDMYPWNSR AYSPVINGIA RTHARVTIEQ SGYTLKSIVV PPGPFVINDL NGVYSGDLIM
KIYEEDGSVR EQRFPVAVLP NLLRPGTYNY ALAMGSKVNQ DNGERDKESL FAQMSYDYGF
EPFTLNSSLL LDKNYNNIGL GLIRSFGWFG AMSFSGNLSQ AKYHNGKNLK GYSTSLKYAK
ALGDNANLQL IGYRFNSEDY IDYADFTYNS YSFIRNRPKQ RYESIVTYQL PEKGMFLNFS
AWKEDYWDNY NEVGANLSLT KSFDQITMTL NGGYSRLQNM DADYNVGLSL SVPLSLFDKT
HYSFSNVNYD RRTGTSMNTG ISGMINQRLS YNASVNQTRD TIGGTLSASY LFDWMQTSAT
YSQTGKNSST SLQLGGSVIG VPEGGIIFTP VKNDQLAIVQ MKDVPGVMFN GSLPGDKYGR
AVIPLTAYNN NTISVNAEKL PKNIELTDNA INVTPTGNAI IYKNVKFKKI NTYVVKLYGK
NGYVVPMGSI AKNTQGKEVG YVNNGGILLM NLETHDEGVI SLDQCQFNTQ SLKKNYDQTQ
EIHCE