Gene Information |
Locus tag | EcSMS35_3926 |
Symbol | |
ID | 6143802 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 4000504 |
End bp | 4002861 |
Gene Length | 2358 bp |
Protein Length | 785 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 641618752 |
Product | putative fimbrial usher protein FanD |
Protein accession | YP_001745891 |
Protein GI | 170681449 |
COG category | [N] Cell motility; [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3188] P pilus assembly protein, porin PapC |
TIGRFAM ID | |
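The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive (the record does not state its convention) and that the coding sequence ends in a single untranslated stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4000504
end_bp = 4002861
gene_length_bp = 2358
protein_length_aa = 785

# Assumption: coordinates are 1-based and inclusive at both ends.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated,
# so the protein has one residue fewer than the number of codons.
codon_count = span_bp // 3
expected_protein_aa = codon_count - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("span is a whole number of codons:", span_bp % 3 == 0)
print("protein length matches:", expected_protein_aa == protein_length_aa)
```

Under these assumptions all three checks print `True`, so the start, end, gene length, and protein length fields agree with one another.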
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 43 |
Fosmid unclonability p-value | 0.540204 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAAAAAAG AAATATTTAT TGCCGCTATT ATTTTTCATT TACTATCTAA AGGTGCTCTT GCCGAAGAGT TCAACTACAG CTTTATTCGT GGAGGGAGTA AGGATATTCC TGATGTTTTA AACAGCAATA AAGAAAACGT ACCGGGTAAA TATGTTGTTG ATGTTGTTTT CAATGGTTCT AAAATCGCGT CATCTACTGA GATGAGCATC GCAAAAGAGG ATGCTGAGGG GATATGTCTT TCTGATGAAT GGCTAACTGA AAACGGCATT ATAATTAATA AAGATTTTTA TAAAAATGTT TATAATTCTG CACGCCAGTG CTATTTGCTG GGTAATGAGG CGAACAGTAA AGTCGCGTTT GATCAATCGT TGCAAGAAGT ATCTATTGAT TTGCCACAGG CAGGCTTCCA GGACGCAGCA AAAGATGGTG GTGTGTGGGA CTATGGTAGT AACGGTTTTA AAATAGCTTA TGACGTTAAT ACTGCAAAAA ATAGCAACCA GGAAAGAACC ACCTACAGTA GTATTGATGG CCAGGTGAAT CTGGGCGAAT GGGTGTTATT GGGTAGGGGG TATGCTTACC AGGGCGAAAA TTTCGATACT AATAACCTGT TACTGACGCG GGCAATCAAA TCACTGAAAT CCGATTTGCA ACTGGGCAAG ACACAGCTTT ATAACTCATT GAATAATGGT TTTACCTTTT ACGGTGCTCA GCTTAAATCG AATCAGGATA TGTATCCGTG GAACTCTCGT GCCTATTCGC CGGTAATTAA TGGTATTGCG CGAACCCATG CCAGGGTAAC CATTGAGCAG AGTGGTTATA CCTTAAAATC TATTGTCGTG CCTCCGGGAC CATTTGTGAT CAACGATCTT AACGGCGTTT ACTCTGGCGA TTTGATCATG AAAATCTATG AAGAGGATGG CTCCGTTCGT GAACAGCGTT TCCCTGTCGC AGTGTTGCCT AATTTGTTAA GACCGGGAAC GTATAACTAC GCCCTCGCCA TGGGTAGCAA AGTTAACCAA GATAATGGCG AACGGGATAA AGAAAGTCTG TTTGCCCAAA TGAGTTATGA CTATGGATTC GAGCCTTTTA CACTCAATAG CTCTTTATTG CTGGATAAAA ATTATAACAA TATTGGTTTA GGACTAATTC GCTCGTTTGG ATGGTTTGGA GCCATGTCTT TTAGTGGCAA CTTATCACAA GCAAAATACC ATAATGGTAA GAATCTAAAA GGCTATAGCA CGTCATTAAA ATATGCAAAA GCGCTTGGTG ATAATGCAAA TTTACAGCTT ATTGGTTACC GTTTTAATTC AGAGGATTAT ATTGACTATG CTGATTTTAC TTATAATTCA TATAGTTTTA TAAGAAATAG GCCAAAGCAA CGCTATGAGT CAATCGTTAC CTACCAGCTA CCAGAAAAAG GTATGTTTTT AAACTTTTCT GCGTGGAAGG AAGATTACTG GGATAATTAT AACGAGGTCG GTGCTAACTT AAGTCTGACA AAAAGTTTTG ATCAAATAAC AATGACACTG AACGGTGGTT ATTCCAGACT GCAAAATATG GATGCTGACT ATAATGTTGG TTTGTCATTA AGTGTGCCGT TAAGTCTGTT TGATAAAACA CATTATAGTT TTTCGAATGT TAACTATGAT CGACGTACGG GGACCAGTAT GAATACTGGG ATCTCAGGAA TGATTAATCA AAGGCTGAGT TATAACGCAT CAGTTAACCA GACGCGTGAT ACTATCGGTG GTACACTTTC TGCCTCCTAT TTATTCGACT GGATGCAGAC GTCAGCAACG 
TATTCACAAA CTGGGAAAAA TTCATCTACT TCACTCCAGT TGGGTGGTAG TGTAATTGGT GTGCCAGAAG GTGGCATAAT ATTTACTCCA GTTAAAAATG ATCAACTGGC TATCGTGCAA ATGAAAGATG TCCCCGGTGT TATGTTTAAC GGTTCTTTAC CGGGAGATAA ATATGGTCGG GCGGTAATCC CACTTACTGC TTATAACAAT AATACGATTT CTGTAAATGC AGAAAAATTA CCGAAGAATA TTGAGCTTAC AGATAATGCG ATTAATGTGA CCCCAACGGG AAATGCTATC ATTTATAAAA ATGTGAAGTT TAAGAAAATT AATACCTATG TGGTTAAACT TTATGGGAAA AATGGTTATG TCGTTCCTAT GGGTAGCATC GCAAAAAATA CACAGGGTAA AGAAGTGGGT TATGTAAATA ACGGTGGTAT TTTGCTGATG AACCTTGAAA CACACGATGA AGGTGTTATT TCGCTTGACC AGTGTCAATT TAATACGCAG TCACTGAAGA AAAACTATGA CCAAACTCAG GAGATTCACT GTGAGTAA
Protein sequence | MKKEIFIAAI IFHLLSKGAL AEEFNYSFIR GGSKDIPDVL NSNKENVPGK YVVDVVFNGS KIASSTEMSI AKEDAEGICL SDEWLTENGI IINKDFYKNV YNSARQCYLL GNEANSKVAF DQSLQEVSID LPQAGFQDAA KDGGVWDYGS NGFKIAYDVN TAKNSNQERT TYSSIDGQVN LGEWVLLGRG YAYQGENFDT NNLLLTRAIK SLKSDLQLGK TQLYNSLNNG FTFYGAQLKS NQDMYPWNSR AYSPVINGIA RTHARVTIEQ SGYTLKSIVV PPGPFVINDL NGVYSGDLIM KIYEEDGSVR EQRFPVAVLP NLLRPGTYNY ALAMGSKVNQ DNGERDKESL FAQMSYDYGF EPFTLNSSLL LDKNYNNIGL GLIRSFGWFG AMSFSGNLSQ AKYHNGKNLK GYSTSLKYAK ALGDNANLQL IGYRFNSEDY IDYADFTYNS YSFIRNRPKQ RYESIVTYQL PEKGMFLNFS AWKEDYWDNY NEVGANLSLT KSFDQITMTL NGGYSRLQNM DADYNVGLSL SVPLSLFDKT HYSFSNVNYD RRTGTSMNTG ISGMINQRLS YNASVNQTRD TIGGTLSASY LFDWMQTSAT YSQTGKNSST SLQLGGSVIG VPEGGIIFTP VKNDQLAIVQ MKDVPGVMFN GSLPGDKYGR AVIPLTAYNN NTISVNAEKL PKNIELTDNA INVTPTGNAI IYKNVKFKKI NTYVVKLYGK NGYVVPMGSI AKNTQGKEVG YVNNGGILLM NLETHDEGVI SLDQCQFNTQ SLKKNYDQTQ EIHCE
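The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. The sketch below is not the database's own tooling; it is a minimal Python example that builds the standard codon table, which translation table 11 shares for codon-to-amino-acid assignments (table 11 differs from table 1 only in its permitted start codons). It assumes the gene sequence is listed in coding orientation, which appears to be the case here because it begins with `ATG` and ends with `TAA` even though the gene lies on the minus strand. The function name `translate` and the sample variables are illustrative.

```python
from itertools import product

# Standard codon table in TCAG order; table 11 uses the same
# codon-to-amino-acid assignments ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-orientation DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
first_blocks = "ATGAAAAAAG AAATATTTAT TGCCGCTATT"
print(translate(first_blocks))  # MKKEIFIAAI

# Last 18 bases of the gene sequence, ending in the TAA stop codon.
last_bases = "GAGATTCACT GTGAGTAA"
print(translate(last_bases))  # EIHCE
```

The two outputs match the first ten and last five residues of the listed protein sequence. Passing the full gene sequence to the same function would be the natural way to check the remaining residues.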