Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4093 |
Symbol | lpfD |
ID | 6145107 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 4186355 |
End bp | 4187425 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 641618917 |
Product | long polar fimbrial operon protein LpfD |
Protein accession | YP_001746055 |
Protein GI | 170683702 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3539] P pilus assembly protein, pilin FimA |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 0.234808 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACAAGT ATATTATACA GTGGTGCTTT ACTGTTTTTA TCTTCTCCTT TAGTGGTGCA ACATTTGCGG CCCCAAAAGG TATCTGCACC TCAGATAATG GGGCCTTTCA TAGCACACTT GATTTTTCCG GCTATCTGAT TACGGCAGAC GAGAACAGAG TGGGAGAGAC CTTTAATAAA ACCGTGACAA ATGGGGACTC TTATCCTGCT CACTGCCATT GTGATACAGG GAAAGTGGGG GAGTTTCCTT ATATTTACTA TACAGCGAGA ATAAACGAGG CCTTAAGTTA TGCGGGTGTT CGTTCTAATG TAAACTACTT TAATTTGAAT CCTAATCTGG ATGTTGGAAT ATCGATAGAC CTTCTCGGTG TAGGATACAT TAATGCACCT TTTGAATACC ATGCTAACAG GCCTACTGGA GGTTCATATA AATGTAGTCG CACGGACCCA TTAAGTATTT CCAGTGGTGC AAAAGCAATA ATTTATTTTT ATATTAAGAA GACTTTTGCT GGAAAGTTAA TTGTCCCTGA AACGCTCGTG GCAAAATTGT ACGGAACTAT AAGCCGTGAC ACTCCGGTTG ATTACTCACA ACCTATGGCG GATGTTTATA TTCGTGGCGA TATCACTGCA CCGCAAAGCT GTGAAATTAA CAGTTTAAGA CCCATTAATT TTGATTTTAA AGAAATTCCT GCAGCAGATT TTTCTTCGGT AGCTGGAAGT ACCGTGACAA CGCATAAAAT TACCAAAACC GTCACCATTG AGTGTGAAAA TTTAGGAATA CTAAATACTG ATGATATCAG TACCTCTTTT TATGCTACCG AACCCAGTAC TGACAACTCA ATGGTCGTGA CATCAAACCC GAACGTTGGG ATAAAAATTT ATGATAAAAA TAATAAGGAA ATCAACGTTA ACGGTGGTGA ACTGCCGACA GATATGGGTA AATCAAACGT CTTTGGTGAA AAAGCCGGTA GCGTCACTTT TTCTGCTGCT CCTGCAAGCC TCACGGGGGC TCGCCCCGCG CCAGGAACAT TTACCGCGAC TGCAACGATA ACAATTGAGA TTGTACGCTA A
|
Protein sequence | MNKYIIQWCF TVFIFSFSGA TFAAPKGICT SDNGAFHSTL DFSGYLITAD ENRVGETFNK TVTNGDSYPA HCHCDTGKVG EFPYIYYTAR INEALSYAGV RSNVNYFNLN PNLDVGISID LLGVGYINAP FEYHANRPTG GSYKCSRTDP LSISSGAKAI IYFYIKKTFA GKLIVPETLV AKLYGTISRD TPVDYSQPMA DVYIRGDITA PQSCEINSLR PINFDFKEIP AADFSSVAGS TVTTHKITKT VTIECENLGI LNTDDISTSF YATEPSTDNS MVVTSNPNVG IKIYDKNNKE INVNGGELPT DMGKSNVFGE KAGSVTFSAA PASLTGARPA PGTFTATATI TIEIVR
|
| |