Gene Information |
Locus tag | EcHS_A3141 |
Symbol | |
ID | 5591927 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 3150464 |
End bp | 3151273 |
Gene Length | 810 bp |
Protein Length | 269 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 640922260 |
Product | type IV prepilin peptidase family protein |
Protein accession | YP_001459759 |
Protein GI | 157162441 |
COG category | [N] Cell motility [O] Posttranslational modification, protein turnover, chaperones [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG1989] Type II secretory pathway, prepilin signal peptidase PulO and related peptidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 65 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTTTTTG ATGTTTTTCA GCAATACCCC GCGGCGATGC CCGTCCTGGC AACCGTCGGA GGATTGATTA TAGGTAGTTT TTTGAATGTG GTGATTTGGC GTTACCCCAT CATGCTGCGC CAACAAATGG CGGAGTTTCA CGGTGAAATG CCGAGTGCGC AGTCAAAAAT AAGCCTGGCG CTGCCACGTT CGCACTGTCC ACATTGTCAG CAGACCATCC GGATACGTGA CAATATTCCG CTGTTCTCCT GGCTGATGCT CAAAGGGCGC TGCCGCGACT GTCAGGCGAA AATCAGCAAG CGTTATCCGC TGGTGGAGTT ATTGACGGCA CTCGCTTTTT TGCTGGCGAG TCTGGTCTGG CCGGAAAGTG GATGGGCGCT GGCGGTGATG ATATTATCCG CCTGGCTGAT TGCCGCGAGC GTCATTGACC TCGATCACCA ATGGCTGCCC GATGTTTTTA CTCAGGGCGT ATTGTGGACG GGACTGAGTG CGGCATGGGC GCAGCAGAGC CCGCTCACGC TACAAGATGC AGTTACCGGC GTCCTGGTGG GGTTTATCGC TTTTTACTCC CTGCGCTGGA TAGCCGGAAT AGTTCTGCGT AAAGAAGCAT TAGGCATGGG CGATGTATTA TTGTTCGCTG CGTTAGGTAG TTGGGTGGGG CCGTTGTCGC TACCCAATGT TGCTTTAATC GCATCATGCT GCGGCCTGAT ATATGCCGTT ATTACAAAAA GAGGATCAAC CACACTGCCT TTTGGACCGT GTTTAAGTCT GGGCGGTATA GCAACAATTT ATCTACAGGC ATTGTTTTAA
|
Protein sequence | MLFDVFQQYP AAMPVLATVG GLIIGSFLNV VIWRYPIMLR QQMAEFHGEM PSAQSKISLA LPRSHCPHCQ QTIRIRDNIP LFSWLMLKGR CRDCQAKISK RYPLVELLTA LAFLLASLVW PESGWALAVM ILSAWLIAAS VIDLDHQWLP DVFTQGVLWT GLSAAWAQQS PLTLQDAVTG VLVGFIAFYS LRWIAGIVLR KEALGMGDVL LFAALGSWVG PLSLPNVALI ASCCGLIYAV ITKRGSTTLP FGPCLSLGGI ATIYLQALF
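The length fields in this record follow from the coordinates and the sequence, and can be cross-checked with a short script. The sketch below is illustrative and not part of the database record: the helper names (`gene_length`, `protein_length`, `gc_percent`, `translate`) are hypothetical, and it assumes that the gene sequence is listed 5'→3' in coding orientation (it begins with ATG and ends with TAA, even though the gene lies on the minus strand) and that the 1-based coordinates are inclusive. Translation table 11 assigns the same amino acids to sense codons as the standard code, so a standard codon table is used here.

```python
# Illustrative consistency checks for the record above (hypothetical helpers).

BASES = "TCAG"
# Standard genetic code in TCAG order; translation table 11 uses the same
# codon-to-amino-acid assignments (it differs only in permitted start codons).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces that split the sequence into 10-letter blocks."""
    return "".join(seq.split()).upper()


def gene_length(start_bp, end_bp):
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for an unspliced CDS: codons minus the stop codon."""
    return gene_len_bp // 3 - 1


def gc_percent(seq):
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate a coding-orientation sequence up to the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    print(gene_length(3150464, 3151273))   # Start bp, End bp from the record
    print(protein_length(810))             # Gene Length from the record
    print(translate("ATGCTTTTTG ATGTTTTTCA GCAATACCCC"))  # first 30 bases
```

Passing the full Gene sequence field to `translate` and `gc_percent` allows a comparison with the Protein sequence and GC content fields in the same way.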