Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_0150 |
Symbol | htrE |
ID | 6143082 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 163230 |
End bp | 165830 |
Gene Length | 2601 bp |
Protein Length | 866 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 641615051 |
Product | putative outer membrane usher protein |
Protein accession | YP_001742267 |
Protein GI | 170679612 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3188] P pilus assembly protein, porin PapC |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0557898 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 59 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCCCGAA AAATCATAAA AACGCATAAT AACAAACTTA CATATATAGC CTCGTTTTGC GCATTGCTAT TAAGCCCCTG TGCCATAAGC GCTGAACACG TTGAATATGA TAATACCTTT TTGATGGGGC AAGATGCTTT TAACATCGAC CTTAGTCGGT ATACCGAAGG TAACCCTACC TTGCCCGGCG TGTATGACGT CAGTGTTTAT ATCAATGATC AACCGGTTAT CAACCAAAGC ATTTCCTTTA TTACCCTCGA AGGTAAGAAG AATGCGCAGG CTTGTATCAC CCTGAAGAAT TTATTGCAGT TTCACATTAA TAAACCAGAT ATAAATGCCG AGAATTCCAT CCTTCTACAG CGCGAAGGCG AACTCGGAGA TTGTCTTGAT TTAGCAAATA TCATTCCTCA GGCCTCGGTT CATTATGACG TCAACGATCA ACGACTGGAC ATCAATGTTC CTCAAGCCTG GGTAATGAAG AATTACCAAA ACTATGTTGA CTCTTCACTG TGGGAAAACG GTATTAACGC CGCAATGCTA GCCTACAACG TCAACGCATA TCACAGCGAA ATTCCGGACA GAAAAAACGA CAGTGTTTAC GCCGCATTTA ACGGCGGTAT AAATCTGGGG GCATGGCGAC TTCGCGCAAC CGGCAACTAC AACTGGATGA CCAATGTAGG CAGTGATTAC GATTTTCAGA ATCGCTATTT GCAGCGCGAC CTGGCCTCTT TGCGTTCACA GTTAATAGTG GGTGAGTCAT ACACAACTGG GGAAACCTTT GATGCTGTCA GTATTCGCGG TATTCGTTTA TACAGCGACA GCCGAATGCT ACCGCCTGCG TTAGCTAGTT TTGCGCCTAT TATTCATGGT GTCGCTAACA CCAACGCAAA AGTCACCATT ACCCAGGGCG GGTATAAAAT ATATGAAACT ACTGTACCGC CGGGAGCATT TGTTATTGAT GATTTAAGCC CATCAGGCTA CGGCAGCGAT CTCATTATCA CAATCGAAGA ATCTGATGGC ATAAAACGAA CTTTTTCCCA GCCATTTTCT TCGGTAATTC AGATGCAACG CCCTGGTGTT GGAAGATGGG ACATCAGTGC TGGTCAGGTA TTAAAAGACG ATATTCAAAA TGAGCCAAAT TTGTTCCAGG CCAGCTACTA CTATGGTCTG AACAACTATC TTACAGGTTA TACCGGTATT CAAATTACCG ATAATAACTA TACTGCCGGG CTGTTGGGCC TTGGCCTGAA TACCGCGTAC GGTGCGTTTT CAGTCGATGT AACCCACTCG GATGTGCAAA TTCCGGACGA TAAAACCTAC CGCGGGCAAA GTTATCGCAT CTCCTGGAAT AAGTTATTTG AAGATACCAG AACCTCGCTC AATATCGCCG CTTACCGTTA TTCAACCCAG AATTATCTGG GGCTTAACGA TGCACTGACA TTGATCGATG AAGTAAAACA TCCGGAACAG GACCTGGAAC CTAAAAATAT GCGTAACTAT TCACGCATGA AAAATCAGGT TACGGTTAGT ATTAACCAAC CCCTCAAGTT TGAAAAGAAA GACTATGGCT CATTCTATCT TGCCGGAAGT TGGTCTGACT ATTGGGCGGA CGGACAAAAT AATACTAACT ACTCCATTGG TTATAGTAAC AGCGCATCGT GGGGAAGCTA CAGCATTAGT GCCCAACGGT CCTGGAGCCA GGACGGCGGT AATGAAGACA GCATCTATCT AAGCTTTAGC ATTCCTATCG AAAAGTTACT GGGTTCAGAA CATCGCGACT CTGGTTTTCA GAGCATTGAT ACTCAGTTAA ATAGTGATTT CAACGGCAGC AATCAACTCA GTATCAGCAG TAGTGGTTAT AGCACCGAAA ACCATATCAG TTATAGCGTC AATACAGGCT ATTCGATGAT GAAATCCAGT GATGATTTGG GCTATATCGG CGGATATGCA AGCTATGAAT CTCCCTGGGG AACTCTTTCC AGTTCAGTTT CCGCAAGCAG TGATAATAGC CGTCAAATCT CATTTAATAC CGACGGCGGA TTTGTTTTAC ATAGCGGCGG CCTGACCTTC AGTAATGACA GCTTTAGCGA CTCAGACACT CTGGCCGTTG TACAAGCGCC AGGAGCTAAA GGGGCGCGTA TTAACTATGG TAATAGTACG ATAGATCGTT GGGGATATGG TGTTACTAAC GCGCTCTCCC CTTATCATGA AAACCGAATT GCACTCGATA TTAACGGTCT GGAAAATGAC GTTGAATTGA AAAGTACCAG TGCTATTACT GTCCCCCGCC AAGGGTCTGT TGTCTTTGCT GGTTTTGAAA CAGTTCAGGG ACAATCAGCG ATTATGAACA TCAAGCGAAC TGACGGTAAA AACATTCCGT TTGCCGCAGA TATTTATGAC GAGAACGGAA ATATTATTGG CAATGTAGGG CAAGGTGGTC AGGCCTTTGT TCGAGGTATA GAACAACAAG GTAACATACG TATCAACTGG CTTGACGACG GTAAACCTGT CACTTGCCTT GCTCATTACC AGCAGAGCGC AGCACCAGAA AAAATAGCGC AAACTATTAT TCTGAATGGA ATTAGTTGTC AGATTCAGTA A
|
Protein sequence | MSRKIIKTHN NKLTYIASFC ALLLSPCAIS AEHVEYDNTF LMGQDAFNID LSRYTEGNPT LPGVYDVSVY INDQPVINQS ISFITLEGKK NAQACITLKN LLQFHINKPD INAENSILLQ REGELGDCLD LANIIPQASV HYDVNDQRLD INVPQAWVMK NYQNYVDSSL WENGINAAML AYNVNAYHSE IPDRKNDSVY AAFNGGINLG AWRLRATGNY NWMTNVGSDY DFQNRYLQRD LASLRSQLIV GESYTTGETF DAVSIRGIRL YSDSRMLPPA LASFAPIIHG VANTNAKVTI TQGGYKIYET TVPPGAFVID DLSPSGYGSD LIITIEESDG IKRTFSQPFS SVIQMQRPGV GRWDISAGQV LKDDIQNEPN LFQASYYYGL NNYLTGYTGI QITDNNYTAG LLGLGLNTAY GAFSVDVTHS DVQIPDDKTY RGQSYRISWN KLFEDTRTSL NIAAYRYSTQ NYLGLNDALT LIDEVKHPEQ DLEPKNMRNY SRMKNQVTVS INQPLKFEKK DYGSFYLAGS WSDYWADGQN NTNYSIGYSN SASWGSYSIS AQRSWSQDGG NEDSIYLSFS IPIEKLLGSE HRDSGFQSID TQLNSDFNGS NQLSISSSGY STENHISYSV NTGYSMMKSS DDLGYIGGYA SYESPWGTLS SSVSASSDNS RQISFNTDGG FVLHSGGLTF SNDSFSDSDT LAVVQAPGAK GARINYGNST IDRWGYGVTN ALSPYHENRI ALDINGLEND VELKSTSAIT VPRQGSVVFA GFETVQGQSA IMNIKRTDGK NIPFAADIYD ENGNIIGNVG QGGQAFVRGI EQQGNIRINW LDDGKPVTCL AHYQQSAAPE KIAQTIILNG ISCQIQ
|
| |