Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A1637 |
Symbol | |
ID | 5595186 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 1657550 |
End bp | 1661326 |
Gene Length | 3777 bp |
Protein Length | 1258 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640920785 |
Product | L-shaped tail fiber protein |
Protein accession | YP_001458341 |
Protein GI | 157161023 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3064] Membrane protein involved in colicin uptake |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAGTAC GGATTTCAGG TGTACTGAAA GATGGCGCAG GTAAGCCGAT ACAAAACTGC ACCATTCAGC TAAAGGCCAG GCGCAACAGC ACCACGGTGG TGGTGAACAC AGTGGCCTCA GAAAACCCGG ATGAAGCCGG GCGTTACAGC ATGGACGTTG AGTACGGTCA GTACAGCGTT ATTCTGTTGG TGGAAGGCTT CCCGCCATCG CATGCCGGAA CCATCACCGT GTATGAAGAC TCACAACCGG GTACGCTGAA TGATTTTCTC GGTGCCATGA CGGAGGATGA TGTCCGTCCG GAGGCACTGC GCCGCTTTGA ACTGATGGTG GAAGAGGTGG CGCGTAACGC GTCCGCAGTG GCACAGAACA CGGCAGCCGC GAAGAAGTCA GCCAGTGATG CCCGCACATC AGCCCGTGAG GCGGCAACCC ATGCGACTGA TGCTGCGGAC TCCGCACGCG CAGCCAGCAC GTCAGCCGGA CAGGCCGCGT CGTCGGCTCA GTCAGCGTCT TCCAGCGCAG GAACGGCATC AACAAAGGCT ACTGAAGCAT CAAAAAGTGC TGCCGCTGCA GAGTCTTCAA AAAGCGCGGC AGCCACCAGT GCCGGTGCAG CGAAAACGTC AGAAACGAAT GCCGCAGCAT CACAAAAATC TGCGGCCACT TCTGCATCCA CCGCGACCAC GAAAGCGTCA GAAGCTGCCA CCTCAGCCCG GGATGCGTCG GCTTCAAAAG TGGCGGCAAA ATCATCAGAA ACGAGCGCAG CCTCGAGCGC CGGCAGTGCA GCTTCCTCGG CAACGGCGGC AGGAAATTCC GCGAAGGCCG CAAAAACGTC TGAGATGAAT GCGGATAACA GCGCACAGGC GGCAGCAGAC TCACAAACTG CATCGGCAAA TTCCGCGACA GCAGCCAAAA AATCAGAAAC CAACGCGAAA AATAGTGAGT CAGCAGCAAA GGTCAGCGAA ACCAACGCTA AAGCGTCAGA GAACAAGGCG AAAGAATATC TCGACAAGGT CGGGGGACTC GTCAGCCCGA TGACGCAATA CGATTGGCCC GTTGTTACTG GTAATGAGTC TTTTTACATA AAGATCGCGA AACTTTCCGA TCCCGGAAGC AACAATTGCC ATGTAACGCT AATGGTTACT AACGGCGGTG ACTACGGCTC CCCTTACGGA AACATTGACT TTATCGAGAT CTCGGCGCGC GGTCTGCCTT CTTCGCTTAC TGCTGATAAT GTATCTCGTT ACCTGAGTAT ACGCCGTTTA GGGCCAACCG GGCTAATCAA TAGCATGCAA ATGCGTTACG GCCTGGTTAA AGATGATGGC TTTATTGAGG TTTGGGCCTT CCAGCGTGCA TTTATCAACG GCGCAAAGGT TGCGGTACTG GCGCAGACGG CACGCACGGA ATTATACATT CCAGACGGAT TTGTTAAGCA AACCGCCGCG CCTTCTGGAT ATGTTGAAAG CCCCGTTGTA AGGATTTACG ACCAGTTAAA CAAGCCGACT AAAGCAGATT TGGGTCTTTC TAATGCTATG CTTACAGGCG CTTTCGGTCT TGGCGGTAGC GGGATATCAA CAAACGGCAA GATGAGCGAT GTAGAGATCT TAAAAGCTCT GCGTGACAAA GGTGGTCATT TCTGGCGCGG TGATAAGCCG ACCGGAAGCA CGGCGACCAT TTATAGCCAC GGTTCTGGTA TATTCTCGCG GTGCGGCGAT ACGTGGTCAG CGATCAATAT CGACTACTCA ACCGCGAAGA TTAAGATCTA TGCCGGCAAC GATGCCCGGC TTAACAACGG GACTTTTAGC ATCAATGAGC TATACGGCTC GGCAAACAAG CCGTCGAAAT CGGATGTTGG ACTTGGCAAC GTAACGAACG ATGCGCAGGT AAAAAAAACC GGCGATACAA TGACCGGTGA CTTGACAATC AAAAAAGGTA CACCGTCAGT CTTCCTGCGG GCAGACAGTG GAGTCACCGC TTTGCGGTTT TATACTGGCG ATAACACAGA GCGCGGCATA ATCTATGCTG GTCCTAACAC TGATTCGCTT GGCGAAGTTC GCATCAGGGC AAAGACAGCA GGGGGGACAT CAGGAGGGGA TCTTGTTGTT CGTCACGACG GGAGGGTTGA AGTCCGTGAT CTCACAGTAG CGTATAAAAT TAAAAGCAGA ACGATTGAGA TTGCAAATAC CGATACTGAC TCATCGGCAA CTACGCTCAG CATCTATGGA GTACAGCACA CGCCGTTGGT TTTAACGCGT TCTGGTTCTT CTGAAAATGT GTCCATTGGG TTTAAGTTAG ACAACATGAA CCCAAAGTAT CTTGGAATTG ATACTAATGG GGATCTGGCT TTTGGTGAGA GTCCTGATCA GAAACAAAAC AGCAAATTGA TCACGCAAGC GAAACTCGAC AAGGGATTAA CGATTGGTGG TCAACTGGCT TTCAAAGGTA CGACAGCGTT TTCAGCCGTT GCTACGTTCA TTGCCGGGAT AGCAGGAGCC ATCGAGCCGG AAAACATTGA CGGCCAGACG GTTAATCTTA ACAACCTGAC CATCATCAAG TCAGATGCCG GGGCAGTTAA ATACTATATT TGTCCATCCT CTGCAGGTGG TGCAAATATT ACCAATAAGC CTGACGGCAT AGCCGGTAAC TTTTTGCTCC GTGTAGAGTC GACTCGTAAG GTTAGGGATT CAGATTATGC GAACATGCAA ACGCTGATTA ACAGCGACAC AAAACGTATA TACGTTCGCT TTGTTGTTAA TGGAAACTGG ACAGCGTGGA GTCAGGTTGT TGTTTCCGGA TGGAATCAGG ATATAACTGT CAGGTCGTTA ACCACATCTA GTCCGGTAAA ATCTGGCGGA GGGCGAATTG ATGTCCTTGG AAGCACGTCA GACTATAGCA AAATGGATTG CTTTGTACGT GGGTTTGATA GCACCGGTAA TTCTCTCGCG TGGGCGTTGG GTTCATCAGC CGGCGTAAGT AAGATGCTGT CGCTAAAAAA TTTCTTTAGC GGAGCTGAGA TACTGTTAAA TGGTAATGAC GGCACGGTTC AACTCAAAAC AGGTGCTGTT AACGGGGCTA CAGCGCAGGC GCTCACTATC AACAGGAATG AGGTTAACTC AACTGTTGAT TTAACCCTTA CAAAACAATC AGGGACTGGC AATCGTTTTG TTTTACAGAA CTCAGGTAAT GCAGAACTAC CGTTTTCTGT CAGGGTGTGG GGTTCCAGTA CTCGACAAAA CGTTTTTGAG GTTGGCACGT CTGCTGCGTA TCTGTTTTAT GCGCAAAAAA CGTCAGCAGG CCAGTTGTTT GATGTAAATG GCGCTATTAA TTGCACAACG CTGAATCAGT CATCAGACCG CGACCTTAAA GACGATATTC TCGTTATCAG CGACGCGACG AAAGCAATCC GTAAAATGAA CGGATACACC TACACGCTCA GGGAAAACGG GATGCCTTAT GCTGGCGTTA TTGCACAGGA AGTAATGGAG GCGATACCAG AAGCTGTGGG ATCGTTTACT CATTATGGTG AAGAGTTGCA AGGTCCGACC GTTGACGGCA ACGAGCTACG CGAAGAAACG CGCTATCTTA ATGTTGACTA CGCCGCCGTG ACGGGCTTAC TTGTTCAGTT CGCCCGTGAA ACAGATGATC GCGTTACCGC GCTGGAAGAG GAAAACACAA CGCTACGTCA AAATCTGGCA ACAGCAGACA CCCGGATCAG CACTCTGGAA AATCAGGTAA GCGAACTGGT TGCACTTGTC CGGCAGTTAA CAGGAAGCGA ACATTGA
|
Protein sequence | MAVRISGVLK DGAGKPIQNC TIQLKARRNS TTVVVNTVAS ENPDEAGRYS MDVEYGQYSV ILLVEGFPPS HAGTITVYED SQPGTLNDFL GAMTEDDVRP EALRRFELMV EEVARNASAV AQNTAAAKKS ASDARTSARE AATHATDAAD SARAASTSAG QAASSAQSAS SSAGTASTKA TEASKSAAAA ESSKSAAATS AGAAKTSETN AAASQKSAAT SASTATTKAS EAATSARDAS ASKVAAKSSE TSAASSAGSA ASSATAAGNS AKAAKTSEMN ADNSAQAAAD SQTASANSAT AAKKSETNAK NSESAAKVSE TNAKASENKA KEYLDKVGGL VSPMTQYDWP VVTGNESFYI KIAKLSDPGS NNCHVTLMVT NGGDYGSPYG NIDFIEISAR GLPSSLTADN VSRYLSIRRL GPTGLINSMQ MRYGLVKDDG FIEVWAFQRA FINGAKVAVL AQTARTELYI PDGFVKQTAA PSGYVESPVV RIYDQLNKPT KADLGLSNAM LTGAFGLGGS GISTNGKMSD VEILKALRDK GGHFWRGDKP TGSTATIYSH GSGIFSRCGD TWSAINIDYS TAKIKIYAGN DARLNNGTFS INELYGSANK PSKSDVGLGN VTNDAQVKKT GDTMTGDLTI KKGTPSVFLR ADSGVTALRF YTGDNTERGI IYAGPNTDSL GEVRIRAKTA GGTSGGDLVV RHDGRVEVRD LTVAYKIKSR TIEIANTDTD SSATTLSIYG VQHTPLVLTR SGSSENVSIG FKLDNMNPKY LGIDTNGDLA FGESPDQKQN SKLITQAKLD KGLTIGGQLA FKGTTAFSAV ATFIAGIAGA IEPENIDGQT VNLNNLTIIK SDAGAVKYYI CPSSAGGANI TNKPDGIAGN FLLRVESTRK VRDSDYANMQ TLINSDTKRI YVRFVVNGNW TAWSQVVVSG WNQDITVRSL TTSSPVKSGG GRIDVLGSTS DYSKMDCFVR GFDSTGNSLA WALGSSAGVS KMLSLKNFFS GAEILLNGND GTVQLKTGAV NGATAQALTI NRNEVNSTVD LTLTKQSGTG NRFVLQNSGN AELPFSVRVW GSSTRQNVFE VGTSAAYLFY AQKTSAGQLF DVNGAINCTT LNQSSDRDLK DDILVISDAT KAIRKMNGYT YTLRENGMPY AGVIAQEVME AIPEAVGSFT HYGEELQGPT VDGNELREET RYLNVDYAAV TGLLVQFARE TDDRVTALEE ENTTLRQNLA TADTRISTLE NQVSELVALV RQLTGSEH
|
| |