Gene EcHS_A1637 details

Gene Information

Locus tag: EcHS_A1637
Symbol:
ID: 5595186
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 1657550
End bp: 1661326
Gene Length: 3777 bp
Protein Length: 1258 aa
Translation table: 11
GC content: 51%
IMG OID: 640920785
Product: L-shaped tail fiber protein
Protein accession: YP_001458341
Protein GI: 157161023
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3064] Membrane protein involved in colicin uptake
TIGRFAM ID:
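
The gene and protein lengths listed above follow from the start and end coordinates. The short sketch below reproduces that arithmetic. It assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1657550
end_bp = 1661326

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein.
protein_length = gene_length // 3 - 1

print(gene_length)     # 3777
print(protein_length)  # 1258
```

Both results agree with the Gene Length (3777 bp) and Protein Length (1258 aa) fields.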


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCAGTAC GGATTTCAGG TGTACTGAAA GATGGCGCAG GTAAGCCGAT ACAAAACTGC 
ACCATTCAGC TAAAGGCCAG GCGCAACAGC ACCACGGTGG TGGTGAACAC AGTGGCCTCA
GAAAACCCGG ATGAAGCCGG GCGTTACAGC ATGGACGTTG AGTACGGTCA GTACAGCGTT
ATTCTGTTGG TGGAAGGCTT CCCGCCATCG CATGCCGGAA CCATCACCGT GTATGAAGAC
TCACAACCGG GTACGCTGAA TGATTTTCTC GGTGCCATGA CGGAGGATGA TGTCCGTCCG
GAGGCACTGC GCCGCTTTGA ACTGATGGTG GAAGAGGTGG CGCGTAACGC GTCCGCAGTG
GCACAGAACA CGGCAGCCGC GAAGAAGTCA GCCAGTGATG CCCGCACATC AGCCCGTGAG
GCGGCAACCC ATGCGACTGA TGCTGCGGAC TCCGCACGCG CAGCCAGCAC GTCAGCCGGA
CAGGCCGCGT CGTCGGCTCA GTCAGCGTCT TCCAGCGCAG GAACGGCATC AACAAAGGCT
ACTGAAGCAT CAAAAAGTGC TGCCGCTGCA GAGTCTTCAA AAAGCGCGGC AGCCACCAGT
GCCGGTGCAG CGAAAACGTC AGAAACGAAT GCCGCAGCAT CACAAAAATC TGCGGCCACT
TCTGCATCCA CCGCGACCAC GAAAGCGTCA GAAGCTGCCA CCTCAGCCCG GGATGCGTCG
GCTTCAAAAG TGGCGGCAAA ATCATCAGAA ACGAGCGCAG CCTCGAGCGC CGGCAGTGCA
GCTTCCTCGG CAACGGCGGC AGGAAATTCC GCGAAGGCCG CAAAAACGTC TGAGATGAAT
GCGGATAACA GCGCACAGGC GGCAGCAGAC TCACAAACTG CATCGGCAAA TTCCGCGACA
GCAGCCAAAA AATCAGAAAC CAACGCGAAA AATAGTGAGT CAGCAGCAAA GGTCAGCGAA
ACCAACGCTA AAGCGTCAGA GAACAAGGCG AAAGAATATC TCGACAAGGT CGGGGGACTC
GTCAGCCCGA TGACGCAATA CGATTGGCCC GTTGTTACTG GTAATGAGTC TTTTTACATA
AAGATCGCGA AACTTTCCGA TCCCGGAAGC AACAATTGCC ATGTAACGCT AATGGTTACT
AACGGCGGTG ACTACGGCTC CCCTTACGGA AACATTGACT TTATCGAGAT CTCGGCGCGC
GGTCTGCCTT CTTCGCTTAC TGCTGATAAT GTATCTCGTT ACCTGAGTAT ACGCCGTTTA
GGGCCAACCG GGCTAATCAA TAGCATGCAA ATGCGTTACG GCCTGGTTAA AGATGATGGC
TTTATTGAGG TTTGGGCCTT CCAGCGTGCA TTTATCAACG GCGCAAAGGT TGCGGTACTG
GCGCAGACGG CACGCACGGA ATTATACATT CCAGACGGAT TTGTTAAGCA AACCGCCGCG
CCTTCTGGAT ATGTTGAAAG CCCCGTTGTA AGGATTTACG ACCAGTTAAA CAAGCCGACT
AAAGCAGATT TGGGTCTTTC TAATGCTATG CTTACAGGCG CTTTCGGTCT TGGCGGTAGC
GGGATATCAA CAAACGGCAA GATGAGCGAT GTAGAGATCT TAAAAGCTCT GCGTGACAAA
GGTGGTCATT TCTGGCGCGG TGATAAGCCG ACCGGAAGCA CGGCGACCAT TTATAGCCAC
GGTTCTGGTA TATTCTCGCG GTGCGGCGAT ACGTGGTCAG CGATCAATAT CGACTACTCA
ACCGCGAAGA TTAAGATCTA TGCCGGCAAC GATGCCCGGC TTAACAACGG GACTTTTAGC
ATCAATGAGC TATACGGCTC GGCAAACAAG CCGTCGAAAT CGGATGTTGG ACTTGGCAAC
GTAACGAACG ATGCGCAGGT AAAAAAAACC GGCGATACAA TGACCGGTGA CTTGACAATC
AAAAAAGGTA CACCGTCAGT CTTCCTGCGG GCAGACAGTG GAGTCACCGC TTTGCGGTTT
TATACTGGCG ATAACACAGA GCGCGGCATA ATCTATGCTG GTCCTAACAC TGATTCGCTT
GGCGAAGTTC GCATCAGGGC AAAGACAGCA GGGGGGACAT CAGGAGGGGA TCTTGTTGTT
CGTCACGACG GGAGGGTTGA AGTCCGTGAT CTCACAGTAG CGTATAAAAT TAAAAGCAGA
ACGATTGAGA TTGCAAATAC CGATACTGAC TCATCGGCAA CTACGCTCAG CATCTATGGA
GTACAGCACA CGCCGTTGGT TTTAACGCGT TCTGGTTCTT CTGAAAATGT GTCCATTGGG
TTTAAGTTAG ACAACATGAA CCCAAAGTAT CTTGGAATTG ATACTAATGG GGATCTGGCT
TTTGGTGAGA GTCCTGATCA GAAACAAAAC AGCAAATTGA TCACGCAAGC GAAACTCGAC
AAGGGATTAA CGATTGGTGG TCAACTGGCT TTCAAAGGTA CGACAGCGTT TTCAGCCGTT
GCTACGTTCA TTGCCGGGAT AGCAGGAGCC ATCGAGCCGG AAAACATTGA CGGCCAGACG
GTTAATCTTA ACAACCTGAC CATCATCAAG TCAGATGCCG GGGCAGTTAA ATACTATATT
TGTCCATCCT CTGCAGGTGG TGCAAATATT ACCAATAAGC CTGACGGCAT AGCCGGTAAC
TTTTTGCTCC GTGTAGAGTC GACTCGTAAG GTTAGGGATT CAGATTATGC GAACATGCAA
ACGCTGATTA ACAGCGACAC AAAACGTATA TACGTTCGCT TTGTTGTTAA TGGAAACTGG
ACAGCGTGGA GTCAGGTTGT TGTTTCCGGA TGGAATCAGG ATATAACTGT CAGGTCGTTA
ACCACATCTA GTCCGGTAAA ATCTGGCGGA GGGCGAATTG ATGTCCTTGG AAGCACGTCA
GACTATAGCA AAATGGATTG CTTTGTACGT GGGTTTGATA GCACCGGTAA TTCTCTCGCG
TGGGCGTTGG GTTCATCAGC CGGCGTAAGT AAGATGCTGT CGCTAAAAAA TTTCTTTAGC
GGAGCTGAGA TACTGTTAAA TGGTAATGAC GGCACGGTTC AACTCAAAAC AGGTGCTGTT
AACGGGGCTA CAGCGCAGGC GCTCACTATC AACAGGAATG AGGTTAACTC AACTGTTGAT
TTAACCCTTA CAAAACAATC AGGGACTGGC AATCGTTTTG TTTTACAGAA CTCAGGTAAT
GCAGAACTAC CGTTTTCTGT CAGGGTGTGG GGTTCCAGTA CTCGACAAAA CGTTTTTGAG
GTTGGCACGT CTGCTGCGTA TCTGTTTTAT GCGCAAAAAA CGTCAGCAGG CCAGTTGTTT
GATGTAAATG GCGCTATTAA TTGCACAACG CTGAATCAGT CATCAGACCG CGACCTTAAA
GACGATATTC TCGTTATCAG CGACGCGACG AAAGCAATCC GTAAAATGAA CGGATACACC
TACACGCTCA GGGAAAACGG GATGCCTTAT GCTGGCGTTA TTGCACAGGA AGTAATGGAG
GCGATACCAG AAGCTGTGGG ATCGTTTACT CATTATGGTG AAGAGTTGCA AGGTCCGACC
GTTGACGGCA ACGAGCTACG CGAAGAAACG CGCTATCTTA ATGTTGACTA CGCCGCCGTG
ACGGGCTTAC TTGTTCAGTT CGCCCGTGAA ACAGATGATC GCGTTACCGC GCTGGAAGAG
GAAAACACAA CGCTACGTCA AAATCTGGCA ACAGCAGACA CCCGGATCAG CACTCTGGAA
AATCAGGTAA GCGAACTGGT TGCACTTGTC CGGCAGTTAA CAGGAAGCGA ACATTGA
 
Protein sequence
MAVRISGVLK DGAGKPIQNC TIQLKARRNS TTVVVNTVAS ENPDEAGRYS MDVEYGQYSV 
ILLVEGFPPS HAGTITVYED SQPGTLNDFL GAMTEDDVRP EALRRFELMV EEVARNASAV
AQNTAAAKKS ASDARTSARE AATHATDAAD SARAASTSAG QAASSAQSAS SSAGTASTKA
TEASKSAAAA ESSKSAAATS AGAAKTSETN AAASQKSAAT SASTATTKAS EAATSARDAS
ASKVAAKSSE TSAASSAGSA ASSATAAGNS AKAAKTSEMN ADNSAQAAAD SQTASANSAT
AAKKSETNAK NSESAAKVSE TNAKASENKA KEYLDKVGGL VSPMTQYDWP VVTGNESFYI
KIAKLSDPGS NNCHVTLMVT NGGDYGSPYG NIDFIEISAR GLPSSLTADN VSRYLSIRRL
GPTGLINSMQ MRYGLVKDDG FIEVWAFQRA FINGAKVAVL AQTARTELYI PDGFVKQTAA
PSGYVESPVV RIYDQLNKPT KADLGLSNAM LTGAFGLGGS GISTNGKMSD VEILKALRDK
GGHFWRGDKP TGSTATIYSH GSGIFSRCGD TWSAINIDYS TAKIKIYAGN DARLNNGTFS
INELYGSANK PSKSDVGLGN VTNDAQVKKT GDTMTGDLTI KKGTPSVFLR ADSGVTALRF
YTGDNTERGI IYAGPNTDSL GEVRIRAKTA GGTSGGDLVV RHDGRVEVRD LTVAYKIKSR
TIEIANTDTD SSATTLSIYG VQHTPLVLTR SGSSENVSIG FKLDNMNPKY LGIDTNGDLA
FGESPDQKQN SKLITQAKLD KGLTIGGQLA FKGTTAFSAV ATFIAGIAGA IEPENIDGQT
VNLNNLTIIK SDAGAVKYYI CPSSAGGANI TNKPDGIAGN FLLRVESTRK VRDSDYANMQ
TLINSDTKRI YVRFVVNGNW TAWSQVVVSG WNQDITVRSL TTSSPVKSGG GRIDVLGSTS
DYSKMDCFVR GFDSTGNSLA WALGSSAGVS KMLSLKNFFS GAEILLNGND GTVQLKTGAV
NGATAQALTI NRNEVNSTVD LTLTKQSGTG NRFVLQNSGN AELPFSVRVW GSSTRQNVFE
VGTSAAYLFY AQKTSAGQLF DVNGAINCTT LNQSSDRDLK DDILVISDAT KAIRKMNGYT
YTLRENGMPY AGVIAQEVME AIPEAVGSFT HYGEELQGPT VDGNELREET RYLNVDYAAV
TGLLVQFARE TDDRVTALEE ENTTLRQNLA TADTRISTLE NQVSELVALV RQLTGSEH
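
The gene and protein sequences above are printed in space-separated blocks of ten residues. As a minimal sketch of how the two relate, the code below strips that spacing and translates the first and last lines of the gene sequence. It assumes the amino-acid assignments of translation table 11 match the standard genetic code, which they do (table 11 differs in its permitted start codons). The helpers `clean`, `translate` and `gc_percent` are hypothetical names introduced for illustration only.

```python
from itertools import product

# Codon table in NCBI order: bases T, C, A, G, first base varying slowest.
# Assumption: table 11 uses the same amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First and last lines of the gene sequence, copied from the listing above.
first_line = "ATGGCAGTAC GGATTTCAGG TGTACTGAAA GATGGCGCAG GTAAGCCGAT ACAAAACTGC"
last_line = "AATCAGGTAA GCGAACTGGT TGCACTTGTC CGGCAGTTAA CAGGAAGCGA ACATTGA"

print(translate(first_line))  # MAVRISGVLKDGAGKPIQNC
print(translate(last_line))   # NQVSELVALVRQLTGSEH
```

The two outputs match the start and end of the protein sequence above. Because each full line of the gene sequence holds 60 bases (a multiple of three), every line begins in frame. Passing the complete gene sequence to `gc_percent` would give a value to compare against the GC content field.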