Gene EcHS_A2702 details


Gene Information

Locus tag: EcHS_A2702
Symbol:
ID: 5594778
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 2717798
End bp: 2721172
Gene Length: 3375 bp
Protein Length: 1124 aa
Translation table: 11
GC content: 54%
IMG OID: 640921820
Product: TPR repeat-containing protein
Protein accession: YP_001459344
Protein GI: 157162026
COG category:
COG ID:
TIGRFAM ID:
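
The start, end, and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state its convention) and that the final codon of the CDS is a stop codon; the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 2717798
end_bp = 2721172

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A CDS of N bp holds N / 3 codons; the last one is assumed to be the stop codon,
# which does not contribute an amino acid.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints "3375 1124"
```

Under these assumptions the result agrees with the reported gene length of 3375 bp and protein length of 1124 aa.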


Plasmid Coverage information

Num covering plasmid clones: 38
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGCGTGATT ATTCCGCGAA AACATTGCCA GATTGTTCAA TACACTGCCA CAAATCTTTT 
AATTCAGTAT GTCTTGTTAA TATTGAGGGC ACCATGACTC CAGTAAAAGT GTGGCAAGAG
CGCGTTGAGA TCCCGACCTA TGAAACCGGG CCGCAGGATA TACATCCCAT GTTCCTGGAA
AATCGCGTTT ATCAGGGATC GTCCGGCGCG GTTTATCCCT ATGGCGTGAC CGATACGCTG
AGCGAGCAGA AAACCCTGAA ATCCTGGCAG GCGGTGTGGC TGGAAAACGA CTACATCAAA
GTGATGATCC TGCCGGAACT GGGCGGTCGT GTGCATCGCG CATGGGATAA AGTGAAACAA
CGCGATTTTG TTTATCACAA TGAAGTCATT AAACCTGCGC TGGTGGGGCT GCTGGGACCG
TGGATCTCCG GTGGGATTGA GTTTAACTGG CCGCAACACC ATCGCCCGAC CACCTTTATG
CCCGTTGATT TCACCCTCGA AGCCCATGAA GACGGCGCAC AGACGGTGTG GGTCGGCGAA
ACGGAGCCGA TGCATGGTTT ACAGGTGATG ACAGGTTTCA CCCTGCGCCC TGACCGGGCG
GCGCTGGAAA TCGCCAGCCG CGTCTATAAC GGCAACGCCA CGCCGCGTCA TTTCTTGTGG
TGGGCCAACC CGGCAGTGAA AGGGGGGGAA GGGCATCAGA GCGTTTTCCC GCCGGATGTA
ACGGCGGTGT TTGATCACGG CAAACGGGCC GTCTCCGCTT TCCCCATCGC CACCGGCACT
TACTACAAAG TGGACTACTC CGCTGGAGTG GACATTTCTC GCTATAAAAA TGTGCCCGTT
CCAACCTCAT ATATGGCTGA AAAATCACAG TACGATTTTG TTGGCGCGTG GTGTCACGAT
GAAGATGGCG GTTTGCTGCA CGTTGCCAAC CACCATATTG CGCCAGGTAA AAAACAGTGG
AGTTGGGGAC ACAGTGAATT TGGCCAGGCG TGGGACAAGA GCCTGACCGA CAATAACGGC
CCGTATATCG AACTGATGAC CGGTATTTTT GCCGATAACC AGCCTGATTT TACCTGGCTT
GATGCTTACG AAGAGAAGCG TTTCGAGCAG TATTTCCTGC CTTATCATTC TCTGGGCATG
GTGCAAAATG CCTCCCGCGA TGCGGTGATA AAACTCCAGC GTAGTGAGCG GGGGATTGAG
TGGGGGCTGT ATGCCATCTC TCCGTTGAAC GGATACCGCC TGGCGATCCG CGAAATCGGC
AAATGCAACG CGTTACTTGA TGATGCCGTG GCACTGATGC CTGCGACCGC CATCCAGGGC
GTGTTGCACG GTATCAATCC TGAAAGGCTG ACCATTGAGC TCTCCGATGC CGACGGCAAT
ATTGTACTGA GTTATCAGGA ACATCAGGCG CAAGAGTTGC CGTTGCCGGA CGTCGCCAAA
GCGCCACTGG CAGCACAAGA CATTACCAGT ACAGATGAAG CCTGGTTTAT CGGTCAGCAT
CTGGAGCAAT ATCATCACGC CAGCCGTTCA CCGTTCGATT ACTACCTGCG CGGCGTGGCG
CTGGATCCGC TGGATTACCG CTGTAACCTG GCGCTGGCGA TGCTGGAATA TAACCGTGCC
GATTTCCCGC AAGCGGTGGC GTATGCCACT CAGGCTCTGA AACGCGCACA TGCGCTGAAC
AAAAATCCGC AGTGCGGACA GGCGAGTTTG ATTCGCGCCA GTGCTTACGA ACGTCAGGGA
CAATATCAAC AAGCCGAAGA GGATTTCTGG CGGGCGGTCT GGAGCGGCAA CAGCAAAGCC
GGTGGCTATT ATGGCCTGGC ACGACTGGCT GCGCGTAATG GTAACTTCGA CGCGGGTCTG
GATTTTTGCC AACAAAGTCT TCGCGCCTGC CCAACCAATC AGGAAGTGCT TTGCCTGCAT
AACCTGCTGC TGGTGTTAAG TGGTCGTCAG GACAACGCGC GTTTGCAGCG CGAGAAACTG
CTGCGCGATT ATCCGCTGAA CGCCACTCTG TGGTGGCTGA ACTGGTTCGA TGGTCGTAGC
GAATCAGCCC TCGCGCAGTG GCGCGGTCTG TGTCAGGGAC GCGACGTTAA CGCTCTGATG
ACCGCCGGGC AACTGATTAA CTGGGGAATG CCCACCCTGG CGGCAGAGAT GCTGAACGCA
CTGGACTGCC AGCGCACGCT GCCGCTTTAC CTGCAAGCCA GCTTGCTGCC GAAAGCCGAA
CGTGGCGAAC TGGTCGCAAA AGCCATTGAT GTCTTCCCGC AGTTTGTCCG TTTCCCGAAT
ACGCTGGAAG AAGTGGCGGC GCTGGAGAGT ATTGAAGAGT GCTGGTTTGC TCGCCATTTA
CTGGCTTGCT TCTACTACAA CAAACGTAGC TACAACAAAG CCATTGCCTT TTGGCAACGT
TGCGTAGAGA TGTCGCCGGA GTTTGACGAC GGCTGGCGCG GGTTAGCGAT CCATGCGTGG
AATAAGCAAC ACGATTATGA GCTGACCGCG CGTTATCTTG ATAATGCTTA TCAGCTTGCG
CCGCAGGATG CACGTCTGCT TTTCGAACGG GATTTGCTTG ATAAGTTAAG TGGAGCCACA
CCGGAGAAAC GACTGGCGCG TCTGGAAAAT AATCAGGAAA TTGCGCTGAA ACGCGACGAC
ATGACCGCAG AACTGCTCAA TTTGTGGCAT CTCACGGGTC AGGCAGACAA AGCGGCGGAC
ATTCTCGCCA CGCGTAAATT CCACCCGTGG GAAGGCGGGG AAGGGAAGGT CACCAGTCAG
TTTATCCTCA ACCAGTTATT ACGCGCCTGG CAGCATCTTG ATGCCAGAGA GCCGCAGCAG
GCCAGCGAAC TGCTTCATGC CGCGCTGCAT TATCCGGAGA ATTTAAGCGA AGGCCGTTTA
CCGGGGCAAA CTGATAACGA CATCTGGTTC TGGCAGGCGA TATGCGCCAA CGCGCAGGGC
GATGAAACTG AAGCGATGCG TTGTTTACGT CTGGCGGCGA CCGGCGATCG CACCATTAAC
ATCCACAGTT ATTACAACGA TCAGCCGGTT GATTATCTCT TCTGGCAAGG AATGGCGCTG
CGACTGCTGG GTGAACAGCA AACCGCACAG CAACTGTTTA GTGAAATGAA ACAGTGGGCG
CAAGAGATGG CGAAAACCAG TATTGAGGCG GATTTCTTTG CTGTTTCACA ACCTGACCTG
TTGTCGCTGT ATGGCGATTT ACAACAGCAG CATAAAGAAA AATGCCTGAT GGTGGCGATG
CTGGCGTCCG CGGGACTCGG GGAGGTTGCG CAATATGAAT CTGCTCGCGC TGAATTGACG
GCGATTAATC CGGCCTGGCC GAAAGCGGCA TTATTCACCA CCGTGATGCC TTTTATTTTT
AACTACGTTC ACTAA
 
Protein sequence
MRDYSAKTLP DCSIHCHKSF NSVCLVNIEG TMTPVKVWQE RVEIPTYETG PQDIHPMFLE 
NRVYQGSSGA VYPYGVTDTL SEQKTLKSWQ AVWLENDYIK VMILPELGGR VHRAWDKVKQ
RDFVYHNEVI KPALVGLLGP WISGGIEFNW PQHHRPTTFM PVDFTLEAHE DGAQTVWVGE
TEPMHGLQVM TGFTLRPDRA ALEIASRVYN GNATPRHFLW WANPAVKGGE GHQSVFPPDV
TAVFDHGKRA VSAFPIATGT YYKVDYSAGV DISRYKNVPV PTSYMAEKSQ YDFVGAWCHD
EDGGLLHVAN HHIAPGKKQW SWGHSEFGQA WDKSLTDNNG PYIELMTGIF ADNQPDFTWL
DAYEEKRFEQ YFLPYHSLGM VQNASRDAVI KLQRSERGIE WGLYAISPLN GYRLAIREIG
KCNALLDDAV ALMPATAIQG VLHGINPERL TIELSDADGN IVLSYQEHQA QELPLPDVAK
APLAAQDITS TDEAWFIGQH LEQYHHASRS PFDYYLRGVA LDPLDYRCNL ALAMLEYNRA
DFPQAVAYAT QALKRAHALN KNPQCGQASL IRASAYERQG QYQQAEEDFW RAVWSGNSKA
GGYYGLARLA ARNGNFDAGL DFCQQSLRAC PTNQEVLCLH NLLLVLSGRQ DNARLQREKL
LRDYPLNATL WWLNWFDGRS ESALAQWRGL CQGRDVNALM TAGQLINWGM PTLAAEMLNA
LDCQRTLPLY LQASLLPKAE RGELVAKAID VFPQFVRFPN TLEEVAALES IEECWFARHL
LACFYYNKRS YNKAIAFWQR CVEMSPEFDD GWRGLAIHAW NKQHDYELTA RYLDNAYQLA
PQDARLLFER DLLDKLSGAT PEKRLARLEN NQEIALKRDD MTAELLNLWH LTGQADKAAD
ILATRKFHPW EGGEGKVTSQ FILNQLLRAW QHLDAREPQQ ASELLHAALH YPENLSEGRL
PGQTDNDIWF WQAICANAQG DETEAMRCLR LAATGDRTIN IHSYYNDQPV DYLFWQGMAL
RLLGEQQTAQ QLFSEMKQWA QEMAKTSIEA DFFAVSQPDL LSLYGDLQQQ HKEKCLMVAM
LASAGLGEVA QYESARAELT AINPAWPKAA LFTTVMPFIF NYVH
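
The record lists translation table 11 (the bacterial, archaeal and plant plastid genetic code), and the gene sequence begins with TTG while the protein begins with M. The sketch below illustrates one way the relationship between the two sequences could be checked. It is not the database's own pipeline: the function names `translate_cds` and `gc_fraction` are hypothetical, the start-codon set is the one NCBI lists for table 11, and the code assumes the input contains only A, C, G, and T (an ambiguous base would raise a `KeyError`).

```python
from itertools import product

# NCBI genetic code 11: amino acids listed in TCAG codon order (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Codons that table 11 permits as initiators; at position 0 they are read as Met.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna):
    """Translate a CDS, ignoring whitespace, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for idx, pos in enumerate(range(0, len(seq) - 2, 3)):
        codon = seq[pos:pos + 3]
        if idx == 0 and codon in START_CODONS:
            protein.append("M")  # alternative start codons still initiate with Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna):
    """Return the fraction of G and C bases, ignoring whitespace."""
    seq = "".join(dna.split()).upper()
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases of the gene sequence above.
print(translate_cds("TTGCGTGATT ATTCCGCGAA AACATTGCCA"))  # prints "MRDYSAKTLP"
```

Passing the full gene sequence to `translate_cds` and comparing the result with the protein sequence (after removing spaces and line breaks from both) would be a quick consistency check; `gc_fraction` can be compared with the reported GC content in the same way.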