Gene ECH74115_1914 details


Gene Information

Locus tag: ECH74115_1914
Symbol:
ID: 6968028
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 1808279
End bp: 1809448
Gene Length: 1170 bp
Protein Length: 389 aa
Translation table: 11
GC content: 52%
IMG OID: 643385847
Product: tetratricopeptide repeat protein
Protein accession: YP_002270336
Protein GI: 209396639
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2956] Predicted N-acetylglucosaminyl transferase
TIGRFAM ID:
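
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends with a single untranslated stop codon. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1808279
end_bp = 1809448

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)     # 1170, matching "Gene Length"
print(protein_length)  # 389, matching "Protein Length"
```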


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000000156513
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 3.47203e-22
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCTGGAGT TGTTGTTTCT GCTTTTGCCT GTAGCCGCTG CCTATGGCTG GTATATGGGC 
CGCAGAAGTG CGCAACAAAA CAAGCAAGAT GAAGCCAACC GCTTGTCGCG TGATTACGTA
GCGGGGGTTA ACTTCCTGCT TAGTAATCAA CAGGATAAAG CGGTAGACCT GTTTCTCGAT
ATGCTTAAAG AGGATACAGG CACCGTTGAA GCCCACCTTA CGCTCGGAAA CCTGTTCCGT
TCGCGTGGCG AAGTTGATCG CGCTATTCGC ATCCATCAGA CCCTAATGGA AAGCGCCTCG
CTGACCTATG AACAGCGTCT GTTGGCGATT CAACAACTGG GGCGTGATTA CATGGCCGCC
GGGTTATATG ACCGCGCGGA AGACATGTTC AATCAGCTGA CCGATGAAAC TGACTTCCGC
ATTGGCGCGC TGCAACAGTT GCTACAAATC TACCAGGCTA CCAGTGAGTG GCAGAAAGCA
ATTGATGTTG CCGAACGCCT GGTGAAGCTG GGTAAAGATA AACAGCGCGT CGAAATTGCC
CATTTCTACT GTGAGTTAGC CCTGCAGCAT ATGGCCAGCG ACGATCTCGA TCGTGCCATG
ACCTTGCTAA AAAAAGGGGC GGCGGCAGAT AAAAACAGCG CCCGCGTATC CATCATGATG
GGACGCGTGT TTATGGCGAA AGGAGAATAC GCCAAAGCCG TCGAAAGTTT GCAACGCGTG
ATATCCCAGG ACAGAGAACT GGTCAGCGAA ACGCTGGAAA TGCTGCAAAC CTGCTATCAG
CAGTTGGGTA AAACTGCCGA ATGGGCAGAA TTCCTGCAGC GCGCGGTGGA AGAGAACACC
GGTGCCGATG CTGAATTGAT GCTTGCTGAT ATCATCGAAG CGCGCGACGG TAGTGAGGCC
GCACAGGTCT ATATTACGCG CCAGCTTCAG CGTCATCCGA CCATGCGTGT GTTCCATAAG
TTAATGGATT ACCACTTAAA TGAAGCGGAA GAAGGGCGTG CCAAAGAGAG CCTGATGGTG
CTGCGTGACA TGGTTGGCGA GAAGGTACGT AGTAAGCCTC GTTATCGCTG CCAGAAATGT
GGTTTTACCG CATACACCCT CTACTGGCAT TGTCCGTCTT GTCGTGCCTG GTCAACTATT
AAACCGATTC GCGGTCTTGA TGGCCTGTAA
 
Protein sequence
MLELLFLLLP VAAAYGWYMG RRSAQQNKQD EANRLSRDYV AGVNFLLSNQ QDKAVDLFLD 
MLKEDTGTVE AHLTLGNLFR SRGEVDRAIR IHQTLMESAS LTYEQRLLAI QQLGRDYMAA
GLYDRAEDMF NQLTDETDFR IGALQQLLQI YQATSEWQKA IDVAERLVKL GKDKQRVEIA
HFYCELALQH MASDDLDRAM TLLKKGAAAD KNSARVSIMM GRVFMAKGEY AKAVESLQRV
ISQDRELVSE TLEMLQTCYQ QLGKTAEWAE FLQRAVEENT GADAELMLAD IIEARDGSEA
AQVYITRQLQ RHPTMRVFHK LMDYHLNEAE EGRAKESLMV LRDMVGEKVR SKPRYRCQKC
GFTAYTLYWH CPSCRAWSTI KPIRGLDGL
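
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes nothing beyond the record itself: the codon table is written out in the conventional T, C, A, G ordering, and translation table 11 assigns the same amino acids to codons as the standard code (it differs in which start codons are permitted). The function names `clean` and `translate` are hypothetical helpers, and only the first and last rows of the gene sequence are used to keep the example short.

```python
# Codon table in "TCAG" order: index = 16*first + 4*second + third.
# Assumption: amino acid assignments of table 11 equal the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        a, b, c = (BASES.index(base) for base in dna[i:i + 3])
        protein.append(AMINO_ACIDS[16 * a + 4 * b + c])
    return "".join(protein)


# First and last rows of the gene sequence shown above.
first_row = "ATGCTGGAGT TGTTGTTTCT GCTTTTGCCT GTAGCCGCTG CCTATGGCTG GTATATGGGC"
last_row = "AAACCGATTC GCGGTCTTGA TGGCCTGTAA"

print(translate(first_row))  # MLELLFLLLPVAAAYGWYMG
print(translate(last_row))   # KPIRGLDGL*
```

The two outputs correspond to the first 20 and the last 9 residues of the protein sequence listed above, with the trailing `*` marking the TAA stop codon that is not part of the 389 aa product.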