Gene ECH74115_0589 details


Gene Information

Locus tag: ECH74115_0589
Symbol:
ID: 6971416
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 609541
End bp: 610557
Gene Length: 1017 bp
Protein Length: 338 aa
Translation table: 11
GC content: 49%
IMG OID: 643384631
Product: hypothetical protein
Protein accession: YP_002269145
Protein GI: 209398275
COG category: [U] Intracellular trafficking, secretion, and vesicular transport; [W] Extracellular structures
COG ID: [COG5295] Autotransporter adhesin
TIGRFAM ID:
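
The length fields above follow from the coordinates, if one assumes the start and end positions are 1-based and inclusive and that the coding sequence ends with a single stop codon. A minimal sketch of that arithmetic (the function names are illustrative and not part of the source database):

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp, has_stop=True):
    """Number of residues encoded by a CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default,
    that the last codon is a stop codon that is not translated.
    """
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1 if has_stop else codons


length = gene_length(609541, 610557)
print(length)                  # 1017
print(protein_length(length))  # 338
```

Both values agree with the Gene Length and Protein Length fields of this record.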


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 52
Fosmid unclonability p-value: 0.664865
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAACTG TAAACGTAGC TTTACTGGCA CTCATAATTT CAGCAACATC CAGCCCTGTT 
GTTTTAGCTG GTGATACCAT TGAAGCGGCG GCAACAGAGC TTTCAGCCAT TAACTCTGGC
ATGTCGCAAT CGGAGATTGA GCAGAAGATT ACCCGCTTTT TAGAACGCAC AGACAACAGC
CCCGCTGCGT ATACCTATTT GACTGAACAT CACTACATCC CTTCTGAAAC ACCTGATACC
ACTCAGACTC CCACTGTCCA GACAGATCCT GACGCAGGAC AAAAAACCGT TGCCGCTACA
GGTGATGTAC AGACAACTGC CCGTTATCAG AGCATGATCA ACGCCCGACA GTCTGCGGTA
ACTGACGCCC AGCAAACGCA AATTACAGAG CAACAGGCGC AGATCGTAGC CACACAAAAA
ACGCTCGCCG CGACTGGAGA TACGCAAAAT ACCGCGCATT ATCAGGAAAT GATTAATGCC
AGACTGGCGG CTCAAAATGA GGCTAATCAG CGCACCGCCA CTGAACAAGG GCAGAAAATG
AATGCGCTGA CAACCGATGT GGCAGTACAA CAGCAAAATG AAAGGACTCA ATACGATAAA
CAAATGCAAA GTCTGGCGCA GGAGTCTGCC CAGGCACATG AACAAATTGA CAGCCTGTCA
CAAGACGTAA CCCAAACGCA CCAACAGTTA ACCAACACCC AAAAACGGGT TGCAGATAAC
AGCCAGCAAA TTAACACGCT CAATAACCAT TTCAGTTCGC TAAAAAACGA AGTTGATGAC
AATCGTAAAG AAGCCAATGC GGGAACTGCA TCTGCCATCG CTATCGCCTC ACAACCACAG
GTTAAAACCG GTGACGTGAT GATGGTGTCA GCGGGAGCGG GAACCTTCAA CGGTGAATCT
GCGGTGTCTG TCGGAACATC ATTTAATGCC GGAACGCATA CGGTACTTAA AGCCGGTATT
TCTGCGGATA CACAATCTGA TTTCGGCGCA GGTGTCGGCG TGGGATATTC GTTCTAA
 
Protein sequence
MKTVNVALLA LIISATSSPV VLAGDTIEAA ATELSAINSG MSQSEIEQKI TRFLERTDNS 
PAAYTYLTEH HYIPSETPDT TQTPTVQTDP DAGQKTVAAT GDVQTTARYQ SMINARQSAV
TDAQQTQITE QQAQIVATQK TLAATGDTQN TAHYQEMINA RLAAQNEANQ RTATEQGQKM
NALTTDVAVQ QQNERTQYDK QMQSLAQESA QAHEQIDSLS QDVTQTHQQL TNTQKRVADN
SQQINTLNNH FSSLKNEVDD NRKEANAGTA SAIAIASQPQ VKTGDVMMVS AGAGTFNGES
AVSVGTSFNA GTHTVLKAGI SADTQSDFGA GVGVGYSF
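
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below is one way to do that in plain Python; it is not the database's own pipeline. It assumes the sequence is given on the coding strand in frame, and it relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which codons may act as start codons, which this sketch does not handle). The helper names `clean`, `translate`, and `gc_fraction` are illustrative.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G). Translation table 11 uses
# the same amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq):
    """Remove the spaces and line breaks used to block the sequence."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence from this record.
first_line = "ATGAAAACTG TAAACGTAGC TTTACTGGCA CTCATAATTT CAGCAACATC CAGCCCTGTT"
print(translate(first_line))  # MKTVNVALLALIISATSSPV
```

The output matches the first 20 residues of the protein sequence above. Passing the full gene sequence to `translate` and `gc_fraction` would allow the Protein Length and GC content fields to be checked the same way.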