Gene ECH74115_3214 details

Gene Information

Locus tag: ECH74115_3214
Symbol:
ID: 6971663
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 2961641
End bp: 2963587
Gene Length: 1947 bp
Protein Length: 648 aa
Translation table: 11
GC content: 55%
IMG OID: 643387033
Product: hypothetical protein
Protein accession: YP_002271500
Protein GI: 209398441
COG category:
COG ID:
TIGRFAM ID:
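
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the Start bp and End bp values are 1-based inclusive positions and that the CDS ends in a single stop codon that is not counted in the protein length. Neither assumption is stated in the record itself, and the variable names are hypothetical.

```python
# Values copied from the Gene Information fields above.
start_bp = 2961641
end_bp = 2963587
gene_length_bp = 1947
protein_length_aa = 648

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted
# in the reported protein length.
codons = gene_length_bp // 3
expected_aa = codons - 1

print("span matches gene length:", span == gene_length_bp)
print("length is a multiple of 3:", gene_length_bp % 3 == 0)
print("protein length matches:", expected_aa == protein_length_aa)
```

Under those assumptions all three checks agree with the values listed in the record.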


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 38
Fosmid unclonability p-value: 0.0279968
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACATTTA AACATTATGA TGTTGTCAGG GCGGCGTCGC CGTCAGACCT TGCTGATGCA 
CTTGCGCAAA AAATTCGTGA AGGATGGCAA CCATACGGTG GGCCGTTTTC TTCGTATACG
GATGATGGCG CAGCACTTAT TCAGGCGATT GTCGCAGAAG GTGATGTGAG CACACCTGTT
GTGGTGAAGC TGACAGGTGG AGAAGGTGCA GTAATCAGCG CCACCAGAGA CCCGGAGTAT
TACTTTATTG TGGTTCTGGC GGGGCAGTCA AACAGCATGG CATATGGTGA AGGCCTTCCG
CTGCCGGAGA CATATGACCG TCCGGACCCG CGTATTAAGC AGCTGGCGCG CCGCAGTACG
GTGACACCGG GCGGTGTCGC CTGTAAATAT AACGACATCA TTCCGGCGGA CCATTGTCTG
CATGATGTGC AGGACATGAG CCGCCTTAAC CATCCGAAAG CGGACCTGTC AAAGGGGCAG
TACGGAACCG TGGGGCAGGG GCTGCATATC GCCAAAAAAT TGCTGCCGTT TATACCGGCG
AATGCGGGCA TTCTGCTGGT TCCGTGCTGT CGTGGTGGTT CAGCGTTCAC CACCGGAGCT
GATGGCACAT ACAGTGACGC GAGTGGTGCT TCGGAGAATT CAACCCGCTG GGGTGTGGAC
AAGCCGCTGT ATAAGGACCT TATCGGTCGA ACAAAAGCAG CACTGAAGAA GAACCCGAAA
AATGTGCTGT TTGCCGTGGT GTGGATGCAG GGGGAATTTG ATTTTGGCGG TACGCCGGCA
AATCACGCAG CACAGTTTGG TGCGCTGGTT GATAAATTCC GTGCAGACCT GGCGGATATG
GCAGGTCAGT GCGTCGGTGG CTCTGCTGAC GGTGTTCCCT GGATATGCGG GGACACGACG
TATTTCTGGA AGCAGAAGAA CGAAGCCACC TACCAGACGG TGTACGGCAG CTACAAAAAC
AAAACGGAAA AGAATATCCA TTTCGTACCG TTCATGACCG ATGAGAACGG GGTGAATGTG
CCGACGAACA AACCGGAAGA AGACCCGGAC ATTCCGGGTA TCGGATATTA CGGTTCGAAA
TGGCGTGACA GCTCAGCCAC CTGGACGTCA CAGGACAGGG CGAGCCATTT CAGTTCATGG
GCTCGCCGCG GGATTATTTC CGACCGTCTG GCAACGGCGA TTTTGCGCCA TGCGGGAAGA
GTGGCGCTAA ACGCGGGGGC ATCATCGACA GTATCAGAGG TGCGCCCGTC ATCGCCTTCC
GGTGCAGAAG CCACAGGCGT CACAACACTG CTCTCTTACC TTGCCAGCGA GTCAGAGGGA
AGCCTGAAAG TACAGGGATG GTCAGCCAGT GGCGGCAGGG CAGAAGTGGT CAGCGATGCG
GAGGGAACCG GAGGTAAGGC AGTGAAGCTG ACCAAGGAAG CCGGTAAAAG CAGCTGGGTG
CTGGAGTACG CCGCGGGCAA CGGTGCGGCT CTGTTACAGA AAGGGGGGCA GATTCGCTGC
CGCTTTAAGG TTTCGGGAGC GCTGGCTGCG AACCAGTATG TTATGGCGTT TTACTGGCCG
GTATCTTCAC TGCCACAGGG CGTTGCCCTG ACCGGAGACG GGGGGAATAA CCTGCTGGCA
GCGTTCTACA TCCAGACAGA TGCAAAAGAC CTGAATGTGA TGTACCACAA TGCGAAAGTG
GCGACAAACA ACCTGAAACT GGGAACCTTT GGCGCATTTG ATAACGAATG GCATACGCTG
GCTTTCCGCT TTGCCGGGAA TAACAGCCTG CAGGTGACGC CGGTTATTGA TGGTCAGGAT
GGCACACCGT TCACGCTGAC GCAGTCACCG GTCAGTGCCT TTGCGGCGGA TAAACTGCAT
GTGACAGACA TTACCAGAGG TGCGACTTAC CCGGTACTGA TAGACAGCAT TGCGGTGGAA
GTGAACAGCA CAGACACTGC GGCATGA
 
Protein sequence
MTFKHYDVVR AASPSDLADA LAQKIREGWQ PYGGPFSSYT DDGAALIQAI VAEGDVSTPV 
VVKLTGGEGA VISATRDPEY YFIVVLAGQS NSMAYGEGLP LPETYDRPDP RIKQLARRST
VTPGGVACKY NDIIPADHCL HDVQDMSRLN HPKADLSKGQ YGTVGQGLHI AKKLLPFIPA
NAGILLVPCC RGGSAFTTGA DGTYSDASGA SENSTRWGVD KPLYKDLIGR TKAALKKNPK
NVLFAVVWMQ GEFDFGGTPA NHAAQFGALV DKFRADLADM AGQCVGGSAD GVPWICGDTT
YFWKQKNEAT YQTVYGSYKN KTEKNIHFVP FMTDENGVNV PTNKPEEDPD IPGIGYYGSK
WRDSSATWTS QDRASHFSSW ARRGIISDRL ATAILRHAGR VALNAGASST VSEVRPSSPS
GAEATGVTTL LSYLASESEG SLKVQGWSAS GGRAEVVSDA EGTGGKAVKL TKEAGKSSWV
LEYAAGNGAA LLQKGGQIRC RFKVSGALAA NQYVMAFYWP VSSLPQGVAL TGDGGNNLLA
AFYIQTDAKD LNVMYHNAKV ATNNLKLGTF GAFDNEWHTL AFRFAGNNSL QVTPVIDGQD
GTPFTLTQSP VSAFAADKLH VTDITRGATY PVLIDSIAVE VNSTDTAA
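
The gene and protein sequences above can be checked against each other by translating the nucleotide sequence. The following is a minimal sketch, not the method used by the database: it builds a codon table whose amino-acid assignments are those of the standard genetic code (translation table 11 uses the same assignments and differs only in which codons may act as start codons), strips the spacing used in the listing, and translates until the first stop codon. The names `translate`, `clean`, and `CODON_TABLE` are hypothetical helpers introduced for illustration.

```python
# Codon table in the conventional T, C, A, G ordering; "*" marks stop codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used in the listing; uppercase the result."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a coding sequence from its first base up to the first stop codon."""
    dna = clean(dna)
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence listed above (60 bases).
first_line = "ATGACATTTA AACATTATGA TGTTGTCAGG GCGGCGTCGC CGTCAGACCT TGCTGATGCA"
print(translate(first_line))  # MTFKHYDVVRAASPSDLADA
```

The 20 residues printed for the first sequence line match the first two blocks of the protein sequence. Passing the full gene sequence to `translate` and comparing the result with the cleaned protein sequence extends the same check to the whole record.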