Gene ECH74115_0828 details


Gene Information

Locus tag: ECH74115_0828
Symbol:
ID: 6969183
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 850517
End bp: 851887
Gene Length: 1371 bp
Protein Length: 456 aa
Translation table: 11
GC content: 55%
IMG OID: 643384853
Product: hypothetical protein
Protein accession: YP_002269359
Protein GI: 209395921
COG category:
COG ID:
TIGRFAM ID:
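
The length fields above agree with the coordinates if Start bp and End bp are read as 1-based, inclusive positions and the gene length counts the stop codon. Both readings are assumptions; the record does not state them. A minimal Python sketch of that arithmetic, using hypothetical function names:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa, assuming the gene length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(850517, 851887)
    print(length, "bp")                  # 1371 bp
    print(protein_length(length), "aa")  # 456 aa
```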


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 56
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTCGTT CTTTTAAAGT CTTATCGCCA ACGGCAATCC TCGGGTATGG ATTCCCGGAA 
GAGAGCTTCC GCAAAGCGAT GGCAGAATCG CCCGATCTGA TCGCCGTTGA TGCAGGCTCT
TCCGATCCAG GCCCCCACTA CCTCGGGGCA GGCAAACCGT TTACCGATCG CGCAGGGGTC
AAGCGCGATC TGCGCTATAT GATCACCGCA GGCGTGCAAA ATAATATACC GGTGGTGATC
GGAACGGCTG GCGGTTCTGG TGCCGCACCA CACCTGGAGT GGTGCCGAGA AATAATTCAT
GAGATTGCCC GGGAAGAGCA CCTCTCGTTC TCGATGGCGT TGATCCCGGC AGACGTTGAC
AAAGCCATCG TCCACCAGGC GCTGGATAAC GGCAAAATCA CGGCGCTGGA TTTTGTCCCG
CCGTTAACCC ACGACGCGAT TGACGAAAGT ACGTATATCG TCGCGCAAAT GGGCATCGAA
CCCTTCCAGC GGGCGCTGAA AGAACGCGCG CAAGTGGTGC TGGGCGGGCG CGCTTACGAC
CCGGCCTGCT TTGCTGCGCT CCCGATCATG CAAGGCTTTG ATGAAGGTCT GGCGCTGCAT
TGTGGGAAAA TCCTTGAATG CGCGGCAATC GCCGCAACGC CCGGCTCAGG CTCTGACTGT
GCGATGGGCA TCATTGATGA CAACGGCTTT ACGCTGAAAA CATTTAATCC GAAGCGTAAA
TTTACCGAAA CGTCAGCAGC TGCACACACA CTGTATGAGA AATCCGATCC CTACTTCCTG
CCAGGACCTG GCGGCGTGTT GAACCTGAAA GGCTGCACAT TTAAAGCAGT AAATGACGGC
GAAGTCTACG TCAGCGGTTC TAAGCATGAA GAAACGCCGT ATGCCCTGAA ACTGGAAGGT
GCACGACGGG TGGGCTTCCG CTGTCTGACC ATCGCCGGAA CGCGCGACCC GATTATGATC
GCCGGGATCG ATAAAATCAT CGATGAAGTC AAAACCAGCG TTTCGCGTAA CCTGTCGCTC
GACGATGACA GCATTCGCAT CAATTTCCAC CTGTACGGCA AAAACGGGGT GATGGGAGAC
CATGAACCGA TGCAAACTGC CGGGCATGAG CTGGGGATTG TCCTTGATGT AGTTGCACCG
ACCCAGGAAA TTGCCAACAG CGTTTGCTCG CTGGTGCGCT CTACCATGCT GCACTACGGC
TATGAAAACC GCATCGCTAC CGCAGGTAAT CTGGCGTTCC CGTTCTCCCC TTCTGATATC
CAGGGCGGGC CGGTATACGA ATTTTCCATA TATCACCTGA TTGAAGCCAA CGACGCTCTG
CGTTTTGATT TCCATATTGA ACAGGTGACG CCAGAAGGAG TTCAGGCATG A
 
Protein sequence
MARSFKVLSP TAILGYGFPE ESFRKAMAES PDLIAVDAGS SDPGPHYLGA GKPFTDRAGV 
KRDLRYMITA GVQNNIPVVI GTAGGSGAAP HLEWCREIIH EIAREEHLSF SMALIPADVD
KAIVHQALDN GKITALDFVP PLTHDAIDES TYIVAQMGIE PFQRALKERA QVVLGGRAYD
PACFAALPIM QGFDEGLALH CGKILECAAI AATPGSGSDC AMGIIDDNGF TLKTFNPKRK
FTETSAAAHT LYEKSDPYFL PGPGGVLNLK GCTFKAVNDG EVYVSGSKHE ETPYALKLEG
ARRVGFRCLT IAGTRDPIMI AGIDKIIDEV KTSVSRNLSL DDDSIRINFH LYGKNGVMGD
HEPMQTAGHE LGIVLDVVAP TQEIANSVCS LVRSTMLHYG YENRIATAGN LAFPFSPSDI
QGGPVYEFSI YHLIEANDAL RFDFHIEQVT PEGVQA
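
As a sanity check, the gene sequence can be translated and compared with the protein sequence listed above. The sketch below is a minimal Python illustration, not the tool that produced this record, and its function names are hypothetical. It relies on the fact that translation table 11 assigns amino acids to codons in the same way as the standard code (the two differ in which start codons are permitted), and it assumes the sequence is given on the coding strand, 5' to 3'. A small helper for the GC fraction is included as well.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G). Table 11 uses the same
# codon-to-amino-acid assignments as the standard code.
_BASES = "TCAG"
_AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 to 1.0)."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    first_30 = "ATGGCTCGTT CTTTTAAAGT CTTATCGCCA"
    print(translate(first_30))
```

Applied to the first 30 bases of the gene sequence, `translate` returns MARSFKVLSP, which matches the start of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_content` allows the same comparison against the Protein sequence and GC content fields.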