Gene ECH74115_1682 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_1682 
Symbol 
ID6970051 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp1622095 
End bp1623255 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content48% 
IMG OID643385641 
Producthypothetical protein 
Protein accessionYP_002270135 
Protein GI209396595 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00147404 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones50 
Fosmid unclonability p-value0.402738 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCATCA AACAACACAA TGGGAATACC AAAGCCGATC GTCTCGCTGA ATTAAAAATC 
CGTTCGCCCT CAATTCAACT GATAAAATTT GGCGCTATTG GTTTGAATGC AATTCTCTTT
TCCCCCCTGC TGATAGCTGC TGATACAGGA AGTCAATATG GCACCAATAT TACTATTAAT
GATGGTGACA GAATTACTGG AGATACCGCC GATCCATCAG GAAACCTCTA TGGTGTAATG
ACCCCAGCAG GAAACACGCC TGGCAATATC AACCTGGGTA ATGATGTCAC CGTCAATGTC
AACGACGCCT CTGGATATGC AAAAGGAATC ATTATTCAGG GCAAAAACAG CTCCCTGACA
GCTAACCGAC TCACAGTAGA TGTTGTTGGT CAAACCTCTG CCATCGGCAT TAATTTAATT
GGTGACTATA CCCATGCTGA CTTAGGCACA GGCAGCACCA TTAAGAGTAA CGATGACGGC
ATCATTATTG GGCATAGCTC AACACTAACA GCCACTCAAT TCACCATTGA AAACTCGAAC
GGTATAGGCC TAACCATCAA TGACTATGGC ACCAGTGTCG ATCTTGGAAG CGGAAGTAAA
ATCAAGACCG ATGGAAGTAC AGGTGTTTAT ATCGGTGGTC TCAACGGCAA TAACGCCAAT
GGTGCTGCGC GTTTTACGGC GACAGACCTG ACAATCGATG TTCAGGGCTA CAGCGCCATG
GGGATAAACG TACAGAAAAA CTCTGTTGTC GATCTCGGAA CAAACAGTTC CATTAAAACC
AGTGGCGATA ATGCACACGG CCTCTGGAGC TTTGGCCAGG TTAGCGCGAA TGCACTCACT
GTTGATGTAA CTGGAGCCGC GGCCAATGGC GTCGAAGTTC GTGGTGGTAC AACCACTATC
GGTGCAGATA GCCATATTTC TTCCGCGCAG GGCGGTGGTC TCGTCACCAG TGGTTCAGAC
GCGACAATCA ATTTTTCTGG CACGGCAGCG CAACGAAACA GCATCTTTTC CGGCGGTTCT
TATGGTGCCT CGGCCCAGAC GGCAACGGCT GTTATCAACA TGCAAAATAC CGATATTACG
GTTGATCGTA ATGGCAGTCT GGCGCTGGGT TTGTGGGCGC TCAGCGGCGC AAGAATGAAA
CCATCACCAC TCCCCGTCTG A
 
Protein sequence
MGIKQHNGNT KADRLAELKI RSPSIQLIKF GAIGLNAILF SPLLIAADTG SQYGTNITIN 
DGDRITGDTA DPSGNLYGVM TPAGNTPGNI NLGNDVTVNV NDASGYAKGI IIQGKNSSLT
ANRLTVDVVG QTSAIGINLI GDYTHADLGT GSTIKSNDDG IIIGHSSTLT ATQFTIENSN
GIGLTINDYG TSVDLGSGSK IKTDGSTGVY IGGLNGNNAN GAARFTATDL TIDVQGYSAM
GINVQKNSVV DLGTNSSIKT SGDNAHGLWS FGQVSANALT VDVTGAAANG VEVRGGTTTI
GADSHISSAQ GGGLVTSGSD ATINFSGTAA QRNSIFSGGS YGASAQTATA VINMQNTDIT
VDRNGSLALG LWALSGARMK PSPLPV