Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_1682 |
Symbol | |
ID | 6970051 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 1622095 |
End bp | 1623255 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 643385641 |
Product | hypothetical protein |
Protein accession | YP_002270135 |
Protein GI | 209396595 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00147404 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 50 |
Fosmid unclonability p-value | 0.402738 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCATCA AACAACACAA TGGGAATACC AAAGCCGATC GTCTCGCTGA ATTAAAAATC CGTTCGCCCT CAATTCAACT GATAAAATTT GGCGCTATTG GTTTGAATGC AATTCTCTTT TCCCCCCTGC TGATAGCTGC TGATACAGGA AGTCAATATG GCACCAATAT TACTATTAAT GATGGTGACA GAATTACTGG AGATACCGCC GATCCATCAG GAAACCTCTA TGGTGTAATG ACCCCAGCAG GAAACACGCC TGGCAATATC AACCTGGGTA ATGATGTCAC CGTCAATGTC AACGACGCCT CTGGATATGC AAAAGGAATC ATTATTCAGG GCAAAAACAG CTCCCTGACA GCTAACCGAC TCACAGTAGA TGTTGTTGGT CAAACCTCTG CCATCGGCAT TAATTTAATT GGTGACTATA CCCATGCTGA CTTAGGCACA GGCAGCACCA TTAAGAGTAA CGATGACGGC ATCATTATTG GGCATAGCTC AACACTAACA GCCACTCAAT TCACCATTGA AAACTCGAAC GGTATAGGCC TAACCATCAA TGACTATGGC ACCAGTGTCG ATCTTGGAAG CGGAAGTAAA ATCAAGACCG ATGGAAGTAC AGGTGTTTAT ATCGGTGGTC TCAACGGCAA TAACGCCAAT GGTGCTGCGC GTTTTACGGC GACAGACCTG ACAATCGATG TTCAGGGCTA CAGCGCCATG GGGATAAACG TACAGAAAAA CTCTGTTGTC GATCTCGGAA CAAACAGTTC CATTAAAACC AGTGGCGATA ATGCACACGG CCTCTGGAGC TTTGGCCAGG TTAGCGCGAA TGCACTCACT GTTGATGTAA CTGGAGCCGC GGCCAATGGC GTCGAAGTTC GTGGTGGTAC AACCACTATC GGTGCAGATA GCCATATTTC TTCCGCGCAG GGCGGTGGTC TCGTCACCAG TGGTTCAGAC GCGACAATCA ATTTTTCTGG CACGGCAGCG CAACGAAACA GCATCTTTTC CGGCGGTTCT TATGGTGCCT CGGCCCAGAC GGCAACGGCT GTTATCAACA TGCAAAATAC CGATATTACG GTTGATCGTA ATGGCAGTCT GGCGCTGGGT TTGTGGGCGC TCAGCGGCGC AAGAATGAAA CCATCACCAC TCCCCGTCTG A
|
Protein sequence | MGIKQHNGNT KADRLAELKI RSPSIQLIKF GAIGLNAILF SPLLIAADTG SQYGTNITIN DGDRITGDTA DPSGNLYGVM TPAGNTPGNI NLGNDVTVNV NDASGYAKGI IIQGKNSSLT ANRLTVDVVG QTSAIGINLI GDYTHADLGT GSTIKSNDDG IIIGHSSTLT ATQFTIENSN GIGLTINDYG TSVDLGSGSK IKTDGSTGVY IGGLNGNNAN GAARFTATDL TIDVQGYSAM GINVQKNSVV DLGTNSSIKT SGDNAHGLWS FGQVSANALT VDVTGAAANG VEVRGGTTTI GADSHISSAQ GGGLVTSGSD ATINFSGTAA QRNSIFSGGS YGASAQTATA VINMQNTDIT VDRNGSLALG LWALSGARMK PSPLPV
|
| |