Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_B0001 |
Symbol | |
ID | 6966416 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011350 |
Strand | + |
Start bp | 78551 |
End bp | 81247 |
Gene Length | 2697 bp |
Protein Length | 898 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 643384017 |
Product | hypothetical protein |
Protein accession | YP_002268496 |
Protein GI | 209395628 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.362485 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.442936 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACACTA AAATGAATGA GAGATGGAGA ACACCGATGA AATTAAAGTA TCTGTCATGT ACGATCCTTG CCCCTCTGGC GATTGGGGTA TTTTCTGCAA CAGCTGCTGA TAATAATTCA GCCATTTATT TCAATACCTC CCAGCCTATA AATGATCTGC AGGGTTCGTT GGCCGCAGAG GTGAAATTTG CACAAAGCCA GATTTTACCC GCCCATCCTA AAGAAGGGGA TAGTCAACCA CATCTGACCA GCCTGCGGAA AAGTCTGCTG CTTGTCCGTC CGGTGAAAGC TGATGATAAA ACACCTGTTC AGGTGGAAGC CCGCGATGAT AATAATAAAA TTCTCGGTAC GTTAACCCTT TATCCTCCTT CATCACTACC GGATACAATC TACCATCTGG ATGGTGTTCC GGAAGGTGGT ATCGATTTCA CACCTCATAA TGGAACGAAA AAGATCATTA ATACGGTGGC TGAAGTAAAC AAACTCAGTG ATGCCAGCGG GAGTTCTATT CATAGCCATC TAACAAATAA TGCACTGGTG GAGATCCATA CTGCAAATGG TCGTTGGGTA AGAGACATTT ATCTGCCGCA GGGACCCGAC CTTGAAGGTA AGATGGTTCG CTTTGTTTCG TCTGCAGGCT ATAGTTCAAC GGTTTTTTAT GGTGATCGAA AAGTCACACT CTCGGTGGGT AACACTCTTC TGTTCAAATA TGTAAATGGT CAGTGGTTCC GCTCCGGTGA ACTGGAGAAT AATCGAATCA CTTATGCTCA GCATATTTGG AGTGCTGAAC TGCCTGCGCA CTGGATCGTG CCTGGTTTAA ACTTGGTGAT TAAACAGGGC AATCTGAGCG GTCGCCTAAA TGATATCAAG ATTGGAGCAC CGGGTGAGCT GTTGTTGCAT ACAATTGATA TCGGGATGTT GACCACTCCC CGGGATCGCT TTGATTTTGC CAAAGACAAA GAAGCACATA GGGAATATTT CCAGACCATT CCTGTAAGTC GTATGATTGT TAATAATTAT GCGCCTCTAC ACCTAAAGGA AGTTATGTTA CCAACCGGAG AGTTATTGAC AGATATGGAT CCAGGAAATG GTGGGTGGCA TAGTGGTACA ATGCGTCAAA GAATAGGTAA AGAATTGGTT TCGCATGGCA TTGATAATGC TAACTATGGT TTAAATAGTA CCGCAGGCTT AGGGGAGAAT AGTCATCCAT ATGTAGTTGC GCAATTAGCG GCACATAATA GCCGCGGTAA TTATGCTAAT GGCATCCAGG TTCATGGTGG CTCCGGAGGT GGGGGAATTG TTACTTTAGA TTCCACATTG GGGAATGAGT TCAGTCATGA AGTTGGTCAT AATTATGGTC TTGGTCATTA TGTAGATGGT TTCAAGGGTT CTGTACATCG TAGTGCAGAA AATAACAACT CAACTTGGGG ATGGGATGGT GATAAAAAAC GGTTTATTCC TAACTTTTAT CCGTCTCAAA CAAATGAAAA GAGTTGTCTG AATAATCAGT GTCAAGAACC GTTTGATGGA CACAAATTTG GTTTTGACGC CATGGCGGGA GGCAGCCCTT TCTCTGCTGC AAACCGTTTC ACAATGTATA CTCCGAATTC ATCGGCTATC ATCCAGCGTT TTTTTGAAAA TAAAGCTGTG TTCGATAGCC GTTCCTCCAC CGGCTTCAGC AAGTGGAATG CAGATACGCA GGAAATGGAA CCGTATGAAC ACACCATTGA CCGTGCGGAG CAGATTACGG CTTCAGTCAA TGAGCTAAGT GAAAGCAAAA TGGCTGAGCT GATGGCAGAG TACGCTGTCG TCAAAGTGCA TATGTGGAAC GGTAACTGGA CAAGAAACAT CTATATCCCT ACAGCCTCCG CAGATAATAG AGGCAGTATC CTGACCATCA ACCATGAGGC CGGTTATAAT AGTTATCTGT TTATAAATGG TGACGAAAAG GTCGTTTCCC AGGGGTATAA AAAGAGCTTT GTTTCCGATG GTCAGTTCTG GAAAGAACGT GATGTGGTTG ATACTCGTGA AGCGCGTAAG CCAGAGCAGT TTGGTGTTCC TGTGACGACC CTGGTGGGGT ATTACGATCC GGAAGGCACG CTGTCAAGCT ACATCTATCC TGCGATGTAT GGTGCCTATG GCTTCACTTA TTCCGATGAT AGTCAGAATC TATCCGATAA CGACTGCCAG CTGCAGGTGG ATACGAAAGA AGGGCAGTTG CGATTCAGAC TGGCTAATCA CCGGGCTAAC AACACTGTAA TGAATAAGTT CCATATTAAC GTGCCAACAG AAAGTCAGCC CACACAGGCC ACATTGGTTT GCAATAACAA GATACTGGAT ACCAAATCGC TCACACCTGC GCCAGAAGGA CTTACCTATA CTGTAAATGG GCAGGCACTT CCAGCAAAAG AAAACGAGGG ATGCATCGTG TCCGTGAATT CAGGTAAACG TTACTGTTTG CCGGTTGGTC AACGGTCAGG ATATAGCCTT CCTGACTGGA TTGTTGGGCA GGAAGTCTAT GTCGACAGCG GGGCTAAAGC GAAAGTGCTG CTTTCTGACT GGGATAACCT GTCCTATAAC AGGATTGGTG AGTTTGTAGG TAATGTGAAC CCAGCTGATA TGAAAAAAGT TAAAGCCTGG AACGGACAGT ATTTGGACTT CAGTAAACCT AGGTCAATGA GGGTTGTATA TAAATAA
|
Protein sequence | MNTKMNERWR TPMKLKYLSC TILAPLAIGV FSATAADNNS AIYFNTSQPI NDLQGSLAAE VKFAQSQILP AHPKEGDSQP HLTSLRKSLL LVRPVKADDK TPVQVEARDD NNKILGTLTL YPPSSLPDTI YHLDGVPEGG IDFTPHNGTK KIINTVAEVN KLSDASGSSI HSHLTNNALV EIHTANGRWV RDIYLPQGPD LEGKMVRFVS SAGYSSTVFY GDRKVTLSVG NTLLFKYVNG QWFRSGELEN NRITYAQHIW SAELPAHWIV PGLNLVIKQG NLSGRLNDIK IGAPGELLLH TIDIGMLTTP RDRFDFAKDK EAHREYFQTI PVSRMIVNNY APLHLKEVML PTGELLTDMD PGNGGWHSGT MRQRIGKELV SHGIDNANYG LNSTAGLGEN SHPYVVAQLA AHNSRGNYAN GIQVHGGSGG GGIVTLDSTL GNEFSHEVGH NYGLGHYVDG FKGSVHRSAE NNNSTWGWDG DKKRFIPNFY PSQTNEKSCL NNQCQEPFDG HKFGFDAMAG GSPFSAANRF TMYTPNSSAI IQRFFENKAV FDSRSSTGFS KWNADTQEME PYEHTIDRAE QITASVNELS ESKMAELMAE YAVVKVHMWN GNWTRNIYIP TASADNRGSI LTINHEAGYN SYLFINGDEK VVSQGYKKSF VSDGQFWKER DVVDTREARK PEQFGVPVTT LVGYYDPEGT LSSYIYPAMY GAYGFTYSDD SQNLSDNDCQ LQVDTKEGQL RFRLANHRAN NTVMNKFHIN VPTESQPTQA TLVCNNKILD TKSLTPAPEG LTYTVNGQAL PAKENEGCIV SVNSGKRYCL PVGQRSGYSL PDWIVGQEVY VDSGAKAKVL LSDWDNLSYN RIGEFVGNVN PADMKKVKAW NGQYLDFSKP RSMRVVYK
|
| |