Gene Information |
Locus tag | ECH74115_0662 |
Symbol | |
ID | 6970000 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 692056 |
End bp | 693174 |
Gene Length | 1119 bp |
Protein Length | 372 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 643384699 |
Product | carboxylate-amine ligase |
Protein accession | YP_002269212 |
Protein GI | 209400990 |
COG category | [S] Function unknown |
COG ID | [COG2170] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR02050] uncharacterized enzyme |
|
|
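The length fields in the record above follow from the coordinates. The sketch below reproduces that arithmetic. It assumes that Start bp and End bp are 1-based and inclusive, which matches the values listed here but is not stated in the record. The function name feature_lengths is hypothetical.

```python
def feature_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based, inclusive coordinates and a CDS that ends in a stop codon.
    """
    gene_len = end_bp - start_bp + 1      # inclusive span on the replicon
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1       # the final codon is the stop codon
    return gene_len, protein_len


# Coordinates of ECH74115_0662 on NC_011353, taken from the record.
print(feature_lengths(692056, 693174))    # (1119, 372)
```

The result agrees with the Gene Length (1119 bp) and Protein Length (372 aa) fields.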
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 73 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCATTAC CCGATTTTCA TGTTTCTGAA CCTTTTACCC TCGGTATTGA ACTGGAAATG CAGGTGGTTA ATCCGCCGGG CTATGACTTA AGCCAGGACT CTTCAATGCT GATTGACGCG GTTAAAAATA AGATCACGGC CGGAGAGGTA AAGCACGATA TCACCGAAAG TATGCTGGAG CTGGCGACGG ATGTTTGCCG TGATATCAAC CAGGCTGCCG GGCAATTTTC AGCGATGCAG AAAGTCGTAT TGCAGGCAGC CGCAGACCAT CATCTGGAAA TTTGCGGCGG TGGCACGCAC CCGTTTCAGA AATGGCAGCG TCAGGAGGTA TGCGACAACG AACGCTATCA ACGAACGCTG GAAAACTTTG GCTATCTCAT CCAGCAGGCG ACCGTTTTTG GTCAGCATGT CCATGTTGGC TGTGCCAGTG GCGATGACGC CATTTATTTG CTGCACGGCT TGTCACGGTT TGTGCCGCAC TTTATCGCCC TTTCCGCCGC GTCGCCATAT ATGCAGGGAA CGGATACGCG TTTTGCCTCC TCACGACCGA ATATTTTTTC CGCCTTTCCT GATAATGGCC CGATGCCGTG GGTCAGTAAC TGGCAACAAT TTGAAGCCCT GTTTCGCTGT CTGAGTTACA CCACGATGAT CGACAGCATT AAAGATCTGC ACTGGGATAT TCGCCCCAGT CCTCATTTTG GCACGGTGGA GGTTCGGGTG ATGGATACCC CGTTAACCCT TAGCCACGCG GTAAATATGG CGGGATTAAT TCAGGCCACC GCCCACTGGT TACTGACAGA ACGCCCGTTC AAACATAAGG AGAAAGATTA CCTGCTGTAT AAATTCAACC GTTTCCAGGC CTGCCGCTAT GGGCTGGAAG GCGTCATTAC CGATCCGTAC ACTGGCGATC GTCGACCACT AACGGAAGAC ACCTTGCGAT TGCTGGAAAA AATCGCCCCT TCTGCACATA AAATTGGTGC ATCGAGCGCG ATTGAGGCCC TGCATCGCCA GGTCGTCAGC GGTCTGAATG AAGCGCAGCT GATGCGCGAT TTCGTCGCCG ATGGCGGCTC GCTGATTGGG CTGGTGAAAA AGCATTGTGA GATCTGGGCC GGTGACTAA
|
Protein sequence | MPLPDFHVSE PFTLGIELEM QVVNPPGYDL SQDSSMLIDA VKNKITAGEV KHDITESMLE LATDVCRDIN QAAGQFSAMQ KVVLQAAADH HLEICGGGTH PFQKWQRQEV CDNERYQRTL ENFGYLIQQA TVFGQHVHVG CASGDDAIYL LHGLSRFVPH FIALSAASPY MQGTDTRFAS SRPNIFSAFP DNGPMPWVSN WQQFEALFRC LSYTTMIDSI KDLHWDIRPS PHFGTVEVRV MDTPLTLSHA VNMAGLIQAT AHWLLTERPF KHKEKDYLLY KFNRFQACRY GLEGVITDPY TGDRRPLTED TLRLLEKIAP SAHKIGASSA IEALHRQVVS GLNEAQLMRD FVADGGSLIG LVKKHCEIWA GD
|
| |
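The sequences above are printed in space-separated blocks of 10. The sketch below is one way to strip that grouping, compute GC content, and translate the coding sequence. It assumes the gene sequence is given 5' to 3' on the coding strand, which is consistent with it starting with ATG. The helper names clean, gc_percent, and translate are hypothetical. Translation table 11 assigns the same amino acid to each codon as the standard code and differs only in which codons may act as starts, so a standard codon table is enough for a gene that starts with ATG.

```python
BASES = "TCAG"
# Standard amino-acid assignments, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the block spacing and line breaks used in the record."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three and last blocks of the gene sequence, copied from the record.
print(translate("ATGCCATTAC CCGATTTTCA TGTTTCTGAA"))  # MPLPDFHVSE
print(translate("GGTGACTAA"))                         # GD
print(gc_percent("ATGCCATTAC CCGATTTTCA"))            # 40.0
```

The translated fragments match the first block (MPLPDFHVSE) and the last two residues (GD) of the protein sequence in the record. Passing the full gene sequence to the same functions would give the complete protein and the overall GC content.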