Gene Information |
Locus tag | ECH74115_0330 |
Symbol | |
ID | 6969040 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 335024 |
End bp | 337222 |
Gene Length | 2199 bp |
Protein Length | 732 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 643384391 |
Product | xanthine dehydrogenase family protein, molybdopterin-binding subunit |
Protein accession | YP_002268906 |
Protein GI | 209395976 |
COG category | [C] Energy production and conversion |
COG ID | [COG1529] Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs |
TIGRFAM ID | |
|
|
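The coordinate, length, and GC fields in the table above are related by simple arithmetic. The following is a minimal sketch of those checks, assuming the Start bp and End bp values are 1-based and inclusive (an assumption, though it is consistent with the listed Gene Length). The function names `gene_length`, `protein_length`, and `gc_percent` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 335024
END_BP = 337222


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates (an assumption)."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residues encoded by an unspliced CDS that ends in a single stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1  # subtract the stop codon


def gc_percent(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters in seq.

    Spaces and other characters are ignored, so the 10-base blocks used
    in the Sequence table can be pasted in unchanged.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotides found")
    return 100.0 * sum(b in "GC" for b in bases) / len(bases)


length = gene_length(START_BP, END_BP)
print(length)                  # 2199, matching "Gene Length"
print(protein_length(length))  # 732, matching "Protein Length"
```

Passing the full gene sequence from the Sequence table to `gc_percent` would give a value to compare against the listed GC content.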
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 54 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAATTTG ATAAACCCGC AGGGGAAAAC CCGATCGATC AGCTGAAGGT TGTCGGTCGT CCCCATGACC GCATCGACGG ACCGCTGAAA ACTACCGGCA CGGCACGCTA CGCCTACGAA TGGCATGAAG AATCCCCCAA CGCCGCCTAT GGCTATATCG TCGGTTCCGC CATTGCCAAA GGACGCCTCA CCGCCCTTGA TACGGACGCC GCGCAAAAAG CGCCGGGCGT ACTGCCTGTC ATTACCGCCA GTAACGCCGG GGCACTCAGT AAAGGCGACA AAAACACCGC CAGGCTGTTA GGCGGCCCCA CTATTGAGCA CTATCATCAG GCCATTGCTC TGGTAGTGGC CGAGACCTTC GAACAGGCGC GAGCGGCGGC CTCGCTGGTG CAGGCGCACT ATCGCCGTAA TAAAGGAGCT TACTCCCTGG CGGACGAAAA ACAGGCCGTC AGTCAGCCGC CGGAAGACAC GCCCGACAAG AACGTCGGTG ACTTTGACGG GGCTTTCTCC TCCGCTGCGG TGAAGATTGA TGCTACCTAC ACGACCCCGG ACCAGAGCCA TATGGCGATG GAGCCGCATG CCTCGATGGC CGTCTGGGAT GGAAATAAGC TTACTCTCTG GACCTCAAAT CAGATGATTG ACTGGTGCCG CACCGATCTG GCAAAAACGC TGAAAGTGCC CGTGGAGAAT GTGCGTATTA TCTCCCCGTA TATCGGCGGA GGGTTTGGCG GCAAGCTGTT CCTGAGAAGC GATGCGCTGC TGGCGGCCCT CGCCGCCCGA GCGGTGAAAC GTCCGGTTAA AGTGATGCTC CCCCGCCCCT CTATTCCCAA TAACACCACG CACCGCCCCG CCACCCTTCA GCACTTGCGT ATCGGTGCCG ACCAGAGCGG GAAAATCACC GCTATCTCAC ATGAAAGCTG GTCTGGAAAC CTGCCCGGCG GCACGCCGGA AACGGCGGTA CAGCAAAGCG AATTACTCTA CGCCGGGGCA AACCGTCATA CCGGCCTGCG GCTCGCCACG CTTGATTTGC CGGAAGGGAA CGCCATGCGT GCGCCCGGCG AAGCCCCCGG TCTGATGGCG CTCGAAATCG CGATCGACGA ACTGGCGGAA AAAGCGGGCA TCGATCCCGT CGAGTTTCGC ATCCTGAATG ACACTCAGAT TGACCCCGCC GACCCGACGC GCCGCTTCTC TCGCCGTCAG CTTATCGAGT GCTTGCGCAC CGGAGCGGAT AAATTTGGTT GGAAGCAGCG CAACGCCACA CCCGGACAGG TGCGCGACGG GGAGTGGCTA GTCGGCCACG GCGTCGCGGC GGGCTTTCGC AATAATCTGC TGGAAAAATC GGGGGCTCGG GTTCACCTCG AACCAAACGG CACCGTTACC GTGGAAACGG ACATGACCGA CATTGGCACC GGCAGCTACA CCATTCTGGC CCAGACGGCA GCGGAAATGC TTGGCGTACC GCTGGAGCAG GTTGCGGTTC ACCTCGGCGA TTCCAGTTTC CCGGTTTCTG CGGGTTCTGG TGGACAATGG GGCGCGAATA CCTCCACCTC CGGCGTTTAC GCCGCCTGTG TGAAGCTTCG CGAAATGATT GCCTCGGCAG TCGGGTTTGA TCCTGAGCAG TCGCAGTTTG CCGACGGCAA GATTACCAAC GGTACCCGAA GCGCCATACT ACATGAGGCC ACCGCAGGCG GCAGACTGAC AGCGGAAGAG AGCATTGAAT TCGGAACACT GAGCAAGGAG TACCAGCAGT CGACCTTTGC CGGGCATTTT GTGGAGGTCG GCGTGCATAG CGCGACGGGA 
GAAGTTCGGG TCCGGCGTAT GCTCGCTGTG TGTGCTGCAG GACGCATCCT GAATCCGAAA ACTGCACGCA GCCAAGTCAT TGGCGCAATG ACTATGGGCA TGGGCGCGGC ACTGATGGAG GAGCTGGCGG TGGATGACCG TTTGGGCTAC TTCGTTAATC ACGATATGGC GGGGTATGAG GTGCCGGTCC ATGCGGATAT CCCAAAACAG GAAGTGATTT TCCTGGATGA TACCGACCCC ATATCCTCCC CGATGAAGGC CAAAGGTGTC GGTGAGCTGG GCCTGTGCGG CGTGAGCGCG GCTATCGCCA ACGCGGTGTA TAACGCCACC GGTATTCGGG TACGCGATTA TCCCATCACT CTGGATAAGC TGCTCGATAA GCTGCCGGAT GTGGTTTAA
|
Protein sequence | MKFDKPAGEN PIDQLKVVGR PHDRIDGPLK TTGTARYAYE WHEESPNAAY GYIVGSAIAK GRLTALDTDA AQKAPGVLPV ITASNAGALS KGDKNTARLL GGPTIEHYHQ AIALVVAETF EQARAAASLV QAHYRRNKGA YSLADEKQAV SQPPEDTPDK NVGDFDGAFS SAAVKIDATY TTPDQSHMAM EPHASMAVWD GNKLTLWTSN QMIDWCRTDL AKTLKVPVEN VRIISPYIGG GFGGKLFLRS DALLAALAAR AVKRPVKVML PRPSIPNNTT HRPATLQHLR IGADQSGKIT AISHESWSGN LPGGTPETAV QQSELLYAGA NRHTGLRLAT LDLPEGNAMR APGEAPGLMA LEIAIDELAE KAGIDPVEFR ILNDTQIDPA DPTRRFSRRQ LIECLRTGAD KFGWKQRNAT PGQVRDGEWL VGHGVAAGFR NNLLEKSGAR VHLEPNGTVT VETDMTDIGT GSYTILAQTA AEMLGVPLEQ VAVHLGDSSF PVSAGSGGQW GANTSTSGVY AACVKLREMI ASAVGFDPEQ SQFADGKITN GTRSAILHEA TAGGRLTAEE SIEFGTLSKE YQQSTFAGHF VEVGVHSATG EVRVRRMLAV CAAGRILNPK TARSQVIGAM TMGMGAALME ELAVDDRLGY FVNHDMAGYE VPVHADIPKQ EVIFLDDTDP ISSPMKAKGV GELGLCGVSA AIANAVYNAT GIRVRDYPIT LDKLLDKLPD VV
|
| |
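The gene sequence as listed begins with ATG and ends with TAA, so it appears to be given in coding orientation even though the gene lies on the minus strand. The sketch below shows how the protein sequence can be reproduced from it. It is an illustration under stated assumptions, not the database's own pipeline: the `translate` helper is hypothetical, and it uses the standard codon-to-amino-acid assignments, which translation table 11 shares. Table 11 differs in which codons may act as start codons, and this sketch does not handle alternative starts, which does not matter here because the gene starts with ATG.

```python
from itertools import product

# Codon table built in TCAG order; the amino-acid assignments are those of
# the standard code, which translation table 11 also uses.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string up to the first stop codon.

    Whitespace is removed first, so the 10-base blocks from the table can
    be pasted in directly. A trailing partial codon is ignored.
    """
    cleaned = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(cleaned) - 2, 3):
        aa = CODON_TABLE[cleaned[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGAAATTTG ATAAACCCGC AGGGGAAAAC"))  # MKFDKPAGEN
```

The output matches the first ten residues of the listed protein sequence.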