Gene Information |
Locus tag | EcE24377A_0295 |
Symbol | |
ID | 5589074 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | - |
Start bp | 322257 |
End bp | 324455 |
Gene Length | 2199 bp |
Protein Length | 732 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 640924020 |
Product | xanthine dehydrogenase family protein, molybdopterin-binding subunit |
Protein accession | YP_001461448 |
Protein GI | 157156885 |
COG category | [C] Energy production and conversion |
COG ID | [COG1529] Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs |
TIGRFAM ID | |
|
|
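The length fields above agree with one another if Start bp and End bp are read as 1-based, inclusive coordinates. That reading is an assumption, since the record does not state its coordinate convention. A minimal Python sketch of the check, with variable names chosen here purely for illustration:

```python
# Values copied from the Gene Information table above.
start_bp = 322257
end_bp = 324455

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the final codon is the stop codon
# and does not contribute an amino acid to the protein.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)      # 2199
print(protein_length)   # 732
```

Both values match the Gene Length (2199 bp) and Protein Length (732 aa) rows.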
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAATTTG ATAAACCCGC AGGGGAAAAC CCGATCGATC AGCTGAAGGT TGTCGGTCGT CCCCATGACC GCATCGACGG ACCGCTGAAA ACTACCGGCA CGGCACGCTA CGCCTACGAA TGGCATGAAG AAGCCCCCAA CGCCGCCTAT GGCTATATCG TCGGTTCCGC CATTGCCAAA GGACGCCTCA CCGCCCTTGA TACGGACGCC GCGCAAAAAG CGCCGGGCGT ACTGGCTGTC ATTACCGCCA GTAACGCCGG GGCACTCGGC AAAGGCGACA AAAACACCGC CAGACTGTTA GGCGGCCCCA CTATTGAGCA CTATCATCAG GCCATTGCGC TGGTAGTGGC CGAGACCTTC GAACAGGCGC GAGCGGCGGC CTCGCTGGTG CAGGCACACT ATCGCCGTAA TAAAGGAGCT TACTCCCTGG CGGACGAAAA ACAGGCCGTC AATCAGCCGC CGGAAGACAC GCCCGACAAA AACGTCGGTG ACTTTGACGG GGCTTTCACC TCCGCTGCGG TGAAGATTGA TGCTACCTAC ACGACCCCGG ACCAGAGCCA TATGGCGATG GAGCCGCATG CCTCGATGGC CGTCTGGGAT GGAAATAAGC TTACCCTCTG GACCTCAAAT CAGATGATTG ACTGGTGCCG CACCGATCTG GCAAAAACGC TGAAAGTTCC CGTGGAGAAT GTGCGTATTA TCTCCCCGTA TATCGGCGGA GGGTTTGGCG GCAAGCTGTT CCTGAGAAGC GATGCGCTGC TGGCGGCCCT CGCCGCCCGA GCGGTGAAAC GTCCGGTTAA AGTGATGCTC CCCCGCCCCT CTATTCCCAA TAACACCACG CACCGCCCCG CCACCCTTCA GCACTTGCGT ATCGGTGCCG ACCAGAGCGG GAAAATCACC GCTATCTCAC ATGAAAGCTG GTCTGGAAAC CTGCCCGGCG GCACGCCGGA AACGGCGGTA CAGCAAAGCG AATTACTCTA CGCCGGGGCG AATCGTCATA CCGGCCTGCG GCTCGCCACG CTTGATTTGC CGGAAGGGAA CGCCATGCGT GCGCCCGGCG AAGCCCCCGG TCTGATGGCG CTCGAAATCG CGATCGACGA ACTGGCGGAA AAAGCGGGCA TCGATCCCGT CGAGTTTCGC ATCCTGAATG ACACTCAGGT TGACCCCGCC GACCCGACGC GCTGCTTCTC TCGCCGTCAG CTTATCGAGT GCTTGCGCAC CGGAGCGGAT AAATTTGGCT GGAAGCAGCG CAACGCCACA CCCGGACAGG TGCGCGACGG GGAGTGGCTA GTCGGCCACG GTGTTGCGGC GGGCTTTCGC AATAATCTGC TGGAAAAATC GGGTGCTCGG GTTCACCTCG AACAAAACGG CACCGTTACC GTAGAAACGG ACATGACCGA CATTGGCACC GGCAGCTACA CCATTCTGGC CCAGACGGCA GCGGAAATGC TTGGCGTACC GCTGGAGCAG GTTGCGGTTC ACCTCGGCGA TTCCAGTTTC CCGGTTTCTG CGGGTTCTGG TGGACAATGG GGCGCGAATA CCTCCACCTC CGGCGTTTAC GCCGCCTGTA TGAAGCTTCG CGAAATGATT GCCTCGGCAG TCGGGTTTGA TCCTGAGCAG TCGCAGTTTG CCGACGGCAA GATTACCAAC GGTACCCGAA GCGCCACGCT ACATGAAGCC ACCGCAGGCG GCAGACTGAC AGCGGAAGAG AGCATTGAAT TCGGAACACT GAGCAAAGAG TACCAGCAGT CGACCTTTGC CGGGCATTTT GTGGAGGTCG GCGTGCATAG CGCGACGGGA 
GAAGTTCGGG TCCGGCGTAT GCTCGCTGTG TGTGCTGCAG GACGCATCCT GAATCCGAAA ACTGCGCGCA GCCAGGTCAT TGGCGCAATG ACTATGGGCA TGGGTGCGGC ACTGATGGAG GAGCTGGCGG TGGATGACCG TTTGGGCTAC TTCGTTAATC ACGATATGGC GGGGTATGAG GTGCCGGTCC ATGCGGATAT CCCAAAACAG GAGGTGATTT TCCTGGATGA TACCGACCCC ATATCCTCCC CGATGAAGGC CAAAGGTGTC GGTGAGCTGG GCCTGTGCGG CGTGAGCGCG GCTATCGCCA ACGCGGTGTA TAACGCCACC GGTATTCGGG TACGGGATTA TCCCATCACT CTGGATAAGC TGCTCGATAA GCTGCCGGAT GTAGTTTAA
|
Protein sequence | MKFDKPAGEN PIDQLKVVGR PHDRIDGPLK TTGTARYAYE WHEEAPNAAY GYIVGSAIAK GRLTALDTDA AQKAPGVLAV ITASNAGALG KGDKNTARLL GGPTIEHYHQ AIALVVAETF EQARAAASLV QAHYRRNKGA YSLADEKQAV NQPPEDTPDK NVGDFDGAFT SAAVKIDATY TTPDQSHMAM EPHASMAVWD GNKLTLWTSN QMIDWCRTDL AKTLKVPVEN VRIISPYIGG GFGGKLFLRS DALLAALAAR AVKRPVKVML PRPSIPNNTT HRPATLQHLR IGADQSGKIT AISHESWSGN LPGGTPETAV QQSELLYAGA NRHTGLRLAT LDLPEGNAMR APGEAPGLMA LEIAIDELAE KAGIDPVEFR ILNDTQVDPA DPTRCFSRRQ LIECLRTGAD KFGWKQRNAT PGQVRDGEWL VGHGVAAGFR NNLLEKSGAR VHLEQNGTVT VETDMTDIGT GSYTILAQTA AEMLGVPLEQ VAVHLGDSSF PVSAGSGGQW GANTSTSGVY AACMKLREMI ASAVGFDPEQ SQFADGKITN GTRSATLHEA TAGGRLTAEE SIEFGTLSKE YQQSTFAGHF VEVGVHSATG EVRVRRMLAV CAAGRILNPK TARSQVIGAM TMGMGAALME ELAVDDRLGY FVNHDMAGYE VPVHADIPKQ EVIFLDDTDP ISSPMKAKGV GELGLCGVSA AIANAVYNAT GIRVRDYPIT LDKLLDKLPD VV
|
| |
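As a quick sanity check on the sequences above, the first three 10-base blocks of the gene sequence can be translated and compared with the start of the protein sequence. The sketch below is illustrative only: the `translate` helper and the way the codon table is built are not part of the record. It uses the standard codon assignments, which translation table 11 shares with the standard code for elongation and stop codons.

```python
from itertools import product

# Codon table in NCBI's compact layout: bases ordered T, C, A, G, with the
# third codon position varying fastest. '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace (such as the 10-base block separators used in the record)
    is ignored. Hypothetical helper, written for this example only.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record.
first_blocks = "ATGAAATTTG ATAAACCCGC AGGGGAAAAC"
print(translate(first_blocks))  # MKFDKPAGEN
```

The result matches the first block of the protein sequence (MKFDKPAGEN). In the same way, the last nine bases of the gene sequence (GTAGTTTAA) translate to VV followed by a stop codon, which agrees with the end of the protein sequence.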