Gene EcolC_3335 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_3335 
Symbol 
ID6065177 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp3658577 
End bp3660775 
Gene Length2199 bp 
Protein Length732 aa 
Translation table11 
GC content59% 
IMG OID641602750 
Productaldehyde oxidase and xanthine dehydrogenase molybdopterin binding 
Protein accessionYP_001726283 
Protein GI170021329 
COG category[C] Energy production and conversion 
COG ID[COG1529] Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.880255 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATTTG ATAAACCCGC AGGGGAAAAC CCGATCGATC AGCTGAAGGT TGTCGGTCGT 
CCCCATGACC GCATCGACGG ACCGCTGAAA ACTACCGGCA CGGCACGCTA CGCCTACGAA
TGGCATGAAG AAGCTCCCAA CGCCGCCTAT GGCTATATCG TCGGTTCCGC CATTGCCAAA
GGACGCCTCA TCGCCCTTGA TACGGACGCA GCGCAAAAAG CGCCGGGCGT ACTGGCTGTC
ATTACCGCCA GTAACGCCGG GGCACTCGGC AAAGGCGACA AAAACACCGC CAGGCTGTTA
GGCGGCCCCA CTATTGAGCA CTATCATCAG GCCATTGCGC TGGTAGTGGC CGAGACCTTC
GAACAGGCAC GAGCGGCGGC CTCGCTGGTG CAGCCGCACT ATCGCCGTAA TAAAGGAGCT
TACTCCCTGG CGGACGAAAA ACAGGCCGTC AGTCAGCCGC CGGAAGACAC GCCCGACAAA
AACGTCGGTG ACTTTGACGG GGCTTTCACC TCCGCTGCGG TGAAGATTGA TGCTACCTAC
ACGACCCCGG ACCAGAGCCA TATGGCGATG GAGCCGCATG CCTCGATGGC CGTCTGGGAT
GGAAATAAGC TTACTCTCTG GACCTCAAAT CAGATGATTG ACTGGTGCCG CACCGATCTG
GCAAAAACGC TAAAAGTGCC CGTGGAGAAT GTGCGTATTA TCTCCCCGTA TATCGGCGGA
GGGTTTGGCG GCAAGCTGTT CCTGAGAAGC GATGCACTGC TGGCGGCCCT CGCCGCCCGA
GCGGTGAAAC GTCCGGTTAA AGTGATGCTC CCCCGCCCCA CTATTCCCAA TAACACCACG
CACCGCCCCG CCACCCTTCA ACACCTGCGT ATCGGTGCCG ACCAGAGCGG GAAAATCACC
GCTATCTCAC ATGAAAGCTG GTCCGGAAAC CTGCCCGGCG GCACGCCGGA AACGGCGGTA
CAGCAAAGTG AATTACTCTA CGCAGGGGCA AACCGTCATA CCGGCCTGCG GCTCGCCACG
CTTGATTTGC CGGAAGGGAA CGCCATGCGT GCGCCCGGCG AAGCCCCCGG TCTGATGGCG
CTCGAAATCG CGATCGACGA ACTGGCGGAA AAAGCGGGCA TCGATCCCGT CGAGTTTCGC
ATCCTGAATG ACACTCAGGT TGACCCCGCC GACCCGACGC GCCGCTTCTC TCGCCGTCAG
CTTATCGAGT GCTTGCGCAC CGGAGCGGAT AAATTTGGCT GGAAGCAGCG CAACGCCACA
CCCGGACAGG TGCGCGACGG GGAGTGGCTA GTCGGCCACG GTGTTGCGGC GGGCTTTCGC
AATAATCTGC TGGAAAAATC GGGGGCTCGG GTTCACCTCG AACCAAACGG CACCGTTACC
GTGGAAACGG ACATGACCGA CATTGGCACC GGCAGCTACA CCATTCTGGC CCAGACGGCA
GCGGAAATGC TTGGCGTACC GCTGGAGCAG GTTGCGGTTC ACCTCGGCGA TTCCAGTTTC
CCGGTTTCTG CGGGTTCTGG TGGACAATGG GGCGCGAATA CCTCCACCTC CGGCGTTTAC
GCCGCCTGTG TGAAGCTTCG CGAAATGATT GCCTCGGCAG TCGGGTTTGA TCCTGAGCAG
TCGCAGTTTG CCGACGGCAA GATTACCAAC GGTACCCGAA GCGCCACGCT ACATGAGGCC
ACCGCAGGCG GCAGACTGAC AGCGGAAGAG AGCATTGAAT TCGGAACACT GAGCAAGGAG
TACCAGCAGT CGACCTTTGC CGGGCATTTT GTGGAGGTCG GCGTGCATAG CGCGACGGGA
GAAGTTCGGG TCCGGCGTAT GCTCGCTGTG TGTGCTGCAG GACGCATCCT GAATCCGAAA
ACTGCACGCA GCCAGGTCAT TGGCGCAATG ACTATGGGCA TGGGCGCGGC ACTGATGGAG
GAGCTGGCGG TGGATGACCG TTTGGGCTAC TTCGTTAATC ACGATATGGC GGGGTATGAG
GTGCCGGTTC ATGCGGATAT CCCAAAACAG GAGGTGATTT TCCTGGATGA TACCGACCCC
ATATCCTCCC CGATGAAGGC CAAAGGTGTC GGTGAGCTGG GCCTGTGCGG CGTGAGCGCG
GCTATCGCCA ACGCGGTGTA TAACGCCACC GGTATTCGGG TACGGGATTA TCCCATCACT
CTGGATAAGC TGCTCGATAA GCTGCCGGAT GTGGTTTAA
 
Protein sequence
MKFDKPAGEN PIDQLKVVGR PHDRIDGPLK TTGTARYAYE WHEEAPNAAY GYIVGSAIAK 
GRLIALDTDA AQKAPGVLAV ITASNAGALG KGDKNTARLL GGPTIEHYHQ AIALVVAETF
EQARAAASLV QPHYRRNKGA YSLADEKQAV SQPPEDTPDK NVGDFDGAFT SAAVKIDATY
TTPDQSHMAM EPHASMAVWD GNKLTLWTSN QMIDWCRTDL AKTLKVPVEN VRIISPYIGG
GFGGKLFLRS DALLAALAAR AVKRPVKVML PRPTIPNNTT HRPATLQHLR IGADQSGKIT
AISHESWSGN LPGGTPETAV QQSELLYAGA NRHTGLRLAT LDLPEGNAMR APGEAPGLMA
LEIAIDELAE KAGIDPVEFR ILNDTQVDPA DPTRRFSRRQ LIECLRTGAD KFGWKQRNAT
PGQVRDGEWL VGHGVAAGFR NNLLEKSGAR VHLEPNGTVT VETDMTDIGT GSYTILAQTA
AEMLGVPLEQ VAVHLGDSSF PVSAGSGGQW GANTSTSGVY AACVKLREMI ASAVGFDPEQ
SQFADGKITN GTRSATLHEA TAGGRLTAEE SIEFGTLSKE YQQSTFAGHF VEVGVHSATG
EVRVRRMLAV CAAGRILNPK TARSQVIGAM TMGMGAALME ELAVDDRLGY FVNHDMAGYE
VPVHADIPKQ EVIFLDDTDP ISSPMKAKGV GELGLCGVSA AIANAVYNAT GIRVRDYPIT
LDKLLDKLPD VV