Gene EcDH1_0450 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_0450 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp476498 
End bp477847 
Gene Length1350 bp 
Protein Length449 aa 
Translation table11 
GC content53% 
IMG OID 
Productacetyl-CoA carboxylase, biotin carboxylase 
Protein accessionACX38139 
Protein GI260447717 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones40 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTGGATA AAATTGTTAT TGCCAACCGC GGCGAGATTG CATTGCGTAT TCTTCGTGCC 
TGTAAAGAAC TGGGCATCAA GACTGTCGCT GTGCACTCCA GCGCGGATCG CGATCTAAAA
CACGTATTAC TGGCAGATGA AACGGTCTGT ATTGGCCCTG CTCCGTCAGT AAAAAGTTAT
CTGAACATCC CGGCAATCAT CAGCGCCGCT GAAATCACCG GCGCAGTAGC AATCCATCCG
GGTTACGGCT TCCTCTCCGA GAACGCCAAC TTTGCCGAGC AGGTTGAACG CTCCGGCTTT
ATCTTCATTG GCCCGAAAGC AGAAACCATT CGCCTGATGG GCGACAAAGT ATCCGCAATC
GCGGCGATGA AAAAAGCGGG CGTCCCTTGC GTACCGGGTT CTGACGGCCC GCTGGGCGAC
GATATGGATA AAAACCGTGC CATTGCTAAA CGCATTGGTT ATCCGGTGAT TATCAAAGCC
TCCGGCGGCG GCGGCGGTCG CGGTATGCGC GTAGTGCGCG GCGACGCTGA ACTGGCACAA
TCCATCTCCA TGACCCGTGC GGAAGCGAAA GCTGCTTTCA GCAACGATAT GGTTTACATG
GAGAAATACC TGGAAAATCC TCGCCACGTC GAGATTCAGG TACTGGCTGA CGGTCAGGGC
AACGCTATCT ATCTGGCGGA ACGTGACTGC TCCATGCAAC GCCGCCACCA GAAAGTGGTC
GAAGAAGCGC CAGCACCGGG CATTACCCCG GAACTGCGTC GCTACATCGG CGAACGTTGC
GCTAAAGCGT GTGTTGATAT CGGCTATCGC GGTGCAGGTA CTTTCGAGTT CCTGTTCGAA
AACGGCGAGT TCTATTTCAT CGAAATGAAC ACCCGTATTC AGGTAGAACA CCCGGTTACA
GAAATGATCA CCGGCGTTGA CCTGATCAAA GAACAGCTGC GTATCGCTGC CGGTCAACCG
CTGTCGATCA AGCAAGAAGA AGTTCACGTT CGCGGCCATG CGGTGGAATG TCGTATCAAC
GCCGAAGATC CGAACACCTT CCTGCCAAGT CCGGGCAAAA TCACCCGTTT CCACGCACCT
GGCGGTTTTG GCGTACGTTG GGAGTCTCAT ATCTACGCGG GCTACACCGT ACCGCCGTAC
TATGACTCAA TGATCGGTAA GCTGATTTGC TACGGTGAAA ACCGTGACGT GGCGATTGCC
CGCATGAAGA ATGCGCTGCA GGAGCTGATC ATCGACGGTA TCAAAACCAA CGTTGATCTG
CAGATCCGCA TCATGAATGA CGAGAACTTC CAGCATGGTG GCACTAACAT CCACTATCTG
GAGAAAAAAC TCGGTCTTCA GGAAAAATAA
 
Protein sequence
MLDKIVIANR GEIALRILRA CKELGIKTVA VHSSADRDLK HVLLADETVC IGPAPSVKSY 
LNIPAIISAA EITGAVAIHP GYGFLSENAN FAEQVERSGF IFIGPKAETI RLMGDKVSAI
AAMKKAGVPC VPGSDGPLGD DMDKNRAIAK RIGYPVIIKA SGGGGGRGMR VVRGDAELAQ
SISMTRAEAK AAFSNDMVYM EKYLENPRHV EIQVLADGQG NAIYLAERDC SMQRRHQKVV
EEAPAPGITP ELRRYIGERC AKACVDIGYR GAGTFEFLFE NGEFYFIEMN TRIQVEHPVT
EMITGVDLIK EQLRIAAGQP LSIKQEEVHV RGHAVECRIN AEDPNTFLPS PGKITRFHAP
GGFGVRWESH IYAGYTVPPY YDSMIGKLIC YGENRDVAIA RMKNALQELI IDGIKTNVDL
QIRIMNDENF QHGGTNIHYL EKKLGLQEK