Gene EcolC_3259 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_3259 
Symbol 
ID6066839 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp3568750 
End bp3569724 
Gene Length975 bp 
Protein Length324 aa 
Translation table11 
GC content54% 
IMG OID641602674 
Productdelta-aminolevulinic acid dehydratase 
Protein accessionYP_001726208 
Protein GI170021254 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0113] Delta-aminolevulinic acid dehydratase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.583001 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAGACT TAATCCAACG CCCTCGTCGC CTGCGCAAAT CTCCTGCGCT GCGCGCTATG 
TTTGAAGAGA CAACACTTAG CCTTAACGAC CTGGTGTTGC CGATCTTTGT TGAAGAAGAA
ATTGACGACT ACAAAGCCGT TGAAGCCATG CCAGGCGTGA TGCGCATTCC AGAGAAACAT
CTGGCACGCG AAATTGAACG CATCGCCAAC GCCGGTATCC GTTCCGTGAT GACTTTCGGC
ATCTCTCACC ATACCGATGA AACCGGCAGC GATGCCTGGC GGGAAGATGG ACTGGTGGCG
CGAATGTCGC GCATCTGCAA GCAGACCGTG CCAGAAATGA TCGTCATGTC AGACACCTGC
TTCTGCGAAT ACACTTCTCA CGGTCACTGC GGTGTGCTGT GCGAGCATGG CGTCGACAAC
GACGCGACTC TGGAAAATTT AGGCAAGCAA GCCGTGGTTG CTGCTGCTGC AGGTGCAGAC
TTCATCGCCC CTTCCGCCGC GATGGACGGC CAGGTACAGG CGATTCGTCA GGCGCTGGAC
GCTGCGGGCT TTAAAGATAC GGCGATTATG TCGTATTCGA CCAAGTTCGC CTCTTCCTTT
TATGGTCCGT TCCGTGAAGC TGCCGGAAGC GCATTAAAAG GCGACCGCAA AAGCTATCAG
ATGAACCCAA TGAACCGTCG CGAGGCGATT CGTGAATCAC TGCTGGATGA AGCCCAGGGC
GCAGACTGCC TGATGGTTAA ACCTGCTGGA GCGTACCTCG ACATCGTGCG TGAGCTGCGT
GAACGTACTG AATTGCCGAT TGGCGCGTAT CAGGTGAGCG GTGAGTATGC GATGATTAAG
TTCGCCGCGC TGGCGGGTGC TATAGATGAA GAGAAAGTCG TGCTCGAAAG CTTAGGTTCG
ATTAAGCGTG CGGGTGCGGA TCTGATTTTC AGCTACTTTG CGCTGGATTT GGCTGAGAAG
AAGATTCTGC GTTAA
 
Protein sequence
MTDLIQRPRR LRKSPALRAM FEETTLSLND LVLPIFVEEE IDDYKAVEAM PGVMRIPEKH 
LAREIERIAN AGIRSVMTFG ISHHTDETGS DAWREDGLVA RMSRICKQTV PEMIVMSDTC
FCEYTSHGHC GVLCEHGVDN DATLENLGKQ AVVAAAAGAD FIAPSAAMDG QVQAIRQALD
AAGFKDTAIM SYSTKFASSF YGPFREAAGS ALKGDRKSYQ MNPMNRREAI RESLLDEAQG
ADCLMVKPAG AYLDIVRELR ERTELPIGAY QVSGEYAMIK FAALAGAIDE EKVVLESLGS
IKRAGADLIF SYFALDLAEK KILR