Gene EcolC_1139 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_1139 
Symbol 
ID6068044 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp1242530 
End bp1243891 
Gene Length1362 bp 
Protein Length453 aa 
Translation table11 
GC content53% 
IMG OID641600555 
Productaromatic-ring-hydroxylating dioxygenase, alpha subunit-like protein 
Protein accessionYP_001724133 
Protein GI170019179 
COG category[P] Inorganic ion transport and metabolism
[R] General function prediction only 
COG ID[COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCACAC CCTCAGATTT GAACATTTAC CAACTGATTG ATACCCAAAA TGGTCGGGTC 
ACTCCGCGTA TTTATACCGA CCCAGATATT TACCAACTGG AGCTTGAACG TATTTTCGGC
CGTTGCTGGT TATTTCTCGC CCACGAAAGC CAGATCCCAA AACCTGGTGA TTTCTTTAAC
ACCTACATGG GGGAAGATGC GGTTGTCGTA GTGCGTCAGA AAGACGGCAG CATTAAGGCG
TTTCTCAACC AATGCCGCCA CCGGGCCATG CGTGTGAGTT ATGCAGATTG CGGCAACACT
CGCGCCTTTA CCTGTCCGTA TCACGGCTGG TCTTATGGCA TTAACGGCGA GTTGATCGAT
GTACCGCTGG AACCTCGCGC CTACCCACAA GGGTTGTGTA AATCCCACTG GGGACTAAAC
GAAGTTCCTT GTGTGGAGAG TTATAAAGGG CTGATTTTTG GCAACTGGGA TACCAGCGCA
CCGGGCCTGC GTGATTACCT GGGTGACATT GCCTGGTATC TGGATGGCAT GCTGGATCGT
CGCGAAGGCG GCACCGAAAT TGTCGGCGGC GTACAAAAGT GGGTGATCAA CTGTAACTGG
AAATTCCCGG CAGAGCAGTT CGCCAGTGAC CAGTATCATG CTCTGTTCAG CCATGCTTCT
GCCGTTCAGG TATTAGGGGC GAAAGATGAT GGCAGCGATA AGCGCCTCGG TGATGGACAA
ACCGCCCGCC CGGTGTGGGA AACCGCCAAA GATGCGCTGC AATTTGGTCA GGACGGTCAC
GGTAGCGGTT TCTTCTTTAC TGAAAAACCG GATGCTAATG TCTGGGTCGA TGGCGCAGTT
TCAAGCTATT ACCGCGAAAC CTATGCCGAA GCAGAACAAC GTTTAGGTGA AGTTCGCGCC
CTGCGCCTGG CGGGTCATAA CAATATTTTC CCCACGCTTT CATGGCTCAA CGGCACTGCC
ACGCTCCGCG TCTGGCATCC GCGCGGCCCT GATCAAGTTG AAGTGTGGGC GTTCTGTATT
ACTGACAAAG CCGCCTCCGA TGAAGTTAAA GCCGCTTTTG AAAACAGCGC CACTCGTGCT
TTTGGTCCTG CTGGTTTTCT CGAGCAGGAT GACTCGGAGA ACTGGTGTGA AATCCAGAAA
TTGCTTAAAG GCCACCGCGC CCGCAACAGC AAACTGTGTC TGGAAATGGG GCTTGGTCAG
GAAAAGCGTC GCGACGACGG CATTCCTGGC ATTACTAACT ATATTTTCTC AGAAACTGCC
GCTCGCGGAA TGTACCAACG TTGGGCCGAT CTCCTGAGTA GCGAAAGCTG GCAAGAAGTG
CTCGATAAAA CCGCCGCTTA CCAGCAGGAG GTGATGAAAT GA
 
Protein sequence
MTTPSDLNIY QLIDTQNGRV TPRIYTDPDI YQLELERIFG RCWLFLAHES QIPKPGDFFN 
TYMGEDAVVV VRQKDGSIKA FLNQCRHRAM RVSYADCGNT RAFTCPYHGW SYGINGELID
VPLEPRAYPQ GLCKSHWGLN EVPCVESYKG LIFGNWDTSA PGLRDYLGDI AWYLDGMLDR
REGGTEIVGG VQKWVINCNW KFPAEQFASD QYHALFSHAS AVQVLGAKDD GSDKRLGDGQ
TARPVWETAK DALQFGQDGH GSGFFFTEKP DANVWVDGAV SSYYRETYAE AEQRLGEVRA
LRLAGHNNIF PTLSWLNGTA TLRVWHPRGP DQVEVWAFCI TDKAASDEVK AAFENSATRA
FGPAGFLEQD DSENWCEIQK LLKGHRARNS KLCLEMGLGQ EKRRDDGIPG ITNYIFSETA
ARGMYQRWAD LLSSESWQEV LDKTAAYQQE VMK