Gene EcolC_2263 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_2263 
Symbol 
ID6067034 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp2493390 
End bp2494460 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content54% 
IMG OID641601667 
Productphenylacetate-CoA oxygenase/reductase, PaaK subunit 
Protein accessionYP_001725226 
Protein GI170020272 
COG category[C] Energy production and conversion 
COG ID[COG1018] Flavodoxin reductases (ferredoxin-NADPH reductases) family 1 
TIGRFAM ID[TIGR02160] phenylacetate-CoA oxygenase/reductase, PaaK subunit 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.379907 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAACGT TTCATTCCTT AACGGTGGCA AAAGTGGAGC CGGAAACCCG TGATGCGGTG 
ACCATTACCT TTGCGGTGCC CCAGCCTTTG CAGGAGGCGT ATCGCTTTCG CCCCGGTCAA
CATTTGACCT TAAAAGCCAG CTTTGATGGT GAAGAATTAC GCCGTTGTTA CTCCATTTGC
CGCAGCTATC TGCCTGGCGA AATTAGTGTG GCGGTGAAAG CCATTGAAGG CGGACGTTTC
TCCCGCTATG CCCGCGAACA CATCCGCCAG GGTATGACGC TGGAGGTCAT GGTGCCGCAG
GGGCATTTCG GCTATCAGCC GCAGGCCGAA CGCCAGGGCC GCTATCTGGC AATTGCAGCA
GGATCAGGTA TTACGCCAAT GCTGGCGATT ATCGCCACCA CTTTACAAAC CGAGCCTGAA
AGTCAGTTCA CCCTGATCTA CGGTAACCGT ACCAGCCAGA GCATGATGTT TCGCCAGGCA
CTGGCAGACC TGAAAGACAA ATATCCTCAG CGTTTACAGT TGTTGTGCAT TTTCAGTCAG
GAAACCCTCG ACAGCGATCT GCTTCACGGG CGTATTGACG GTGAAAAATT ACAGTCACTT
GGGGCCTCGC TCATTAATTT TCGTCTTTAT GATGAGGCGT TTATTTGTGG TCCGGCGGCG
ATGATGGATG ACGCGGAAAC CGCCTTAAAA GCACTGGGAA TGCCAGAAAA AGCGATTCAT
CTGGAGCGGT TTAATACGCC TGGCACGCGC GTCAAACGTA GCGTTAACGT GCAAAGTGAC
GGACAAAAAG TGACTGTACG TCAGGATGGG CGGGATCGGG AAATCGTGCT TAATGCCGAC
GATGAAAGCA TTCTCGATGC GGCATTGCGC CAGGGGGCGG ATCTGCCCTA TGCCTGCAAA
GGCGGCGTCT GTGCGACCTG CAAATGCAAA GTGCTGCGTG GCAAAGTGGC GATGGAAACC
AATTACAGTC TGGAACCGGA TGAACTGGCC GCAGGCTATG TGTTGAGTTG CCAGGCACTG
CCGCTGACCA GCGATGTGGT GGTTGACTTT GACGCGAAGG GGATGGCATG A
 
Protein sequence
MTTFHSLTVA KVEPETRDAV TITFAVPQPL QEAYRFRPGQ HLTLKASFDG EELRRCYSIC 
RSYLPGEISV AVKAIEGGRF SRYAREHIRQ GMTLEVMVPQ GHFGYQPQAE RQGRYLAIAA
GSGITPMLAI IATTLQTEPE SQFTLIYGNR TSQSMMFRQA LADLKDKYPQ RLQLLCIFSQ
ETLDSDLLHG RIDGEKLQSL GASLINFRLY DEAFICGPAA MMDDAETALK ALGMPEKAIH
LERFNTPGTR VKRSVNVQSD GQKVTVRQDG RDREIVLNAD DESILDAALR QGADLPYACK
GGVCATCKCK VLRGKVAMET NYSLEPDELA AGYVLSCQAL PLTSDVVVDF DAKGMA