Gene EcHS_A1479 details


Gene Information

Locus tag: EcHS_A1479
Symbol: paaK
ID: 5591587
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 1479623
End bp: 1480693
Gene Length: 1071 bp
Protein Length: 356 aa
Translation table: 11
GC content: 53%
IMG OID: 640920636
Product: phenylacetate-CoA oxygenase/reductase, PaaK subunit
Protein accession: YP_001458192
Protein GI: 157160874
COG category: [C] Energy production and conversion
COG ID: [COG1018] Flavodoxin reductases (ferredoxin-NADPH reductases) family 1
TIGRFAM ID: [TIGR02160] phenylacetate-CoA oxygenase/reductase, PaaK subunit
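
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the coding sequence ends with a single stop codon that is not counted in the protein length. The function names are illustrative only and do not come from the database.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp, has_stop_codon=True):
    """Number of amino acids encoded by a CDS of the given length.

    Assumes the length is a whole number of codons and, by default,
    that the last codon is a stop codon that encodes no residue.
    """
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1 if has_stop_codon else codons


length = gene_length(1479623, 1480693)
print(length)                  # gene length in bp
print(protein_length(length))  # protein length in aa
```

Under these assumptions the results agree with the Gene Length (1071 bp) and Protein Length (356 aa) fields of this record.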


Plasmid Coverage information

Num covering plasmid clones: 57
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACAACGT TTCATTCCTT AACGGTGGCA AAAGTGGAGC CGGAAACCCG TGATGCGGTG 
ACCATTACCT TTGCGGTGCC CCAGCCTTTG CAGGAGGCGT ATCGCTTTCG CCCCGGTCAA
CATTTGACCT TAAAAGCCAG CTTTGATGGT GAAGAATTAC GCCGTTGTTA CTCCATTTGC
CGCAGCTATC TGCCTGGCGA AATTAGTGTG GCGGTGAAAG CCATTGAAGG CGGACGTTTC
TCCCGCTATG CCCGCGAACA CATCCGCCAG GGTATGACGC TGGAGGTCAT GGTGCCGCAG
GGGCATTTCG GCTATCAGCC GCAGGCCGAA CGCCAGGGGC GCTATCTGGC AATTGCAGCA
GGATCAGGTA TTACGCCAAT GCTGGCGATT ATCGCCACCA CTTTACAAAC CGAGCCTGAA
AGTCAGTTCA CCCTGATCTA CGGTAACCGT ACCAGCCAGA GCATGATGTT TCGCCAGGCA
CTGGCAGACC TGAAAGACAA ATATCCTCAG CGTTTACAGT TGTTGTGCAT TTTCAGTCAG
GAAACCCTCG ACAGCGATCT GCTTAACGGG CGTATTGACG GTGAAAAATT ACAGTCACTT
GGGGCCTCGC TCATTAATTT TCGTCTTTAT GATGAGGCAT TTATTTGTGG TCCGGCGGCG
ATGATGGATG ACGCGGAAAC CGCCTTAAAA GCACTGGGAA TGCCAGATAA AACCATTCAT
CTGGAGCGGT TTAATACGCC TGGCACGCGC GTCAAACGTA GCGTTAACGT GCAAAGTGAC
GGACAAAAAG TGACTGTACG TCAGGATGGG CGGGATCGGG AAATCGTGCT TAATGCCGAC
GATGAAAGCA TTCTCGATGC GGCATTGCGC CAGGGGGCGG ATCTGCCCTA TGCCTGCAAA
GGCGGCGTCT GTGCGACCTG CAAATGCAAA GTGCTGCGTG GCAAAGTGGC GATGGAAACC
AATTACAGTC TGGAACCGGA TGAACTGGCC GCAGGTTATG TGTTGAGTTG CCAGGCACTG
CCGCTGACCA GCGATGTGGT GGTTGACTTT GACGCGAAGG GGATGGCATG A
 
Protein sequence
MTTFHSLTVA KVEPETRDAV TITFAVPQPL QEAYRFRPGQ HLTLKASFDG EELRRCYSIC 
RSYLPGEISV AVKAIEGGRF SRYAREHIRQ GMTLEVMVPQ GHFGYQPQAE RQGRYLAIAA
GSGITPMLAI IATTLQTEPE SQFTLIYGNR TSQSMMFRQA LADLKDKYPQ RLQLLCIFSQ
ETLDSDLLNG RIDGEKLQSL GASLINFRLY DEAFICGPAA MMDDAETALK ALGMPDKTIH
LERFNTPGTR VKRSVNVQSD GQKVTVRQDG RDREIVLNAD DESILDAALR QGADLPYACK
GGVCATCKCK VLRGKVAMET NYSLEPDELA AGYVLSCQAL PLTSDVVVDF DAKGMA
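
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It is a minimal example, not the method used to produce this record: it builds the codon table from the compact 64-letter string of the standard genetic code (translation table 11 assigns the same amino acids to codons and differs only in which start codons are permitted), and the helper name `translate` is hypothetical. Spaces in the input, as in the 10-base groups shown above, are ignored.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard genetic code).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop spaces and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate("ATGACAACGT TTCATTCCTT AACGGTGGCA"))
```

The output for these first ten codons matches the first ten residues of the protein sequence (MTTFHSLTVA). Passing the last bases of the gene, ending in TGA, shows the translation terminating at the stop codon.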