Gene CPS_2047 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPS_2047 
Symbol 
ID3522036 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameColwellia psychrerythraea 34H 
KingdomBacteria 
Replicon accessionNC_003910 
Strand
Start bp2120511 
End bp2122040 
Gene Length1530 bp 
Protein Length509 aa 
Translation table11 
GC content39% 
IMG OID637284507 
Product4-hydroxyphenylacetate-3-hydroxylase family protein 
Protein accessionYP_268775 
Protein GI71281564 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2368] Aromatic ring hydroxylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTAGAA CTGGTGAAGA GTACCGTGAA TCGATTAAAG ATGGCCGTGA AGTTTATATC 
GAAGGTGATA AGGTTAAAGA TGTTGCTACT CATCCTGCTT TTAAACCAAT TATAGATGTT
AGATCGAAAA TTTATGACAT GCAGCATAGC GCTTCAACAA AAGATGTGAT GACCTATAAA
GCAGATAATA GCGAAGAGTG TGCAATAGCT TTAAAGTTAC CTCGCAGCCA ACAAGACTGG
TGGAATAAGA AAAATGCCAC TGAAGTGATG ATGAATGAAA TTGGTGGTGT TGTTACCCGT
GTGGGTGATG AAACTGCTGG CGAAATGTGG TCATTGTATG ATGCGCAAAC TGCTTTAAAT
GAAGTAAACC CTGAGTTTGC AGATAACATC AAAAATCATA TCGATAATAT TGTCATCAAT
GACCCTTTCC ACGTAACTGC TAATACAGAC CCAAAGGGTG ATCGTTCTAA ACCCCCACAA
GCACAAGATC CAGACATGTT ATTACATGTG GTGCGAGAAA CTGACCGAGG TATTGTTGTT
CGAGGGGCTA AATTTGAAAC GGCTGCAGCT TATGCAAACC AAGCTTTTAC CAAGCCCACA
ATAGCTAACT GGACAGCCGA TACAAAATTT TCAGATTATG CCGTAGGCTT TATTTGCGAT
TTAGGCTCTG CAGGTATCAA AATTATTTGC CGAGATGGTT TTGCCGGTAA AAATCCTTTG
GATTATCCAA TTGCAACAAA ATTTGATGAA GTTGAAGCGT TAGTTGTTTT TAATGATGTT
GAAATCCCAT GGGAAAACGT CCTGTTTTAT CGTAATACTA AAGCAGCAAT GTTTATCCGC
TCAACTCTAC ACCGCTATTC AGCTTTTGCT TATATACAAC GTAACTTGAA AGTCGCTGAA
TTATTAATAG GTGCATCAAT ACTTAATGTT CAACAAACGG GTACAGACAA TCAACCTGCA
GTACAAGAAC GTTTAGCCAA ATTGGCCTGT TATAGAGAGG GTATTGACGC CCATTTGATA
TCGTCAATAG CGATGGCTGA AGAAAGCCCA GGAGGCTTAA CCATGCCAAA CCAATCCTTG
CTTTATGCTG GTCGAGTTTT TGCTTGTTCT CAACTTTATG AAATGGTCAA TATTGCCCGT
GAATTAGGTG GTGGACAGGT ATGTTTAACT CCAAGTTTTG ACACATTTAA TCATAAAGAA
ATTAAGCCAT GGCTAGATAA ATTTTATAGT CTAAATGATC AATGGTCTGC TGAAGATCGA
CGTAAATTAC TCGCCTATTC TCGTGATTTA TGTAATTCCG ATTACGCTGG TCATCGCTTA
ACGTTTCAAG AATTCGCTCA ATCACCGCGT TTTGCCCATT TAGCATCGGT ATATCATCAT
TTTGATTTTG ACGGGCCTAT AGATCTTGTG CGTCAAGCGG CTGGCTTAAG TGAATCACTT
CAATCAAACA AACTAAAGAA AACAAATATA GCTGGCGGAC ATGGGGCTTT TTTAGGTACC
ACTGCCGATA ACTTAAAGAG GAAAATATAA
 
Protein sequence
MIRTGEEYRE SIKDGREVYI EGDKVKDVAT HPAFKPIIDV RSKIYDMQHS ASTKDVMTYK 
ADNSEECAIA LKLPRSQQDW WNKKNATEVM MNEIGGVVTR VGDETAGEMW SLYDAQTALN
EVNPEFADNI KNHIDNIVIN DPFHVTANTD PKGDRSKPPQ AQDPDMLLHV VRETDRGIVV
RGAKFETAAA YANQAFTKPT IANWTADTKF SDYAVGFICD LGSAGIKIIC RDGFAGKNPL
DYPIATKFDE VEALVVFNDV EIPWENVLFY RNTKAAMFIR STLHRYSAFA YIQRNLKVAE
LLIGASILNV QQTGTDNQPA VQERLAKLAC YREGIDAHLI SSIAMAEESP GGLTMPNQSL
LYAGRVFACS QLYEMVNIAR ELGGGQVCLT PSFDTFNHKE IKPWLDKFYS LNDQWSAEDR
RKLLAYSRDL CNSDYAGHRL TFQEFAQSPR FAHLASVYHH FDFDGPIDLV RQAAGLSESL
QSNKLKKTNI AGGHGAFLGT TADNLKRKI