Gene CPS_3734 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPS_3734 
Symbol 
ID3520330 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameColwellia psychrerythraea 34H 
KingdomBacteria 
Replicon accessionNC_003910 
Strand
Start bp3892101 
End bp3893591 
Gene Length1491 bp 
Protein Length496 aa 
Translation table11 
GC content35% 
IMG OID637286181 
Productputative tryptophan halogenase 
Protein accessionYP_270401 
Protein GI71279868 
COG category[C] Energy production and conversion
[H] Coenzyme transport and metabolism 
COG ID[COG0654] 2-polyprenyl-6-methoxyphenol hydroxylase and related FAD-dependent oxidoreductases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATAATC TATCAGAAGT AAAAAATGTT GTTATTGCTG GCGGAGGAAC CGCTGGCTGG 
ATGGCTGCAA CAGCATTAGC AAAACTTATT GGTAATAACT TAAATATTAC CTTAGTTGAG
TCCGATGAAA TATCAACGGT TGGTGTTGGC GAAGCAACAA TTCCCCCTCT GTTAACATTT
CATAAAATGT TAAAGATTAA TGAAGCTGAA TTTATGAGTA GTGTAAACGC AACATTTAAA
CTAGGCATCA ATTTTGAAAA TTGGAAAGAT AACGATAGCG AATATTTTCA CTCTTTTGGC
ACTACTGGTA GGGATTGTTG GGCAACAACA TTTTTACAAT TTTACAATCG AAGTAAAAAA
GAAGGCTATC AGGCAAAGTA TGAAGAATAT TGCTTAGAAC TCCAAGCTGG ACTAAATGAA
AAATTCGCTC ACCTGCCAAA TAGTGGCTTG AAATACGCTT ACCATATGGA TGCAACATTA
TACGCAAAAT TTTTACGTAA ACTATCTGAA AAAAATGGCG TAAAAAGAAT AGAAGGTAAA
ATAGAGCAAG TAAATATCGA TGAGAAATCA GGTTTTATAA AATCTCTTAC CTTGCTTTCA
GGCGTAACTA TAGAAGGTGA TTTATTCATT GACTGTACGG GCTTTAGAGG TGTACTGATT
GAGCAAGCTT TACATACAGG CTACGATGAT TGGTCACACT GGCTGCCATG TGATCGCGCT
GTTGCAGTGC AAACAGAAAG TACTGAAGAG CCTAAACCTT ATACGCGTTC TATTGCCCAT
GAATCTGGAT GGCAATGGAA GATTCCATTA CAAAATAGAA CAGGAAACGG CCTCGTCTAT
TGTAGTAAAT ATTTGAATGA TGAAAACGCT GAGAAACTGT TGCTTGAAAA TGTTGAAGGT
AAAACATTAA AGAAACCTTT ATTTATTAAA TTTACCCCCG GCCAGCGCCG AAAACATTGG
AATAAAAACT GTGTAGCCTT AGGTTTATCA AGCGGTTTCA TTGAGCCTTT AGAGTCTACC
AGTATACATT TAATTCAACG AGGTATTATT AGACTGATGC AAATGTTCCC TAAAATGGGG
ATAACTGACT CAGTAATAAA TGAATATAAT TATCAATCTG AAGAAGAAAT TCGATACATT
AGAGATTTTA TCATTTTGCA TTACCATGTT ACCAATAGAA ACGATTCAAC TTTTTGGCGT
TATTGTAGCA AAATGGATGT TCCAAGTTCA TTGAGTCATC GTATTGAATT ATTTAAAGAA
CATGGTTATG TCTTTAAAGC ACCATGGGAT TTATTTGCTG AGAATTCTTG GGTTCAAGTC
ATGTTAGGGC AAGGTTTGTC CCCACAAAAT CACCATCCAA TAGCTGATAT GATGTCAACG
AATGAACAAG ACATGTTTTT AAATGGTATT AAAAATTCGA TAGAGCAAAC GGTCAAATCA
CTACCTACGC ATAAACAATA CTTAGATCAA TATTGTAATT CTAAAGCCTA A
 
Protein sequence
MNNLSEVKNV VIAGGGTAGW MAATALAKLI GNNLNITLVE SDEISTVGVG EATIPPLLTF 
HKMLKINEAE FMSSVNATFK LGINFENWKD NDSEYFHSFG TTGRDCWATT FLQFYNRSKK
EGYQAKYEEY CLELQAGLNE KFAHLPNSGL KYAYHMDATL YAKFLRKLSE KNGVKRIEGK
IEQVNIDEKS GFIKSLTLLS GVTIEGDLFI DCTGFRGVLI EQALHTGYDD WSHWLPCDRA
VAVQTESTEE PKPYTRSIAH ESGWQWKIPL QNRTGNGLVY CSKYLNDENA EKLLLENVEG
KTLKKPLFIK FTPGQRRKHW NKNCVALGLS SGFIEPLEST SIHLIQRGII RLMQMFPKMG
ITDSVINEYN YQSEEEIRYI RDFIILHYHV TNRNDSTFWR YCSKMDVPSS LSHRIELFKE
HGYVFKAPWD LFAENSWVQV MLGQGLSPQN HHPIADMMST NEQDMFLNGI KNSIEQTVKS
LPTHKQYLDQ YCNSKA