Gene CPS_3700 details

Gene Information

Locus tag: CPS_3700
Symbol:
ID: 3518393
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Colwellia psychrerythraea 34H
Kingdom: Bacteria
Replicon accession: NC_003910
Strand:
Start bp: 3849914
End bp: 3851398
Gene Length: 1485 bp
Protein Length: 494 aa
Translation table: 11
GC content: 39%
IMG OID: 637286148
Product: putative tryptophan halogenase
Protein accession: YP_270368
Protein GI: 71278800
COG category:
COG ID:
TIGRFAM ID:
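
The start, end, and length fields in this record are related by simple arithmetic. The sketch below is illustrative and not part of the database record. It assumes 1-based inclusive coordinates and a single terminal stop codon, and the function name cds_lengths is hypothetical.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and exactly one terminal stop
    codon that is not counted in the protein length.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # subtract the stop codon
    return gene_len, protein_len


gene_len, protein_len = cds_lengths(3849914, 3851398)
print(gene_len, protein_len)  # 1485 494
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of this record.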


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAGATA AAATTGAAAA TATTGTTATT GCTGGTGGCG GCACCGCTGG TTGGATGGCG 
GCAGCAGCTT TTTCAAAACT ACTAGGGAAG AACTTAAATA TCACCTTAGT TGAGTCAGAT
GACATTGCCT CAGTAGGCGT AGGTGAAGCC ACCATACCTC CGATAAAAAC ATTCCATAAA
TTACTCGGTA TTAATGAGCA AGAAGTTATG CGAGCCACGC ATGCCACTTT TAAACTCGGT
ATAGGATTTG AAAACTGGGG ACAACAAGGT GATCATTACA TTCACTCCTT TGGCGTCACA
GGTAAAGAAT GTTGGGCAGG TGAATTCCAT CATTTTTGGT TACATGGTCT TCGCAAGGGT
ATTAAAGCCG ACTTTGGTGA TTATTGTTAT GAGTTACAAG CAGCGAAAGC AAATAAGTTT
GCTTTATCAA AGAACACGCC GATTAATTAT GCGTATCACC TTGATGCCAC ACGCTACGCA
AAATATTTAC AAGAATTTAG TAAAAAACTG GGCGTAACTC GTGTCGAAGG AAAAATTCAA
CAGGTAAATA AAGGTAATAA AACAGGCGAA ATAAACTCAC TTACACTAGC TTCAGGACAA
GTCATTGAAG GTGACTTTTT TATAGATTGT ACTGGTTTTC AGGGGCTTTT AATTGAACAA
GCTCTTCACA CTGGATTTGA TGATTGGTCA CACTGGTTAC CCTGTGATAG AGCGGTAGCA
GTGCAAACCA AAGCGGTTGC AGCACCTTTA CCTTACACAC GTTCAATAGC CCGGAAAAGT
GGCTGGCAAT GGAGAATACC ATTACAAAAT CGTGTTGGTA ATGGCCTGGT TTTTTGTAGT
AAATATTGCT CAGATGAAGA AGCGATAAGT ACGTTAACAG CAAACATCGA AGGGGAGTTA
CTTACAGAGC CACGAATCAT AAAATTTAAC ACCGGCCGCC GTCGAAAGGG TTGGAATAAA
AACTGTGTAG CTTTAGGTTT ATCAAGTGGT TTTATCGAAC CTCTTGAGTC AACAAGTATT
CATTTAATTA TGTCTGGAAT TATCCGCTTA TTACGTTTAT TTCCTTTTGA TGGCATCCAT
CAATCAGCTA TTAATGAATA CAATAACAAA CTCGATTCAG AATTAAACGC CGTTCGTGAC
TTTATCATAC TACATTACAA AGCAACTCAG CGTGAAGATA GTAATTTTTG GTTACATTGT
AAGAATATGG AAATCCCCCC TTCCCTAGTG CATAAAATGC AATTATTTAA AGATACAGGT
CGTGTCTTTT TAGATGATGG CGATATTTTC CGCGTAGACT CTTGGACCCA AGTAATGCTC
GGCCAAGGCA TTATGCCAAC GCAGTACCAC AAAATAGCTG AAATAATGAA TGATAAAGAG
CTGGAGAACT TCATGAGTAA CCTGAAAGCA TCGATAACTA ATGCTGTTGA ACAATTACCT
AGTCACACAG AATTTATACA AAGTTATTGT AAATCAGACT ATTAA
 
Protein sequence
MKDKIENIVI AGGGTAGWMA AAAFSKLLGK NLNITLVESD DIASVGVGEA TIPPIKTFHK 
LLGINEQEVM RATHATFKLG IGFENWGQQG DHYIHSFGVT GKECWAGEFH HFWLHGLRKG
IKADFGDYCY ELQAAKANKF ALSKNTPINY AYHLDATRYA KYLQEFSKKL GVTRVEGKIQ
QVNKGNKTGE INSLTLASGQ VIEGDFFIDC TGFQGLLIEQ ALHTGFDDWS HWLPCDRAVA
VQTKAVAAPL PYTRSIARKS GWQWRIPLQN RVGNGLVFCS KYCSDEEAIS TLTANIEGEL
LTEPRIIKFN TGRRRKGWNK NCVALGLSSG FIEPLESTSI HLIMSGIIRL LRLFPFDGIH
QSAINEYNNK LDSELNAVRD FIILHYKATQ REDSNFWLHC KNMEIPPSLV HKMQLFKDTG
RVFLDDGDIF RVDSWTQVML GQGIMPTQYH KIAEIMNDKE LENFMSNLKA SITNAVEQLP
SHTEFIQSYC KSDY
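
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The following is a minimal sketch, not the method used by the database. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ only in permitted start codons), and the names CODON_TABLE and translate are hypothetical helpers introduced for illustration.

```python
from itertools import product

# Codon order follows the usual T, C, A, G layout of genetic code tables.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace is removed first, so sequences printed in blocks of ten
    bases (as in this record) can be pasted in directly.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence above.
print(translate("ATGAAAGATA AAATTGAAAA TATTGTTATT"))  # MKDKIENIVI
# Last three codons of the gene sequence above.
print(translate("GACTATTAA"))  # DY
```

Both outputs match the corresponding ends of the protein sequence listed in this record. Translating the full gene sequence works the same way.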