Gene CPS_1023 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPS_1023 
Symbol 
ID3519714 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameColwellia psychrerythraea 34H 
KingdomBacteria 
Replicon accessionNC_003910 
Strand
Start bp1047586 
End bp1049142 
Gene Length1557 bp 
Protein Length518 aa 
Translation table11 
GC content38% 
IMG OID637283488 
Productputative tryptophan halogenase 
Protein accessionYP_267772 
Protein GI71278074 
COG category[C] Energy production and conversion 
COG ID[COG0644] Dehydrogenases (flavoproteins) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATGAAC AAGTGAAACA AGTTGTGGTG GTAGGTGGTG GTACGGCCGG TTGGTTAACT 
GCCGCAAATT TAGCTCAAAA ATTCAATAGT GTCGAAAGTG GAGCGATACA AGTCACGCTA
GTTGAATCTC CTGATATCCC CACCATTGGT GTGGGTGAGG GAACTTGGCC TACCATGAGA
AAAACGTTAG CAAAACTAGG GATTAGCGAA GCTGATTTTT TATCAAAGTG TAATGCTTCA
TTCAAGCAAG CAACAAAATT TGTTAATTGG CAGCAAGCAC CAAAAAATGG CATTAATAGT
CATTATTATC ATTTGTTTAC CTCAATTAAT GATCCTTTAG ATTTTAACTT AGCGCCTTAT
TGGAAGCTAG GTATGATTGG TGATAACAAC AGCTACGCTG AAACTATTAG TATGCAAGCG
GCTATTTGCG AATCAGGTTT AGCCCCTAAG TTGATCACCA ACCGAGAATT TGAAGGTGTA
CAAAACTATG CTTATCACTT AGATGCGGGG CTTTTTACTG ACTTATTACG TGAACATGCA
ACGACCAAGT TAGGGGTTAA ACATGTATCT GCTAACGTCA CTGATGTGAA TTTAGATGAT
GATGGTTATA TCATTAATAT TCATTGCGAT AGTGTAGGTG TTGTATTAGG CGAGTTTTTT
GTTGATTGTA CGGGCTTTAA AAGTTTATTG ATTGGTAAAG CGCTAGGTAT ACCTTTTAAA
AGTATCGATG ATACTTTACT TTGTGATCAC GCGTTGGCTA TTCAAGTGCC CTATGAAAAT
GAAGAATCAT CTATTGCTAG TTGTACTATT TCTACTGCGC AAGAAGCTGG GTGGACATGG
GATATCGGTT TGTCTAATAG ACGTGGTACA GGTTATGTGT ATTCCTCTGC GCACACGAGT
CATGAACGTG CAGAACAAGT ATTACGTAAT TACATTGGCC CTCAAGCTGA TCAGCTTGAG
TCACGTTTAA TAAAGATGAA TGTTGGTTAT AGAGAAAAAT TTTGGCATAA AAACTGTTTT
GCTATTGGTT TATCTGCAGC ATTTGTTGAG CCACTGGAAG CGTCGGCGAT TTTCCTTATT
GAAGCCTCTG CCAATATGTT GTCAGAGCTA TTTCCTCGTG ATCGTCACGC GATGCTAGCT
GTTGAGGAAA AAGTGAATAA GTCATTTAAA TTCCGTTGGG ATAAAACCAT TGATTTTATC
AAAATGCATT ACTTTTTATC AAAAAGAACA GAACCATTTT GGCAAGACAA CAAGGTGCTT
AGTACTGTGC CAGACACATT GTTAGCGTCT TTGGATAGCT GGAAACACCA GTTAATTACC
GGTTACGATT TTGATAATGT CTACGAGCCT TTTCCGTTAG ACAGTTATCA ATACGTGCTT
TATGGTATGG GCTTTGACCA GACGTTAACC TTTAACGAAA ACTCATATAC GAAGCAGGGT
TTTGCTCAGC AACAATATAA TACAGTACAA GATTTAACAC AGAAAATGCA GCAACAATTA
CCTGAAAATA GAGAATTATT GAACAAGATA GCTCAATATG GTTTTCAGAA AATATAA
 
Protein sequence
MNEQVKQVVV VGGGTAGWLT AANLAQKFNS VESGAIQVTL VESPDIPTIG VGEGTWPTMR 
KTLAKLGISE ADFLSKCNAS FKQATKFVNW QQAPKNGINS HYYHLFTSIN DPLDFNLAPY
WKLGMIGDNN SYAETISMQA AICESGLAPK LITNREFEGV QNYAYHLDAG LFTDLLREHA
TTKLGVKHVS ANVTDVNLDD DGYIINIHCD SVGVVLGEFF VDCTGFKSLL IGKALGIPFK
SIDDTLLCDH ALAIQVPYEN EESSIASCTI STAQEAGWTW DIGLSNRRGT GYVYSSAHTS
HERAEQVLRN YIGPQADQLE SRLIKMNVGY REKFWHKNCF AIGLSAAFVE PLEASAIFLI
EASANMLSEL FPRDRHAMLA VEEKVNKSFK FRWDKTIDFI KMHYFLSKRT EPFWQDNKVL
STVPDTLLAS LDSWKHQLIT GYDFDNVYEP FPLDSYQYVL YGMGFDQTLT FNENSYTKQG
FAQQQYNTVQ DLTQKMQQQL PENRELLNKI AQYGFQKI