Gene CPS_0992 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPS_0992 
Symbol 
ID3519166 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameColwellia psychrerythraea 34H 
KingdomBacteria 
Replicon accessionNC_003910 
Strand
Start bp1006526 
End bp1008016 
Gene Length1491 bp 
Protein Length496 aa 
Translation table11 
GC content39% 
IMG OID637283457 
Productputative tryptophan halogenase 
Protein accessionYP_267741 
Protein GI71278208 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.259329 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATAGTC CAGTTAAAAA AGTAGTTGTA TTAGGCGGCG GCACTTCTGG ATGGATTTCT 
GCAGCCTTAT TGAAAAAAAT TCTCGGTTCA GCTATTCAAC TTGAGTTAGT CGAGTCTGAT
GCAATTGGCA CTATTGGTGT TGGAGAAGCG ACCATTCCAC CAATCAAACA CTTAAATAAT
GTTTTGGGCA TAAATGAAGC CGAATTTTTA CGTGAAACCA AAGCAACCAT AAAACTTGGT
ATTAATTTTG AAAACTGGAA AAGCCAGGGT CACAGCTATT TACATAGTTT TGGTGCGGCA
GGAAAAAGCT TAGCCTTTTG TCATTTTCAT CACCTATTAA AGCGCGCCAA TCAATTAGAA
GATGACAGCC ATTTATGGCA ATACGATTTA AATTACTTAT GTGCTAAAGC AGGTAAATTT
GCCCAGATAA AAAGTAAAGA CCCTATCATT GAATTACCTT ACGCTTATCA TTTTGATGCG
GGGTTATATG CAAAGTTTTT ACGAAAGTTT AGTGAGAAAA TAGGCGTAAT TAGAACAGAA
GGTTTAGTCG AGCACGTTGA GCAATGCCCT AACTCTGGAC ATATAACGTC TCTTAAGTTG
AAAAGTGGGC AAACAGTCAC AGGTGATTTG TTTATCGACT GTTCCGGTTT TAAAGGCGTG
CTTATTCAAG AAAAACTTCG CACTGGTTAT GAAGATTGGA GTCATTTACT GCAATGTGAC
CGTGCGATTG CGGTACCTTC AGAACGATTA GAAAAAACCT TACCTTATAC GCGTTCTATA
GCTCATGCCG CAGGCTGGCA ATGGCGCATT CCACTGCAAC ACCGCAACGG TAATGGCTTA
GTTTACAGTA GTAGCCATTG TAGTGAACAA CAGGCTATGG ATACCATAAT GGGTAATTTA
GACAGTAAAG CCATTGCTGA TCCTAAAGTT ATTAAGTTTC AAACGGGTCG ACGTTACCAA
CAGTGGAATA AAAATGTTAT CGCTATTGGT CTATCAAGTG GCTTTTTAGA GCCTCTTGAA
TCCACCAGTA TTCATCTTGT GCAATCTGCT GTTGTTCGCT TAGCACACCT TTTTCCTCAT
CAGGGTATTA ATAGCAGCCT AGTTGATGAA TTCAACAAGC AGTCGGCTAC TGAATTTGAA
CAAATAAGAG ACTTTTTAGT ACTGCATTAT CATGCTACAG AGCGCACTGA TAGTGATTTT
TGGCAAGATA TGCGTCATAT GAAAATTCCG GACAGTCTCG CCCATAAAAT TGAAATATTT
AAACAAAGCG GACGATTATT TAGAGAGCAA AATGATTTAT TCACTGATAG TTCATGGCTC
CAAGTGATGT TAGGACAAGG TATCGTACCG CAAGATTATC ATCCAATAGC CAATATAATG
TCTGATGAAA AGCTCGCTGA AATGCTAAGA AAAGTGAAAG AGATTAAGCA AGCACCTATT
GATAAACTGC CCAGTCATGA TGAGTTTTTG AGAATATTCT GTCAGCAGTA G
 
Protein sequence
MNSPVKKVVV LGGGTSGWIS AALLKKILGS AIQLELVESD AIGTIGVGEA TIPPIKHLNN 
VLGINEAEFL RETKATIKLG INFENWKSQG HSYLHSFGAA GKSLAFCHFH HLLKRANQLE
DDSHLWQYDL NYLCAKAGKF AQIKSKDPII ELPYAYHFDA GLYAKFLRKF SEKIGVIRTE
GLVEHVEQCP NSGHITSLKL KSGQTVTGDL FIDCSGFKGV LIQEKLRTGY EDWSHLLQCD
RAIAVPSERL EKTLPYTRSI AHAAGWQWRI PLQHRNGNGL VYSSSHCSEQ QAMDTIMGNL
DSKAIADPKV IKFQTGRRYQ QWNKNVIAIG LSSGFLEPLE STSIHLVQSA VVRLAHLFPH
QGINSSLVDE FNKQSATEFE QIRDFLVLHY HATERTDSDF WQDMRHMKIP DSLAHKIEIF
KQSGRLFREQ NDLFTDSSWL QVMLGQGIVP QDYHPIANIM SDEKLAEMLR KVKEIKQAPI
DKLPSHDEFL RIFCQQ