Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CPS_1023 |
Symbol | |
ID | 3519714 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Colwellia psychrerythraea 34H |
Kingdom | Bacteria |
Replicon accession | NC_003910 |
Strand | + |
Start bp | 1047586 |
End bp | 1049142 |
Gene Length | 1557 bp |
Protein Length | 518 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 637283488 |
Product | putative tryptophan halogenase |
Protein accession | YP_267772 |
Protein GI | 71278074 |
COG category | [C] Energy production and conversion |
COG ID | [COG0644] Dehydrogenases (flavoproteins) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATGAAC AAGTGAAACA AGTTGTGGTG GTAGGTGGTG GTACGGCCGG TTGGTTAACT GCCGCAAATT TAGCTCAAAA ATTCAATAGT GTCGAAAGTG GAGCGATACA AGTCACGCTA GTTGAATCTC CTGATATCCC CACCATTGGT GTGGGTGAGG GAACTTGGCC TACCATGAGA AAAACGTTAG CAAAACTAGG GATTAGCGAA GCTGATTTTT TATCAAAGTG TAATGCTTCA TTCAAGCAAG CAACAAAATT TGTTAATTGG CAGCAAGCAC CAAAAAATGG CATTAATAGT CATTATTATC ATTTGTTTAC CTCAATTAAT GATCCTTTAG ATTTTAACTT AGCGCCTTAT TGGAAGCTAG GTATGATTGG TGATAACAAC AGCTACGCTG AAACTATTAG TATGCAAGCG GCTATTTGCG AATCAGGTTT AGCCCCTAAG TTGATCACCA ACCGAGAATT TGAAGGTGTA CAAAACTATG CTTATCACTT AGATGCGGGG CTTTTTACTG ACTTATTACG TGAACATGCA ACGACCAAGT TAGGGGTTAA ACATGTATCT GCTAACGTCA CTGATGTGAA TTTAGATGAT GATGGTTATA TCATTAATAT TCATTGCGAT AGTGTAGGTG TTGTATTAGG CGAGTTTTTT GTTGATTGTA CGGGCTTTAA AAGTTTATTG ATTGGTAAAG CGCTAGGTAT ACCTTTTAAA AGTATCGATG ATACTTTACT TTGTGATCAC GCGTTGGCTA TTCAAGTGCC CTATGAAAAT GAAGAATCAT CTATTGCTAG TTGTACTATT TCTACTGCGC AAGAAGCTGG GTGGACATGG GATATCGGTT TGTCTAATAG ACGTGGTACA GGTTATGTGT ATTCCTCTGC GCACACGAGT CATGAACGTG CAGAACAAGT ATTACGTAAT TACATTGGCC CTCAAGCTGA TCAGCTTGAG TCACGTTTAA TAAAGATGAA TGTTGGTTAT AGAGAAAAAT TTTGGCATAA AAACTGTTTT GCTATTGGTT TATCTGCAGC ATTTGTTGAG CCACTGGAAG CGTCGGCGAT TTTCCTTATT GAAGCCTCTG CCAATATGTT GTCAGAGCTA TTTCCTCGTG ATCGTCACGC GATGCTAGCT GTTGAGGAAA AAGTGAATAA GTCATTTAAA TTCCGTTGGG ATAAAACCAT TGATTTTATC AAAATGCATT ACTTTTTATC AAAAAGAACA GAACCATTTT GGCAAGACAA CAAGGTGCTT AGTACTGTGC CAGACACATT GTTAGCGTCT TTGGATAGCT GGAAACACCA GTTAATTACC GGTTACGATT TTGATAATGT CTACGAGCCT TTTCCGTTAG ACAGTTATCA ATACGTGCTT TATGGTATGG GCTTTGACCA GACGTTAACC TTTAACGAAA ACTCATATAC GAAGCAGGGT TTTGCTCAGC AACAATATAA TACAGTACAA GATTTAACAC AGAAAATGCA GCAACAATTA CCTGAAAATA GAGAATTATT GAACAAGATA GCTCAATATG GTTTTCAGAA AATATAA
|
Protein sequence | MNEQVKQVVV VGGGTAGWLT AANLAQKFNS VESGAIQVTL VESPDIPTIG VGEGTWPTMR KTLAKLGISE ADFLSKCNAS FKQATKFVNW QQAPKNGINS HYYHLFTSIN DPLDFNLAPY WKLGMIGDNN SYAETISMQA AICESGLAPK LITNREFEGV QNYAYHLDAG LFTDLLREHA TTKLGVKHVS ANVTDVNLDD DGYIINIHCD SVGVVLGEFF VDCTGFKSLL IGKALGIPFK SIDDTLLCDH ALAIQVPYEN EESSIASCTI STAQEAGWTW DIGLSNRRGT GYVYSSAHTS HERAEQVLRN YIGPQADQLE SRLIKMNVGY REKFWHKNCF AIGLSAAFVE PLEASAIFLI EASANMLSEL FPRDRHAMLA VEEKVNKSFK FRWDKTIDFI KMHYFLSKRT EPFWQDNKVL STVPDTLLAS LDSWKHQLIT GYDFDNVYEP FPLDSYQYVL YGMGFDQTLT FNENSYTKQG FAQQQYNTVQ DLTQKMQQQL PENRELLNKI AQYGFQKI
|
| |