Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CPS_3699 |
Symbol | |
ID | 3519309 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Colwellia psychrerythraea 34H |
Kingdom | Bacteria |
Replicon accession | NC_003910 |
Strand | + |
Start bp | 3848339 |
End bp | 3849883 |
Gene Length | 1545 bp |
Protein Length | 514 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 637286147 |
Product | putative tryptophan halogenase |
Protein accession | YP_270367 |
Protein GI | 71279185 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGCAAAG CTATTAAAAA AATTGTTATT GTTGGTGGAG GATCCGCGGG TTGGATTACC GCCGGAAGCC TCGCAGCAGA GCATTGCGTC GATGCAGCGA GTAGTATAGA AGTTATTTTG ATTGAATCAC CTGAGGTTAA TAGTATCGGT GTTGGAGAAG GCACTTGGCC CTCAATGCGT AATACTTTAG AAAAAATTGG CATAGATGAA AAAGAGTTTT TGCAACAATG TGATGCCTCA TTTAAACAGG GGTCAAAGTT TATAGGTTGG ACAACCGGTG ATGACACCGA CTCTTACTAT CATCCCTTTA TGACACCTGA TGGTTATGGC CACATAGATT TACATGCCGC ATGGCAAGCT AACCATTCAG AGCAAGCTTT TGGTGATGCG GTTAATATTC AAAGCCATGT TTGCCAAGCA GGTCTCGCAC CTAAACAACT GGCAACACCA AGTTACGCTG CAGTTACCAA TTATGGCTAT CACCTTGATG CAAAGAAATT TGCCTCACTT CTACATAAAC ACTGTACTCA GAAATTAAAT GTTCAGCATA TTGTTGATCA TATGGACGGT ATTATTTCTG CAGACAATGG TGATATTAAC GCCATAAGTA CCAAAGAACA TGGTCATATA GCCGGTGATT TATTTATTGA TTGTACCGGT AGCGCATCAT TGCTGCTGGG AAAGCATTTT AATATTGGCT TTATCAATAA ACAGCACATT TTATTTAATG ATTCCGCCCT TGCTGTACAA GTTCCCTACC CTGATCAAGT AAACCCAATA AATTCAGCCA CGTTATCAAC AGCACAAAGT GCTGGCTGGA TTTGGGATAT TGGTTTACAG ACACGACGAG GTGTTGGTTA CACCTACTCG AGTCAGTATA TCAGCGATGC AGAGGCTGAA AAATCATTAA GACAATATCT AATCAATTCC GTTGGCGAAG AGCAAGCTAA TTTACTCAAG CCCAAAAAAT TAATTTTTGA ACCCGGTCAT AGAGAAAAGT TTTGGCATAA AAACTGTGTT GCCATAGGAA TGTCTGCAGG GTTTCTTGAG CCTCTTGAAG CTTCTGCCTT AGCCATGGTT GAATTATCAT CAACTATGGT CAGTGAAGAG TTACCCGTGA CGCGTGCCCA TATGGATATT ATTGCTAAGC GATTCAATGA GAGGTTTAAC TATCGCTGGC AGCGTATTAT CGATTTTTTA AAATTACATT ACATTTTAAG TAAACGCACT GATTCACACT ACTGGAGAGA TAACCAACAA CCTAACAGCA TTAGTGAAGA ACTTCAGGAG CTGATAAAAT TATGGCAATA TCAACCTCCA AGCCGTTATG ATTTTGTTCA AAATGAAGAA GTATTTCCTT CAGCAAGTTA TCAATATGTA CTCTATGGTA TGGGGTTTGA AACAGAGCAA CGCGCTAACC CAAGAAAATT TGAGGCTAAT CCCCTAGCGG AAAAAACGAT TGCAGAGACA CAAAAAAAAA TAGATAAATA TCTTACGGGT TTGCCCACTA ATAGAGAGTT ACTCAATAAG TTAAAAGAGA AGTAA
|
Protein sequence | MGKAIKKIVI VGGGSAGWIT AGSLAAEHCV DAASSIEVIL IESPEVNSIG VGEGTWPSMR NTLEKIGIDE KEFLQQCDAS FKQGSKFIGW TTGDDTDSYY HPFMTPDGYG HIDLHAAWQA NHSEQAFGDA VNIQSHVCQA GLAPKQLATP SYAAVTNYGY HLDAKKFASL LHKHCTQKLN VQHIVDHMDG IISADNGDIN AISTKEHGHI AGDLFIDCTG SASLLLGKHF NIGFINKQHI LFNDSALAVQ VPYPDQVNPI NSATLSTAQS AGWIWDIGLQ TRRGVGYTYS SQYISDAEAE KSLRQYLINS VGEEQANLLK PKKLIFEPGH REKFWHKNCV AIGMSAGFLE PLEASALAMV ELSSTMVSEE LPVTRAHMDI IAKRFNERFN YRWQRIIDFL KLHYILSKRT DSHYWRDNQQ PNSISEELQE LIKLWQYQPP SRYDFVQNEE VFPSASYQYV LYGMGFETEQ RANPRKFEAN PLAEKTIAET QKKIDKYLTG LPTNRELLNK LKEK
|
| |