Gene Information |
Locus tag | Sare_2031 |
Symbol | |
ID | 5705685 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 2324552 |
End bp | 2325793 |
Gene Length | 1242 bp |
Protein Length | 413 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641271521 |
Product | tryptophan halogenase |
Protein accession | YP_001536892 |
Protein GI | 159037639 |
COG category | [C] Energy production and conversion |
COG ID | [COG0644] Dehydrogenases (flavoproteins) |
TIGRFAM ID | |
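The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption that is consistent with the reported 1242 bp) and that the CDS ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the record above; helper names are hypothetical.
START_BP = 2324552
END_BP = 2325793


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1242 413
```

Both values agree with the Gene Length and Protein Length fields in the record.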
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0818162 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0655371 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGACAAC CCGTTGCCGA GTACGACGTC ATCGTCGTCG GTGGCGGCCC CGCAGGGGCG TGCACCGCCG GTCTTCTCGC GCTGGAGGGG CACCGGATCC TGTTGCTGGA GCGCGAGAAG TTTCCCCGCT ACCACGTCGG CGAATCGCTG ATCACCGGCG TCGTCCCGAC GCTTACCGCG TTGGGACTGC TGGACCGCAT GGCCGAGCTG CGGTTCCAGG TCAAGTACGG CGGCAGTCTC CTGTGGGGCG AGAACCAGAC CGAACCGTGG TCGTTCCGCT TCCGGGAGAT CCGTGGCGGC CGCTACGAGT ACGCCTGGCA GGTCCGGCGC GCCGAGTTCG ACGCGATGTT GCTGGACCGG GCGCGGGAGC TGGGCGTGCA CGTCGTCGAG GGGGCGACCG TCCGGGACGC GCTGACCGAC GGCGACCGGC TCGCGGGCGT GCGCTATCAG CTCAAGGGCG AGTCGGGCTC CGTGCCGGCG CGGGCGACGA TGGTGGTCGA CGCCTCGGGC CAGCATCGCT GGTTGGGTCG CCGGTTCGGG CTGGTCGACT GGTACGACGA CCTGCGCAAC GTGGCCGTGT GGAGCTACTG GCAGGGTGCC CTGCGCTACC CGGGTGAGCA CGAGGGTGAC CTGCTGACCG AGAGCTGCCG GCAGGGCTGG CTCTGGTACG CGCCGCTGAG CCCGGAGCTG ACGGGCATCG GCTACGTCAC GACCAGCGAT CGGCTGGTGG CCTCTGGGTT AACGCCGGAG CAGTTGCTGG AAAGACACAT TGCGGAATCG TCCGAGGTCT CCTGGCTCAC CGCGGGCGCG AAGCGGGTGG ACATCTATCG CGCCGCGCGC GACTGGTCGT ACACCTGCCA GCAGTTCTCC GGCCCGGGCT GGGTCCTGGT CGGCGACGCG GCCGCATTCA TCGACCCGCT GCTCTCCGCC GGAGTGACCC TGGCCATGCG CGCGGCGAGC AGCGTGGCGA AGGCGGTCCA CGAGACGCTG ACCGCGCCGG ACAAGGAACG GCACGTCATG AAGGACTACG AGGACCGGTA CCGGGACTTC CTCGGCTCAC TGCTGGAGTT GGTCCGGTTC TTTTACGACG GCGCGCACGG CAAGGAGGAG CTGCACCTGC GAGCCCAGGC CATCGTGGAT CCCGACCGCA GCCTGCCGCC CAAGCTCTCA TTCGTGTCGC TGCTCTCCGG GCTGGTCCGC GGGGACGAAA GCCTCGGCGC GGACGCGGTC GACGAATATT GA
|
Protein sequence | MRQPVAEYDV IVVGGGPAGA CTAGLLALEG HRILLLEREK FPRYHVGESL ITGVVPTLTA LGLLDRMAEL RFQVKYGGSL LWGENQTEPW SFRFREIRGG RYEYAWQVRR AEFDAMLLDR ARELGVHVVE GATVRDALTD GDRLAGVRYQ LKGESGSVPA RATMVVDASG QHRWLGRRFG LVDWYDDLRN VAVWSYWQGA LRYPGEHEGD LLTESCRQGW LWYAPLSPEL TGIGYVTTSD RLVASGLTPE QLLERHIAES SEVSWLTAGA KRVDIYRAAR DWSYTCQQFS GPGWVLVGDA AAFIDPLLSA GVTLAMRAAS SVAKAVHETL TAPDKERHVM KDYEDRYRDF LGSLLELVRF FYDGAHGKEE LHLRAQAIVD PDRSLPPKLS FVSLLSGLVR GDESLGADAV DEY
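As a rough illustration of how the gene sequence relates to the protein sequence and to the GC content field, the sketch below translates a nucleotide string codon by codon and computes its G+C fraction. It uses only the Python standard library. The amino-acid assignments are those of the standard genetic code, which translation table 11 shares (table 11 differs in its permitted start codons); the function names are illustrative, and only the first 30 and last 12 bases of the gene sequence above are used as input.

```python
import itertools

# Codon table in NCBI order (bases T, C, A, G); same residue
# assignments as translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(itertools.product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces used to group bases in the record and uppercase."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three 10-base groups and the final 12 bases of the gene sequence above.
head = "ATGAGACAAC CCGTTGCCGA GTACGACGTC"
tail = "GACGAATATT GA"
print(translate(head))  # MRQPVAEYDV
print(translate(tail))  # DEY
```

The two outputs match the first ten and last three residues of the protein sequence in the record. Passing the full gene sequence to `gc_content` gives the fraction to compare against the GC content field.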