Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_1866 |
Symbol | |
ID | 5899321 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 2001808 |
End bp | 2003313 |
Gene Length | 1506 bp |
Protein Length | 501 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641562356 |
Product | tryptophan halogenase |
Protein accession | YP_001683493 |
Protein GI | 167645830 |
COG category | [C] Energy production and conversion [H] Coenzyme transport and metabolism |
COG ID | [COG0654] 2-polyprenyl-6-methoxyphenol hydroxylase and related FAD-dependent oxidoreductases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 35 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCGCGAA CCAACCCTAT TCGTTCGATC CTGATCGTCG GCGGCGGCAC AGCCGGCTGG ATGGCCGCGA CCTGGCTGGC CGGGCGGCTG GCGCGCCAGG ACATCCAGAT CACCGTCGTG GAGTCGCCCG ATATCCGCAC CGTCGGGGTC GGGGAGGCGA CCGTCCCGGC CATTCGCGAC TATTTCCGCG ACATCGGCGT CACCGAAGCC GAAGTGATGG CGGCCACCCA GGGCACGGTG AAGCTGGGCA TCGAGTTTCG TGACTGGAAG CGGGACGGGG AGTGCTTCTT CCATCCCTTC GGCCTTTACG GCATGCCCTC GCGCGGCGTG CCGTTCCACC AGTTCTGGCT CAAGCGCCGC GCCGAGGGCG ACGCCACGCC GCTGGCCGCC TACAGCCTGT GCACCCAACT GGCCATGGCC AACCAGATGA TGGAGCCGCC GGCTTCGCCG CCCAACGATC TGGGCGTGTT CAATTGGGCG GTCCATTTTG ACGCCGGCCT GTATGCGCAG TTCCTGAGAC GCAAGGCCAC GTCGGAGCTA GGCGTGACCC ATGTCGACGG CACGGTCGTC GAGGTTTCAA AGAACGGCGA GAACGGTTTC CTGACCGGCG TGACCCTGGC GGACGGCCGT ATCTTCGAGG CCGATCTCTT CATCGATTGC TCGGGTTTTC GCAGCCTGCT GCTCGGCCAA GCGCTGGGCG TGGCCTACGA GGATTGGACC CATCTGCTGC CCTGCGACCG CGCCGTGGCC TTGCCGTGCG AGCGCGACGG CCCGCTGACG CCCTACACCC GCAGCACGGC GCTGGCGGCC GGCTGGCAGT GGCGCATCCC GTTGCAGCAC CGGGTCGGCA ATGGCTATGT CTATTCCAGC CGGCACATCT CGGACGATGA GGCCACCGCC GTCCTGATGT CGCGCCTGGA GGGGCCGGCC TTGGCCGAGC CCAACCTGCT GCGCTTTCAG ACCGGCCATC GCCGCCGCTT CTGGGAGAAG AACTGCATAG CTTTGGGCTT GGCCGCAGGC TTCATGGAGC CGCTGGAATC GACCAGCATC GTTCTCATCC AGAGCGGGGT GGAGCGGCTC GGCGCGCTGT TCCCGGAGCG CGGCTTCGAT CCGGCCTTGG CCGACGAATA CAACCGCATC ACCACGCTCG AATACGAGCG GATCCGCGAT TTCCTGTTGC TGCACTACGT CGCCAACCGT CGAGACGGCG AGGCCATGTG GGATCATGTC CGTCAACTGG CCTTGCCCGA ACCCCTGGTC CACAAGATGC GGATGTTCGC CAGCCGCGGA ACGATGGTCC GCTACGAGTG GGAGTCTTTC CACGACCCCA GCTGGCTGTC GATGTACGGC GGCTTCGACA TTGTCCCGCA GGCTCATGAT CCGATGGCGG ACTATTTCAC CAAGCCCGAG CTCGACAGCG CCTTGCGCCG GATGCGCGAA GCGATCACTC GCGCTCAGGC CTTCGCCGTT CCTCACGAAA CGTTCCTGGC GGCGCAACGG ACTTGA
|
Protein sequence | MARTNPIRSI LIVGGGTAGW MAATWLAGRL ARQDIQITVV ESPDIRTVGV GEATVPAIRD YFRDIGVTEA EVMAATQGTV KLGIEFRDWK RDGECFFHPF GLYGMPSRGV PFHQFWLKRR AEGDATPLAA YSLCTQLAMA NQMMEPPASP PNDLGVFNWA VHFDAGLYAQ FLRRKATSEL GVTHVDGTVV EVSKNGENGF LTGVTLADGR IFEADLFIDC SGFRSLLLGQ ALGVAYEDWT HLLPCDRAVA LPCERDGPLT PYTRSTALAA GWQWRIPLQH RVGNGYVYSS RHISDDEATA VLMSRLEGPA LAEPNLLRFQ TGHRRRFWEK NCIALGLAAG FMEPLESTSI VLIQSGVERL GALFPERGFD PALADEYNRI TTLEYERIRD FLLLHYVANR RDGEAMWDHV RQLALPEPLV HKMRMFASRG TMVRYEWESF HDPSWLSMYG GFDIVPQAHD PMADYFTKPE LDSALRRMRE AITRAQAFAV PHETFLAAQR T
|
| |