Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_2144 |
Symbol | |
ID | 5899599 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 2322695 |
End bp | 2324209 |
Gene Length | 1515 bp |
Protein Length | 504 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 641562634 |
Product | tryptophan halogenase |
Protein accession | YP_001683770 |
Protein GI | 167646107 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.0060883 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0489946 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTCACC TGAACAAGAT CGTCATCGTC GGCGGCGGCT CGGCGGGCTG GATCTGCGCG GCCATGCTCA GCCACTACTT TCAGAACGGG CCGACCCAGG TCGAACTGAT CGAATCCGAG GAGATCGGCA CGATCGGCGT GGGGGAATCC ACCATTCCCC CGTTCCTCCA ACTGATCCGC ACCTTGGGGA TCAACGAGCA GGAATTCATT CAAGAAACCC AGGCCGCCTT CAAGCTTGGC ATCCGGTTCG AGAACTGGCT CGAGAAGGGC GACGTCTACT ACCACCCGTT CGGCCAGATC GGCGGCCCGC TGGAGGTCAA CGAGTTCTAT CAGTGCTGGC TGCGGGCCAA GGCCAACGGC CATCCGTCGA GCCTGCAGGA CTTCGCCCCG GCCACGGTGA TGGCCGCCGC TGGCAAGTTC ATGCTGCCGG CCAAGGCCCA GCGCACGATG ATCGCCAACG CCAACTACGC CCTGCACGTC GACGCCCGGC TGGTGGCGCT GTACCTGCGC AAGTTCGCCG AGGCGCGGGG CGTCAAGCGC ACCGAGGGCA TCGTCACCGA CGTGGCGACC CGGGCCGACG GCGGCGTTGA GAAGGTGATC ATGAAGGACG GCCGCGAGGT CGCCGGCGAC TTCTTCATCG ACTGTTCGGG CTTCCGGGCG TTGCTGATCG GCAAGACCCT GAACGAACCC TTCCGCGACT GGTCCGACGT TCTGCTCTGC GACCGCGCCA TCGTCGCCCA GACCGAGAAC ATCGGGCCGC CTCATCCCTA TACCCTGGTC CAGGCGCAGG ATTTCGGCTG GCGCTGGCGC ATCCCGCTGC AGCACCGCTC GGGCAACGGC TATGTGTTCG CCAGCCAGTA TCTCAGCGAC GACGAGGCCA CCGCCACCTT GCTGAGCCAA CTTCAGGGCG AGATCGTGCT GGGTCCGTCG GTCATCCCGT TCAAGACCGG CGTGCGCGAG CGGCCGTGGG TCAAGAACGT AGTGTCCATC GGCCTGTCCT GCGGCTTCAT CGAGCCGCTG GAATCCACGG CCCTGCACCT GATCTACAAG GGCATGGACT ATCTGCTGCG GTTCATGCCC GACATGGACG CCGACCAGAC CCTGGCGGCC GAGTACAATC GTCGCATGGT CGCCGACTAT GAGGAGATCC GCGACTTCAT CGTCCTGCAC TACGTGACCA CCCGGCGCGA CGACACGCCG TTCTGGCGCG CCTACCAGCA GGTCGAGCCG CCCGAGAGCC TGAAGGCGCG CATCGCCCTG TTCAAGGCGG CCGGGGTGCT GCGCGACGGC GTCGATGACA TGTTCCGCGC CCCCAGCTGG CAGTCGGTGA TGGAGGGCAT GGGCGTCCGG CCCGAGCGCT ACCAGCAGTT GGTCGACCGC ATCCCGCTGA GCGTGATCAT GAACCTGATG GACAAGTCCG CGCCGATGCT GGCCGACTTC GTCAAGACCC TGCCCAGCCA TCAGGAGTTC CTGGACGCCT ATTGTCCGGC GGAGCCGTTC AAGCGAACGG CCTAA
|
Protein sequence | MAHLNKIVIV GGGSAGWICA AMLSHYFQNG PTQVELIESE EIGTIGVGES TIPPFLQLIR TLGINEQEFI QETQAAFKLG IRFENWLEKG DVYYHPFGQI GGPLEVNEFY QCWLRAKANG HPSSLQDFAP ATVMAAAGKF MLPAKAQRTM IANANYALHV DARLVALYLR KFAEARGVKR TEGIVTDVAT RADGGVEKVI MKDGREVAGD FFIDCSGFRA LLIGKTLNEP FRDWSDVLLC DRAIVAQTEN IGPPHPYTLV QAQDFGWRWR IPLQHRSGNG YVFASQYLSD DEATATLLSQ LQGEIVLGPS VIPFKTGVRE RPWVKNVVSI GLSCGFIEPL ESTALHLIYK GMDYLLRFMP DMDADQTLAA EYNRRMVADY EEIRDFIVLH YVTTRRDDTP FWRAYQQVEP PESLKARIAL FKAAGVLRDG VDDMFRAPSW QSVMEGMGVR PERYQQLVDR IPLSVIMNLM DKSAPMLADF VKTLPSHQEF LDAYCPAEPF KRTA
|
| |