Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_2143 |
Symbol | |
ID | 5899598 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 2321138 |
End bp | 2322676 |
Gene Length | 1539 bp |
Protein Length | 512 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641562633 |
Product | tryptophan halogenase |
Protein accession | YP_001683769 |
Protein GI | 167646106 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.0261967 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0242357 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGGAGA CCATGCACGA CCGGCGCATC CGCGACATCG TCATCGTTGG CGGCGGCACG GCGGGCTGGA TGGCGGCCGC CAGCCTCAAG CAGCATTTCG GAAACGCGCC GATCGGCATC ACCCTGATCG AGTCCTCCGA AATCGGCGCG ATCGGGGTGG GCGAGGCGAC GATCCCCACC ATCCGCCGCT TTTATCAATC CCTGGGCCTG TCCGACATCG ACGTGCTGCG GGCTACAGGC GGCACCTGCA AGCTGGGCAT CCGCTTCAAC GACTGGCTGC GACCCGGTTC GTCCTTCATC CATCCGTTCG GCCTGTACGG CCAGGACCTG AAGGGCGTGT CGTTCCATCA CTACTGGATG CGCCTGCGCG CCCTGGGCGA GGACGCGCCG ATCGGCGACT ATTCGCTGGG CGCCAGCCTG GCCACGGCCG GCAAGTTCAC CACCCCGTCG CGCAATCCGC CGTCGGCGCT GTCGGTGTTT GACTGGGCGG TGCATTTCGA CGCCAGCCTG TTCGCCAGGC TGATGCGCCA GGTGGCCGAG CAGGCGGGCG TCAAGCGCAT CGACGCCAGG ATCGTCAAGA CCAACCTGCG CGGCGAGGAC GGCTTCATCG AGTCCGTCAC GCTCGACACC GGCGCGAGCG TGGCCGGCGA CCTGTTCATC GACTGCTCGG GCTTCCGCGG CCTGCTGATC GAGGAGGCCC TGCACACCGG CTACGAGGAC TGGAGCCAAT GGTTGCTGTG CGACAGCGCC CTGGCCGTGC AAAGCGAGGG GCAGGGGGCT CCGCCGCCCT ATACCGACGT CACCGCCCGG CCGGCCGGCT GGCAGTGGCG CATCCCGCTG CAGCACCGCT GGGGCAACGG CTACGTCTAT TCCAGCCGCC ACACCTCCGA CGAGAACGCC CGCGAGGTGC TGACTGCGTC GCTCGACGAG CGCCTGCTGC ACGAGCCGCG CAAGATCGGC TTCCACCCTG GCCGCCGCTT GAAGGCCTGG AACAAGAACT GCATCGCCCT GGGCCTGGCG TCCGGCTTCC TGGAGCCGCT GGAGAGCACC AGCATCGCCC TGATCGAGAC GGGCATCGAG AAGATCAAGC AGTTGTTCCC CAACCGCGAC TTCGATCCCC GGATCGTTGA CGAGTTCAAC GAGATGTCGC GGCTGGAGAT GGAGCGCGTC CGCGACTTCA TCATCCTGCA CTACAAGGCC AACCAGCGGG CCGATGACCC CACCGGCTTC TGGACCCATT GCCGCCAGAT GGCGGTTCCC GACACCCTCC AGAAGAAGAT CGACCTGTGG CGGGTCCAAG GTCACTTTAT CCGCTATCGG TGGGAGATGT TTTCCCAACC CAGCTGGCTG GCGATCTATG CCGGTTTCGA GATGTTGCCG GAAACCTACG ACCTCAGCGT CGACGGCTTC GACGCGGGTC AGCTCTCGGA GGCCCTGGCC GAGATGCGCA AGGCGGTGGC CGACACCGTC GCCAGCACGC GCACCCACGG CGACTTCATC GAACAGTACG CTCGCCCGCG GCCCGTCGCG GCTGAATAG
|
Protein sequence | MQETMHDRRI RDIVIVGGGT AGWMAAASLK QHFGNAPIGI TLIESSEIGA IGVGEATIPT IRRFYQSLGL SDIDVLRATG GTCKLGIRFN DWLRPGSSFI HPFGLYGQDL KGVSFHHYWM RLRALGEDAP IGDYSLGASL ATAGKFTTPS RNPPSALSVF DWAVHFDASL FARLMRQVAE QAGVKRIDAR IVKTNLRGED GFIESVTLDT GASVAGDLFI DCSGFRGLLI EEALHTGYED WSQWLLCDSA LAVQSEGQGA PPPYTDVTAR PAGWQWRIPL QHRWGNGYVY SSRHTSDENA REVLTASLDE RLLHEPRKIG FHPGRRLKAW NKNCIALGLA SGFLEPLEST SIALIETGIE KIKQLFPNRD FDPRIVDEFN EMSRLEMERV RDFIILHYKA NQRADDPTGF WTHCRQMAVP DTLQKKIDLW RVQGHFIRYR WEMFSQPSWL AIYAGFEMLP ETYDLSVDGF DAGQLSEALA EMRKAVADTV ASTRTHGDFI EQYARPRPVA AE
|
| |