Gene Information |
Locus tag | Caul_1842 |
Symbol | |
ID | 5899297 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 1958372 |
End bp | 1959889 |
Gene Length | 1518 bp |
Protein Length | 505 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641562332 |
Product | tryptophan halogenase |
Protein accession | YP_001683469 |
Protein GI | 167645806 |
COG category | |
COG ID | |
TIGRFAM ID | |
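
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, not part of the original record: it assumes the start and end coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The helper names `span_length` and `coding_codons` are hypothetical and introduced only for illustration.

```python
# Values copied from the Gene Information fields above.
start_bp = 1958372
end_bp = 1959889
gene_length_bp = 1518
protein_length_aa = 505


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] (assumes 1-based, inclusive coordinates)."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Number of amino-acid codons, assuming exactly one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = coding_codons(computed_length)

print(computed_length == gene_length_bp)      # coordinates agree with Gene Length
print(computed_protein == protein_length_aa)  # Gene Length agrees with Protein Length
```

Under these assumptions both comparisons hold for this record: the interval spans 1518 bp, which corresponds to 506 codons, one of which is the stop codon.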
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.00202997 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.000756616 |
Fosmid hitchhiking | Yes |
Fosmid clonability | hitchhiker |
Sequence |
Gene sequence | ATGACGGATC GCGCCGTCCG GAATATTCTA ATCGTCGGCG GCGGAACGGC CGGCTGGATG ACGGCGGCGG CGCTCGCGGC CAAGCTGGCG GGGCTGCCCA TCGCCATTCG CCTGGTCGAA TCCGCCGAGA TCGGCACGGT GGGCGTCGGC GAGGCGACGG TTCCACATAT CCGCCATTTC AACGCCGCCC TGGGCCTCGA CGAGGCCGAC TTCATGCGCA AGACCCAGGC GACCTACAAG CTGGGCATCG AGTTCCGAGG CTGGGGCAAG CCCGGCGACA GCTACATCCA TCCATTCGGA GCCTACGGCG CGCCGATCGG CGGGGTGGGT TTCCATCACC ATTGGCTGCG GGCGCGCCAA GCCGGCGATC CGACGCCGCT GGAGGCCTAT TCCCTGCCGA TCATGGCGGC CCGCCAGGGG CGATTCGCTC CGCCCTCCCC CGACCCCCGC GCGCTGGCCT CGACCTATTC CTACGCCTAC CAGTTCGACG CTGGCCTCTA TGCGGCCTAT CTGCGCGCCT ATGCCGAGAC CCGGGGCGTG GTTCGCACCG AGGGCAAGGT CGCCGACGTC GCCCTGCGTG GCGAGGACGG CTTCATCGAA GCCATCACGA TGGAGAATGG CGAGCGGATC GAGGCCGACC TGTTCATCGA CTGCTCGGGT TTCCGCGGCC TGCTGATCGA GCAGAGCCTG AAGACTGGCT ATGAGGACTG GACCCGCTGG CTGCCCTGCG ACCGAGCCGC CGCCGTGCCG TGCGATACGG TCGAGCGCTC GACGCCTTAC ACCCGCTGCA CCGTCGATAT GGCCGGCTGG CGCTGGCGGA TCCCGCTGCA GCATCGGGTC GGCAACGGCT ATGTCTATTG CAGCGGCCAC ATCAGCGACG ACGAGGCCGC CGCCGCCTTG CTGGCGGGAT TGGAAGGCCC GGCCCAGGCC GAGCCGCGCT TCCTGCGGTT CGTCACCGGC CGGCGCAAGA AGCAGTGGAA CAAGAACTGC GTGGCGATCG GGCTGGCCAG CGGCTTCCTC GAGCCATTGG AGAGCACCAG CATCCACCTG ATCCAGGTGG CGGTCACCAC CCTGCTGGAG CTGTTCCCCG AACGCGACTG CGCCCAGGCC GATCAGGACG AATACAATCG CGTGATGACC CTGGAGTTCG AGCGGATCCG CGACTTCCTG GTGCTGCACT ACCATGCCAA CCAGCGCACC GACGCGCCGT TCTGGAACGA GCGCCGGACC ATGAGCATCC CCGACAGCCT GGCCTACAAG ATGGACCTGT TCCGTGATCG CGGGGTGGTG GTGAAGTACA GGGACGGCTT CTTCCTCGAG CCCAGTTGGC TGGCGGTCTA TCTGGGCCAG AACATCCTGC CCGCCGCCTA CGACCCGGTC AGCGACGGCG TGCCGACCGC CGCCCTGACC CGACGCTTGA CGGCGATCCG CGACTCGATC GCCGACACCG TGCGAACCCT GCCGACCCAC GACGACTGGA TCGCCCGGTT CTGCGCCGCG ACGCCGGCCG CCGCATGA
Protein sequence | MTDRAVRNIL IVGGGTAGWM TAAALAAKLA GLPIAIRLVE SAEIGTVGVG EATVPHIRHF NAALGLDEAD FMRKTQATYK LGIEFRGWGK PGDSYIHPFG AYGAPIGGVG FHHHWLRARQ AGDPTPLEAY SLPIMAARQG RFAPPSPDPR ALASTYSYAY QFDAGLYAAY LRAYAETRGV VRTEGKVADV ALRGEDGFIE AITMENGERI EADLFIDCSG FRGLLIEQSL KTGYEDWTRW LPCDRAAAVP CDTVERSTPY TRCTVDMAGW RWRIPLQHRV GNGYVYCSGH ISDDEAAAAL LAGLEGPAQA EPRFLRFVTG RRKKQWNKNC VAIGLASGFL EPLESTSIHL IQVAVTTLLE LFPERDCAQA DQDEYNRVMT LEFERIRDFL VLHYHANQRT DAPFWNERRT MSIPDSLAYK MDLFRDRGVV VKYRDGFFLE PSWLAVYLGQ NILPAAYDPV SDGVPTAALT RRLTAIRDSI ADTVRTLPTH DDWIARFCAA TPAAA
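
The relationship between the gene sequence and the protein sequence can be illustrated by translating the first 30 nucleotides of the gene sequence. This is a minimal sketch rather than the database's own pipeline. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the tables differ in which codons may act as start codons). The function name `translate` and the variable names are hypothetical.

```python
from itertools import product

# Codon table in NCBI ordering (bases T, C, A, G at each position).
# The amino-acid assignments are those of the standard code, which
# translation table 11 shares.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-nt blocks of the gene sequence listed above.
gene_prefix = "ATGACGGATC GCGCCGTCCG GAATATTCTA"
protein_prefix = translate(gene_prefix)
print(protein_prefix)  # MTDRAVRNIL
```

The result matches the first ten residues of the protein sequence in the record. The gene sequence is listed in coding orientation (it begins with ATG), so no reverse complement is needed here even though the gene lies on the minus strand.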