Gene Caul_1866 details


Gene Information

Locus tag: Caul_1866
Symbol:
ID: 5899321
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 2001808
End bp: 2003313
Gene Length: 1506 bp
Protein Length: 501 aa
Translation table: 11
GC content: 66%
IMG OID: 641562356
Product: tryptophan halogenase
Protein accession: YP_001683493
Protein GI: 167645830
COG category: [C] Energy production and conversion; [H] Coenzyme transport and metabolism
COG ID: [COG0654] 2-polyprenyl-6-methoxyphenol hydroxylase and related FAD-dependent oxidoreductases
TIGRFAM ID:
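
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon. Both assumptions are consistent with the listed values but are not stated in the record. The function names are illustrative only.

```python
# Coordinates copied from the record above (assumed 1-based, inclusive).
START_BP = 2001808
END_BP = 2003313


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


print(gene_length(START_BP, END_BP))  # 1506
print(protein_length(1506))           # 501
```

Both results agree with the Gene Length and Protein Length fields.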


Plasmid Coverage information

Num covering plasmid clones: 35
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCGCGAA CCAACCCTAT TCGTTCGATC CTGATCGTCG GCGGCGGCAC AGCCGGCTGG 
ATGGCCGCGA CCTGGCTGGC CGGGCGGCTG GCGCGCCAGG ACATCCAGAT CACCGTCGTG
GAGTCGCCCG ATATCCGCAC CGTCGGGGTC GGGGAGGCGA CCGTCCCGGC CATTCGCGAC
TATTTCCGCG ACATCGGCGT CACCGAAGCC GAAGTGATGG CGGCCACCCA GGGCACGGTG
AAGCTGGGCA TCGAGTTTCG TGACTGGAAG CGGGACGGGG AGTGCTTCTT CCATCCCTTC
GGCCTTTACG GCATGCCCTC GCGCGGCGTG CCGTTCCACC AGTTCTGGCT CAAGCGCCGC
GCCGAGGGCG ACGCCACGCC GCTGGCCGCC TACAGCCTGT GCACCCAACT GGCCATGGCC
AACCAGATGA TGGAGCCGCC GGCTTCGCCG CCCAACGATC TGGGCGTGTT CAATTGGGCG
GTCCATTTTG ACGCCGGCCT GTATGCGCAG TTCCTGAGAC GCAAGGCCAC GTCGGAGCTA
GGCGTGACCC ATGTCGACGG CACGGTCGTC GAGGTTTCAA AGAACGGCGA GAACGGTTTC
CTGACCGGCG TGACCCTGGC GGACGGCCGT ATCTTCGAGG CCGATCTCTT CATCGATTGC
TCGGGTTTTC GCAGCCTGCT GCTCGGCCAA GCGCTGGGCG TGGCCTACGA GGATTGGACC
CATCTGCTGC CCTGCGACCG CGCCGTGGCC TTGCCGTGCG AGCGCGACGG CCCGCTGACG
CCCTACACCC GCAGCACGGC GCTGGCGGCC GGCTGGCAGT GGCGCATCCC GTTGCAGCAC
CGGGTCGGCA ATGGCTATGT CTATTCCAGC CGGCACATCT CGGACGATGA GGCCACCGCC
GTCCTGATGT CGCGCCTGGA GGGGCCGGCC TTGGCCGAGC CCAACCTGCT GCGCTTTCAG
ACCGGCCATC GCCGCCGCTT CTGGGAGAAG AACTGCATAG CTTTGGGCTT GGCCGCAGGC
TTCATGGAGC CGCTGGAATC GACCAGCATC GTTCTCATCC AGAGCGGGGT GGAGCGGCTC
GGCGCGCTGT TCCCGGAGCG CGGCTTCGAT CCGGCCTTGG CCGACGAATA CAACCGCATC
ACCACGCTCG AATACGAGCG GATCCGCGAT TTCCTGTTGC TGCACTACGT CGCCAACCGT
CGAGACGGCG AGGCCATGTG GGATCATGTC CGTCAACTGG CCTTGCCCGA ACCCCTGGTC
CACAAGATGC GGATGTTCGC CAGCCGCGGA ACGATGGTCC GCTACGAGTG GGAGTCTTTC
CACGACCCCA GCTGGCTGTC GATGTACGGC GGCTTCGACA TTGTCCCGCA GGCTCATGAT
CCGATGGCGG ACTATTTCAC CAAGCCCGAG CTCGACAGCG CCTTGCGCCG GATGCGCGAA
GCGATCACTC GCGCTCAGGC CTTCGCCGTT CCTCACGAAA CGTTCCTGGC GGCGCAACGG
ACTTGA
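
GC content is the percentage of G and C bases in the sequence. A minimal sketch of that calculation follows. The helper name `gc_content` is hypothetical, and the rounding used by the source database to arrive at its reported 66% is not known.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C among the A/C/G/T bases in seq.

    Whitespace and any other characters (such as the column spacing used
    in the listing above) are ignored.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# First ten bases of the gene sequence: 7 of 10 are G or C.
print(gc_content("GTGGCGCGAA"))  # 70.0
```

Pasting the full gene sequence into `gc_content` gives the value to compare against the GC content field.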
 
Protein sequence
MARTNPIRSI LIVGGGTAGW MAATWLAGRL ARQDIQITVV ESPDIRTVGV GEATVPAIRD 
YFRDIGVTEA EVMAATQGTV KLGIEFRDWK RDGECFFHPF GLYGMPSRGV PFHQFWLKRR
AEGDATPLAA YSLCTQLAMA NQMMEPPASP PNDLGVFNWA VHFDAGLYAQ FLRRKATSEL
GVTHVDGTVV EVSKNGENGF LTGVTLADGR IFEADLFIDC SGFRSLLLGQ ALGVAYEDWT
HLLPCDRAVA LPCERDGPLT PYTRSTALAA GWQWRIPLQH RVGNGYVYSS RHISDDEATA
VLMSRLEGPA LAEPNLLRFQ TGHRRRFWEK NCIALGLAAG FMEPLESTSI VLIQSGVERL
GALFPERGFD PALADEYNRI TTLEYERIRD FLLLHYVANR RDGEAMWDHV RQLALPEPLV
HKMRMFASRG TMVRYEWESF HDPSWLSMYG GFDIVPQAHD PMADYFTKPE LDSALRRMRE
AITRAQAFAV PHETFLAAQR T
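
The gene sequence begins with GTG, while the protein begins with M. Under translation table 11 (bacterial, archaeal and plant plastid code), GTG is one of the permitted start codons and is read as methionine at the initiation position, although it encodes valine elsewhere. The sketch below illustrates this with a hand-built codon table. The function name `translate_cds` is hypothetical, and the code is not the tool the database used to derive the protein sequence.

```python
# Codon table in the NCBI ordering (bases T, C, A, G at each position).
# Table 11 uses the same amino acid assignments as the standard code and
# differs in its set of permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Start codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading a valid start codon as M and stopping at
    the first stop codon. Whitespace in the input is ignored."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break  # stop codon ends translation
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence above (60 bases, 20 codons).
first_row = "GTGGCGCGAA CCAACCCTAT TCGTTCGATC CTGATCGTCG GCGGCGGCAC AGCCGGCTGG"
print(translate_cds(first_row))  # MARTNPIRSILIVGGGTAGW
```

The output matches the first 20 residues of the protein sequence. The final six bases, ACTTGA, translate to T followed by a stop codon, which agrees with the protein ending in T.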