Gene Caul_1839 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_1839 
Symbol 
ID5899294 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp1953899 
End bp1955404 
Gene Length1506 bp 
Protein Length501 aa 
Translation table11 
GC content65% 
IMG OID641562329 
Producttryptophan halogenase 
Protein accessionYP_001683466 
Protein GI167645803 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value0.703806 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00201458 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACCGACC AACGCCTGCG CAAGATCGTC ATCGTCGGAG GCGGGACCGC CGGCTGGATG 
ACGGCGGCGG CGCTCGGCCG CTTCCTGAAG GACGGCCACA CCCAGGTGAC CCTGATCGAG
TCCGAGGAAA TCGGCACGAT CGGGGTCGGC GAGTCGACCA TCCCGCAGAT CAACATCTTC
AACCGCATGC TGGGCCTGGA CGAGAACGAG TTCGTCCGCC GCACCAAGGC GACCTTCAAG
CTGGCCATCG AGTTCGTCGA CTGGAAACGG ATCGGCCACG CCTATTATCA CCCGTTCGGA
CCCTACGGCG TCGACATGGA CGGGGTGTCT TTTCATGCCT ACTGGCTGCG GCTGAAGGCC
ATGGGCGAGG CCGCCGATCT GGGCGAATAC TCCCTGCAGG CCCTGGCGGC GGCGCAGGGC
AAGTTCATGC GGGCCAATCA CCAGCCCAAC TCGCCCCTGG GCAGCATCGC CCACGCCTAT
CACATCGACG CCGGCCTCTA CGCCCGCTTC CTGCGCGACT ACGCCGAGGA TCACGGGATC
CGCCGCCAGG AAGGCAAGAT CGTCGAGGTC CACCAGCGCG CCGTGGACGG CTTCATCGAG
GCCGTGACCC TGCAGAGCGG TCAGCGCGTC GAGGGCGATC TGTTCATCGA CTGCTCGGGC
TTCCGCGGCC TGCTGATCGA ACAGACCCTG AAGACCGGCT ACGAGGACTG GTCCAACTGG
CTGCTCAACG ACCGCGCCGT GGCCGTGCCC TGCGAGCCGG CGGGCGCGCG CGCGCCGGTC
ACCCGCGCCA CCGCCCGGCC AGCCGGCTGG CAGTGGCGCA TCCCGCTGCA GCATCGCCTG
GGCAACGGCT ACGCCTATTC CAGCGAGCAC ATCAGCGAGG ACGAGGCCAC GGCCTACCTG
CTCGCTAACC TCGACGGCGC GCCGCTGCGC GATCCGTTCA CCCTGCGCTT CAAGGCCGGG
CGGCGAAAGA AGAGCTGGAA CAAGAACGTC GTCGCCATAG GCCTGTCGGC CGGGTTCATG
GAGCCGCTGG AAAGCCAGAG CATCCACCTG ATCCAGGTGG GGATCTCGCG CCTGCTGGCC
ATGTTCCCCG ACAAGCGGTT CGAGCAGCCC GACATCGACC GCTACAACAG GGTGATGCAG
TTCGAATACG AGAAGATCCG CGACTTCCTG ATCCTGCACT TCCACGCCAC CCAGCGGAAC
GACACGCCCT ACTGGGACTA TCTGCGGGAA ATGCCGATCC CGGACTACCT GGCCGACAAG
ATCGCGGTGT TCGAGAGCTA CGGCCGGGTG TTCCGCGAGA ATGAGGAACT GTTCAACGAC
ACCAGCTGGT TCGCGGTGAT GATCGGTCAG GGTCTGGAGC CGCGCGGCCA CGACCCGATG
GCCGACGTAA TGTCCGACGA CGAGTTGCGC GCCAAGATGA AGGGCATCCA CGGTGTTATC
GCCAAGTCGG CCGAGGTCAT GCCCGACCAC ATGACGTTCA TCGCCGAAAA CTGCGCGGCT
CAATAA
 
Protein sequence
MTDQRLRKIV IVGGGTAGWM TAAALGRFLK DGHTQVTLIE SEEIGTIGVG ESTIPQINIF 
NRMLGLDENE FVRRTKATFK LAIEFVDWKR IGHAYYHPFG PYGVDMDGVS FHAYWLRLKA
MGEAADLGEY SLQALAAAQG KFMRANHQPN SPLGSIAHAY HIDAGLYARF LRDYAEDHGI
RRQEGKIVEV HQRAVDGFIE AVTLQSGQRV EGDLFIDCSG FRGLLIEQTL KTGYEDWSNW
LLNDRAVAVP CEPAGARAPV TRATARPAGW QWRIPLQHRL GNGYAYSSEH ISEDEATAYL
LANLDGAPLR DPFTLRFKAG RRKKSWNKNV VAIGLSAGFM EPLESQSIHL IQVGISRLLA
MFPDKRFEQP DIDRYNRVMQ FEYEKIRDFL ILHFHATQRN DTPYWDYLRE MPIPDYLADK
IAVFESYGRV FRENEELFND TSWFAVMIGQ GLEPRGHDPM ADVMSDDELR AKMKGIHGVI
AKSAEVMPDH MTFIAENCAA Q