Gene Caul_1853 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_1853 
Symbol 
ID5899308 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp1975622 
End bp1977115 
Gene Length1494 bp 
Protein Length497 aa 
Translation table11 
GC content66% 
IMG OID641562343 
Producttryptophan halogenase 
Protein accessionYP_001683480 
Protein GI167645817 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.930503 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCAAGAAC AGGTCGTGAA AAGGGTCGTG ATCGCCGGGG GCGGAACGGC CGGCTGGATG 
GCGGCGGCGG CGCTGGTCAA GCAACTGGGG CCGCTGCTCG ACATCAGCCT AGTCGAGTCC
GACGAGATCG GCACGGTCGG GGTGGGGGAG TCCACCATCC CGACGGCCCG CACCTTCAAC
GCGCTGCTAG GGATCGACGA GCCGGCGTTC ATGCGCGCCA CCCAGGCGAC ATTCAAGCTG
GGCATCGCGT TCGAGAACTG GGGGCGGATC GGCGATCGCT ACATCCACTC GTTCGGCCAA
GTGGGCAAAT CCACCTGGAT GGGCGGCTTC CACCATTTCT GGCTACAGGC CAAGGCGGCG
GGCTTTGGCG GCGATCTGGG GGACTATTGC CTGGAGTTGA AGGCCGCCGA GGCCGACCGG
TTCTCCACCG GTGACGGTCC AGAGCTGAAC TACGCCTATC ATCTGGACGC GACGCTCTAC
GGCGGCTTCC TGCGCCGCAT GGCCGAGGCT TTGGGCGTCA AGCGGATCGA GGGCAAGATC
AGCCAGGTCG AGCAGCAGGC CGAGACCGGC TTCATCCAGG CCTTGGTCAT GGAAAATGGC
GACCGGGTCG AGGGCGACCT GTTCATTGAT TGCACAGGCT TCCGAGGGCT GCTGATCGAG
CAGACGTTGA AGGCGGGTTG GGAGGACTGG GGCGACTGGC TGCCGACCAA CAGCGCGCTG
GCGGTGCAGA CCAGGGCCAC GGGTCCGGCC GTGCCCTATA CCCGCGCCAT CGCCCACGAG
GCGGGCTGGC GCTGGAAGAT CCCGCTGCAG AATCGGGTCG GCAACGGTCT GGTCTATTGC
AGCGAGTACA TGTCGGACGA CAAGGCCCGC GAGACCCTGC TGGAGTCGCT GGACGGCGAG
CGGCTGATCG AGCCTCGGCT GATCCGCTAC CGCACGGGCC GCCGCCTGAA GACCTGGCAC
AAGAACTGCG TCGCCCTGGG CCTGGCCAGC GGCTTCGTCG AACCGCTGGA GTCGACCTCG
ATCCACCTGA TCATGATCGG GGTGACGCGG CTGATGCAGC TGTTTCCGTT CCACGGCGTC
AGCGACGCCG TCGTCGCGCG CTACAACCAG CAGGCCGTCG ACGAGCTGGA GAAGATCCGC
GACTTCATCA TCCTGCACTA TAAACTGACC GAGCGGACCG ACAGTCCGTT TTGGGATCGT
TGCCGGACGA TGGACATCCC GGACTCCCTG GCCCAGCGCA TCGACCTGTT CCGCGAGAGC
GCCCAGGCCT ACCAGTCGCC AGGCGAGCTG TTCCAGGTCG ACTCGTGGCT GCAGGTCATG
CTCGGCCAGA GGCTGGAGCC GCGCGAGCAC CATCTCATGG GCCGCCTGAT GCCGGCTGAT
CAGCTGAACC GGGCGCTGAG CGACTTGAGG GGCAACATCG CGCGCGCCGT GACCCAACTG
CCGAGCCATC AGGCGTTCCT CGACCGTTAC TGTCCCGCGT CAGCGGCGAT GTGA
 
Protein sequence
MQEQVVKRVV IAGGGTAGWM AAAALVKQLG PLLDISLVES DEIGTVGVGE STIPTARTFN 
ALLGIDEPAF MRATQATFKL GIAFENWGRI GDRYIHSFGQ VGKSTWMGGF HHFWLQAKAA
GFGGDLGDYC LELKAAEADR FSTGDGPELN YAYHLDATLY GGFLRRMAEA LGVKRIEGKI
SQVEQQAETG FIQALVMENG DRVEGDLFID CTGFRGLLIE QTLKAGWEDW GDWLPTNSAL
AVQTRATGPA VPYTRAIAHE AGWRWKIPLQ NRVGNGLVYC SEYMSDDKAR ETLLESLDGE
RLIEPRLIRY RTGRRLKTWH KNCVALGLAS GFVEPLESTS IHLIMIGVTR LMQLFPFHGV
SDAVVARYNQ QAVDELEKIR DFIILHYKLT ERTDSPFWDR CRTMDIPDSL AQRIDLFRES
AQAYQSPGEL FQVDSWLQVM LGQRLEPREH HLMGRLMPAD QLNRALSDLR GNIARAVTQL
PSHQAFLDRY CPASAAM