Gene Caul_1842 details

Gene Information

Locus tag: Caul_1842
Symbol:
ID: 5899297
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 1958372
End bp: 1959889
Gene Length: 1518 bp
Protein Length: 505 aa
Translation table: 11
GC content: 68%
IMG OID: 641562332
Product: tryptophan halogenase
Protein accession: YP_001683469
Protein GI: 167645806
COG category:
COG ID:
TIGRFAM ID:
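
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the record above.
START_BP = 1958372
END_BP = 1959889


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length(START_BP, END_BP)
print(length, protein_length(length))  # 1518 505
```

Under these assumptions the result matches the listed Gene Length (1518 bp) and Protein Length (505 aa).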


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.00202997
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000756616
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGACGGATC GCGCCGTCCG GAATATTCTA ATCGTCGGCG GCGGAACGGC CGGCTGGATG 
ACGGCGGCGG CGCTCGCGGC CAAGCTGGCG GGGCTGCCCA TCGCCATTCG CCTGGTCGAA
TCCGCCGAGA TCGGCACGGT GGGCGTCGGC GAGGCGACGG TTCCACATAT CCGCCATTTC
AACGCCGCCC TGGGCCTCGA CGAGGCCGAC TTCATGCGCA AGACCCAGGC GACCTACAAG
CTGGGCATCG AGTTCCGAGG CTGGGGCAAG CCCGGCGACA GCTACATCCA TCCATTCGGA
GCCTACGGCG CGCCGATCGG CGGGGTGGGT TTCCATCACC ATTGGCTGCG GGCGCGCCAA
GCCGGCGATC CGACGCCGCT GGAGGCCTAT TCCCTGCCGA TCATGGCGGC CCGCCAGGGG
CGATTCGCTC CGCCCTCCCC CGACCCCCGC GCGCTGGCCT CGACCTATTC CTACGCCTAC
CAGTTCGACG CTGGCCTCTA TGCGGCCTAT CTGCGCGCCT ATGCCGAGAC CCGGGGCGTG
GTTCGCACCG AGGGCAAGGT CGCCGACGTC GCCCTGCGTG GCGAGGACGG CTTCATCGAA
GCCATCACGA TGGAGAATGG CGAGCGGATC GAGGCCGACC TGTTCATCGA CTGCTCGGGT
TTCCGCGGCC TGCTGATCGA GCAGAGCCTG AAGACTGGCT ATGAGGACTG GACCCGCTGG
CTGCCCTGCG ACCGAGCCGC CGCCGTGCCG TGCGATACGG TCGAGCGCTC GACGCCTTAC
ACCCGCTGCA CCGTCGATAT GGCCGGCTGG CGCTGGCGGA TCCCGCTGCA GCATCGGGTC
GGCAACGGCT ATGTCTATTG CAGCGGCCAC ATCAGCGACG ACGAGGCCGC CGCCGCCTTG
CTGGCGGGAT TGGAAGGCCC GGCCCAGGCC GAGCCGCGCT TCCTGCGGTT CGTCACCGGC
CGGCGCAAGA AGCAGTGGAA CAAGAACTGC GTGGCGATCG GGCTGGCCAG CGGCTTCCTC
GAGCCATTGG AGAGCACCAG CATCCACCTG ATCCAGGTGG CGGTCACCAC CCTGCTGGAG
CTGTTCCCCG AACGCGACTG CGCCCAGGCC GATCAGGACG AATACAATCG CGTGATGACC
CTGGAGTTCG AGCGGATCCG CGACTTCCTG GTGCTGCACT ACCATGCCAA CCAGCGCACC
GACGCGCCGT TCTGGAACGA GCGCCGGACC ATGAGCATCC CCGACAGCCT GGCCTACAAG
ATGGACCTGT TCCGTGATCG CGGGGTGGTG GTGAAGTACA GGGACGGCTT CTTCCTCGAG
CCCAGTTGGC TGGCGGTCTA TCTGGGCCAG AACATCCTGC CCGCCGCCTA CGACCCGGTC
AGCGACGGCG TGCCGACCGC CGCCCTGACC CGACGCTTGA CGGCGATCCG CGACTCGATC
GCCGACACCG TGCGAACCCT GCCGACCCAC GACGACTGGA TCGCCCGGTT CTGCGCCGCG
ACGCCGGCCG CCGCATGA
 
Protein sequence
MTDRAVRNIL IVGGGTAGWM TAAALAAKLA GLPIAIRLVE SAEIGTVGVG EATVPHIRHF 
NAALGLDEAD FMRKTQATYK LGIEFRGWGK PGDSYIHPFG AYGAPIGGVG FHHHWLRARQ
AGDPTPLEAY SLPIMAARQG RFAPPSPDPR ALASTYSYAY QFDAGLYAAY LRAYAETRGV
VRTEGKVADV ALRGEDGFIE AITMENGERI EADLFIDCSG FRGLLIEQSL KTGYEDWTRW
LPCDRAAAVP CDTVERSTPY TRCTVDMAGW RWRIPLQHRV GNGYVYCSGH ISDDEAAAAL
LAGLEGPAQA EPRFLRFVTG RRKKQWNKNC VAIGLASGFL EPLESTSIHL IQVAVTTLLE
LFPERDCAQA DQDEYNRVMT LEFERIRDFL VLHYHANQRT DAPFWNERRT MSIPDSLAYK
MDLFRDRGVV VKYRDGFFLE PSWLAVYLGQ NILPAAYDPV SDGVPTAALT RRLTAIRDSI
ADTVRTLPTH DDWIARFCAA TPAAA
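
Both sequences above are printed in blocks of ten characters, so they need to be joined before any downstream use. The following is a minimal sketch of how that might be done, together with a simple GC-percentage helper. The helper names `join_blocks` and `gc_percent` are hypothetical, and the example uses only the first line of the gene sequence.

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "ATGACGGATC GCGCCGTCCG GAATATTCTA ATCGTCGGCG GCGGAACGGC CGGCTGGATG"
seq = join_blocks(first_line)
print(len(seq))  # 60
print(seq[:3])   # ATG
```

Applying `join_blocks` to the full gene sequence and then `gc_percent` to the result would give a value to compare against the listed GC content field.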