Gene Caul_2143 details


Gene Information

Locus tag: Caul_2143
Symbol:
ID: 5899598
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 2321138
End bp: 2322676
Gene Length: 1539 bp
Protein Length: 512 aa
Translation table: 11
GC content: 67%
IMG OID: 641562633
Product: tryptophan halogenase
Protein accession: YP_001683769
Protein GI: 167646106
COG category:
COG ID:
TIGRFAM ID:
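
The coordinates and lengths in this record are mutually consistent, which can be checked with a few lines of Python. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp, includes_stop=True):
    """Number of amino acids encoded by a CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default,
    that the final codon is a stop codon that is not translated.
    """
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1 if includes_stop else codons


# Values taken from the record above.
length_bp = gene_length(2321138, 2322676)
print(length_bp)                  # 1539
print(protein_length(length_bp))  # 512
```

Under these assumptions, 1539 bp corresponds to 513 codons, of which the last is the stop codon, leaving the 512 aa listed above.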


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.0261967
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0242357
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAGGAGA CCATGCACGA CCGGCGCATC CGCGACATCG TCATCGTTGG CGGCGGCACG 
GCGGGCTGGA TGGCGGCCGC CAGCCTCAAG CAGCATTTCG GAAACGCGCC GATCGGCATC
ACCCTGATCG AGTCCTCCGA AATCGGCGCG ATCGGGGTGG GCGAGGCGAC GATCCCCACC
ATCCGCCGCT TTTATCAATC CCTGGGCCTG TCCGACATCG ACGTGCTGCG GGCTACAGGC
GGCACCTGCA AGCTGGGCAT CCGCTTCAAC GACTGGCTGC GACCCGGTTC GTCCTTCATC
CATCCGTTCG GCCTGTACGG CCAGGACCTG AAGGGCGTGT CGTTCCATCA CTACTGGATG
CGCCTGCGCG CCCTGGGCGA GGACGCGCCG ATCGGCGACT ATTCGCTGGG CGCCAGCCTG
GCCACGGCCG GCAAGTTCAC CACCCCGTCG CGCAATCCGC CGTCGGCGCT GTCGGTGTTT
GACTGGGCGG TGCATTTCGA CGCCAGCCTG TTCGCCAGGC TGATGCGCCA GGTGGCCGAG
CAGGCGGGCG TCAAGCGCAT CGACGCCAGG ATCGTCAAGA CCAACCTGCG CGGCGAGGAC
GGCTTCATCG AGTCCGTCAC GCTCGACACC GGCGCGAGCG TGGCCGGCGA CCTGTTCATC
GACTGCTCGG GCTTCCGCGG CCTGCTGATC GAGGAGGCCC TGCACACCGG CTACGAGGAC
TGGAGCCAAT GGTTGCTGTG CGACAGCGCC CTGGCCGTGC AAAGCGAGGG GCAGGGGGCT
CCGCCGCCCT ATACCGACGT CACCGCCCGG CCGGCCGGCT GGCAGTGGCG CATCCCGCTG
CAGCACCGCT GGGGCAACGG CTACGTCTAT TCCAGCCGCC ACACCTCCGA CGAGAACGCC
CGCGAGGTGC TGACTGCGTC GCTCGACGAG CGCCTGCTGC ACGAGCCGCG CAAGATCGGC
TTCCACCCTG GCCGCCGCTT GAAGGCCTGG AACAAGAACT GCATCGCCCT GGGCCTGGCG
TCCGGCTTCC TGGAGCCGCT GGAGAGCACC AGCATCGCCC TGATCGAGAC GGGCATCGAG
AAGATCAAGC AGTTGTTCCC CAACCGCGAC TTCGATCCCC GGATCGTTGA CGAGTTCAAC
GAGATGTCGC GGCTGGAGAT GGAGCGCGTC CGCGACTTCA TCATCCTGCA CTACAAGGCC
AACCAGCGGG CCGATGACCC CACCGGCTTC TGGACCCATT GCCGCCAGAT GGCGGTTCCC
GACACCCTCC AGAAGAAGAT CGACCTGTGG CGGGTCCAAG GTCACTTTAT CCGCTATCGG
TGGGAGATGT TTTCCCAACC CAGCTGGCTG GCGATCTATG CCGGTTTCGA GATGTTGCCG
GAAACCTACG ACCTCAGCGT CGACGGCTTC GACGCGGGTC AGCTCTCGGA GGCCCTGGCC
GAGATGCGCA AGGCGGTGGC CGACACCGTC GCCAGCACGC GCACCCACGG CGACTTCATC
GAACAGTACG CTCGCCCGCG GCCCGTCGCG GCTGAATAG
 
Protein sequence
MQETMHDRRI RDIVIVGGGT AGWMAAASLK QHFGNAPIGI TLIESSEIGA IGVGEATIPT 
IRRFYQSLGL SDIDVLRATG GTCKLGIRFN DWLRPGSSFI HPFGLYGQDL KGVSFHHYWM
RLRALGEDAP IGDYSLGASL ATAGKFTTPS RNPPSALSVF DWAVHFDASL FARLMRQVAE
QAGVKRIDAR IVKTNLRGED GFIESVTLDT GASVAGDLFI DCSGFRGLLI EEALHTGYED
WSQWLLCDSA LAVQSEGQGA PPPYTDVTAR PAGWQWRIPL QHRWGNGYVY SSRHTSDENA
REVLTASLDE RLLHEPRKIG FHPGRRLKAW NKNCIALGLA SGFLEPLEST SIALIETGIE
KIKQLFPNRD FDPRIVDEFN EMSRLEMERV RDFIILHYKA NQRADDPTGF WTHCRQMAVP
DTLQKKIDLW RVQGHFIRYR WEMFSQPSWL AIYAGFEMLP ETYDLSVDGF DAGQLSEALA
EMRKAVADTV ASTRTHGDFI EQYARPRPVA AE
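
The sequences above are printed in space-separated blocks of ten residues, so they need to be joined before any analysis. The sketch below shows one way to do that and to compute a GC percentage from a nucleotide string; the helper names are hypothetical, and the example uses only the first line of the gene sequence rather than the full gene.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(nucleotides):
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(nucleotides)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence from this record, as a short example.
first_line = "ATGCAGGAGA CCATGCACGA CCGGCGCATC CGCGACATCG TCATCGTTGG CGGCGGCACG"
print(len(clean_sequence(first_line)))
print(round(gc_percent(first_line), 1))
```

Applying `gc_percent` to the complete gene sequence would give a value to compare with the GC content reported in the Gene Information section.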