Gene Caul_2144 details


Gene Information

Locus tag: Caul_2144
Symbol:
ID: 5899599
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 2322695
End bp: 2324209
Gene Length: 1515 bp
Protein Length: 504 aa
Translation table: 11
GC content: 65%
IMG OID: 641562634
Product: tryptophan halogenase
Protein accession: YP_001683770
Protein GI: 167646107
COG category:
COG ID:
TIGRFAM ID:
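
The start, end, and length fields above are mutually consistent, and the check is simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2322695
END_BP = 2324209


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the last codon of the CDS is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1515
print(protein_length(length_bp))  # 504
```

Both results match the Gene Length (1515 bp) and Protein Length (504 aa) fields listed in the record.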


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.0060883
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0489946
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGGCTCACC TGAACAAGAT CGTCATCGTC GGCGGCGGCT CGGCGGGCTG GATCTGCGCG 
GCCATGCTCA GCCACTACTT TCAGAACGGG CCGACCCAGG TCGAACTGAT CGAATCCGAG
GAGATCGGCA CGATCGGCGT GGGGGAATCC ACCATTCCCC CGTTCCTCCA ACTGATCCGC
ACCTTGGGGA TCAACGAGCA GGAATTCATT CAAGAAACCC AGGCCGCCTT CAAGCTTGGC
ATCCGGTTCG AGAACTGGCT CGAGAAGGGC GACGTCTACT ACCACCCGTT CGGCCAGATC
GGCGGCCCGC TGGAGGTCAA CGAGTTCTAT CAGTGCTGGC TGCGGGCCAA GGCCAACGGC
CATCCGTCGA GCCTGCAGGA CTTCGCCCCG GCCACGGTGA TGGCCGCCGC TGGCAAGTTC
ATGCTGCCGG CCAAGGCCCA GCGCACGATG ATCGCCAACG CCAACTACGC CCTGCACGTC
GACGCCCGGC TGGTGGCGCT GTACCTGCGC AAGTTCGCCG AGGCGCGGGG CGTCAAGCGC
ACCGAGGGCA TCGTCACCGA CGTGGCGACC CGGGCCGACG GCGGCGTTGA GAAGGTGATC
ATGAAGGACG GCCGCGAGGT CGCCGGCGAC TTCTTCATCG ACTGTTCGGG CTTCCGGGCG
TTGCTGATCG GCAAGACCCT GAACGAACCC TTCCGCGACT GGTCCGACGT TCTGCTCTGC
GACCGCGCCA TCGTCGCCCA GACCGAGAAC ATCGGGCCGC CTCATCCCTA TACCCTGGTC
CAGGCGCAGG ATTTCGGCTG GCGCTGGCGC ATCCCGCTGC AGCACCGCTC GGGCAACGGC
TATGTGTTCG CCAGCCAGTA TCTCAGCGAC GACGAGGCCA CCGCCACCTT GCTGAGCCAA
CTTCAGGGCG AGATCGTGCT GGGTCCGTCG GTCATCCCGT TCAAGACCGG CGTGCGCGAG
CGGCCGTGGG TCAAGAACGT AGTGTCCATC GGCCTGTCCT GCGGCTTCAT CGAGCCGCTG
GAATCCACGG CCCTGCACCT GATCTACAAG GGCATGGACT ATCTGCTGCG GTTCATGCCC
GACATGGACG CCGACCAGAC CCTGGCGGCC GAGTACAATC GTCGCATGGT CGCCGACTAT
GAGGAGATCC GCGACTTCAT CGTCCTGCAC TACGTGACCA CCCGGCGCGA CGACACGCCG
TTCTGGCGCG CCTACCAGCA GGTCGAGCCG CCCGAGAGCC TGAAGGCGCG CATCGCCCTG
TTCAAGGCGG CCGGGGTGCT GCGCGACGGC GTCGATGACA TGTTCCGCGC CCCCAGCTGG
CAGTCGGTGA TGGAGGGCAT GGGCGTCCGG CCCGAGCGCT ACCAGCAGTT GGTCGACCGC
ATCCCGCTGA GCGTGATCAT GAACCTGATG GACAAGTCCG CGCCGATGCT GGCCGACTTC
GTCAAGACCC TGCCCAGCCA TCAGGAGTTC CTGGACGCCT ATTGTCCGGC GGAGCCGTTC
AAGCGAACGG CCTAA
 
Protein sequence
MAHLNKIVIV GGGSAGWICA AMLSHYFQNG PTQVELIESE EIGTIGVGES TIPPFLQLIR 
TLGINEQEFI QETQAAFKLG IRFENWLEKG DVYYHPFGQI GGPLEVNEFY QCWLRAKANG
HPSSLQDFAP ATVMAAAGKF MLPAKAQRTM IANANYALHV DARLVALYLR KFAEARGVKR
TEGIVTDVAT RADGGVEKVI MKDGREVAGD FFIDCSGFRA LLIGKTLNEP FRDWSDVLLC
DRAIVAQTEN IGPPHPYTLV QAQDFGWRWR IPLQHRSGNG YVFASQYLSD DEATATLLSQ
LQGEIVLGPS VIPFKTGVRE RPWVKNVVSI GLSCGFIEPL ESTALHLIYK GMDYLLRFMP
DMDADQTLAA EYNRRMVADY EEIRDFIVLH YVTTRRDDTP FWRAYQQVEP PESLKARIAL
FKAAGVLRDG VDDMFRAPSW QSVMEGMGVR PERYQQLVDR IPLSVIMNLM DKSAPMLADF
VKTLPSHQEF LDAYCPAEPF KRTA
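
The gene and protein sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that layout and translate the nucleotide sequence for comparison with the protein sequence. It assumes the codon-to-amino-acid assignments of translation table 11, which are the same as those of the standard genetic code (the two tables differ in which codons may act as start codons; this gene starts with ATG, so that difference does not matter here). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
# Codon table built from the conventional TCAG ordering.
# Assumption: table 11 uses the standard codon-to-amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq_text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq_text.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a cleaned sequence."""
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last lines of the gene sequence, copied from this record.
first_line = "ATGGCTCACC TGAACAAGAT CGTCATCGTC GGCGGCGGCT CGGCGGGCTG GATCTGCGCG"
last_line = "AAGCGAACGG CCTAA"

print(translate(clean(first_line)))  # MAHLNKIVIVGGGSAGWICA
print(translate(clean(last_line)))   # KRTA
```

The two printed fragments correspond to the first twenty and last four residues of the protein sequence in the record. Translating the lines separately works here only because every full line holds 60 bases, a multiple of 3, so each line begins on a codon boundary. Passing the complete cleaned gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.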