Gene Caul_2406 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_2406 
Symbol 
ID5899861 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp2622912 
End bp2624456 
Gene Length1545 bp 
Protein Length514 aa 
Translation table11 
GC content65% 
IMG OID641562897 
Producttryptophan halogenase 
Protein accessionYP_001684031 
Protein GI167646368 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.699378 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.260972 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTCCGCC CCGTAAAAAA TATCGTCATC GTGGGCGGCG GGACCGCCGG TTGGCTCACC 
GCGGGCCTGA TCGCGGCCAA GCACAAGGCC CGCCAAGCGA CGGGCTTCAC CGTCACCCTG
GTGGAATCGC CCAACACGCC CATCATCGGC GTCGGAGAAG GCACATGGCC GACCCTGCGC
ACGAGCCTGG ACAAGATCGG CGTGTCGGAG ACCGATTTCT TCCGGGAGTG CGATGCGGCC
TTCAAGCAGG GCGCGAAATT CGCGCGCTGG ACCACGGGCG CGGCCGACGA CGCCTATTAT
CACCCGCTCA TGCTGCCGCA GAGCTTTTCG CAGGTGAACC TTGTTCCCCA CTGGCTGGTC
GGCGGGGCGG GGCGAAGTTT CTGTGACGCG GTCACGCCGC AAGGGCGGCT CTGCGACGAG
GGTCTGGCCC CCAAGACCAT CACCGCCGCG CCATATCAGG GCGCCGCCAA CTACGCCTAT
CATCTGGACG CGGGCAAGTT CGCGCCGTTC CTGCAGCGCC ACTGCTGCGA CAAGCTGGGC
GTCCGCCATG TCCTGGCCGA CGTCGAAAGC GTAGCCATGA CCGAGGACGG GGATATTCGC
GGCGTCGTCA CCGAACAGCA CGGCGAGATT CAGGGTGATC TCTTCGTCGA TTGCACCGGC
TTTCGCGCCC TCCTGCTTGG CGAGACGCTC GGCGTGCCGT TCCGCGGCTG CGGCGATGTC
CTATTCTGCG ACACGGCGCT GGCCATCCAG GTTCCCTACG AGACCGAGAC CAGCCCCATC
TCCAGTCACA CGATCTCCAC GGCCCAGTCG GCCGGATGGA TCTGGGATAT CGGCTTGCCC
ACGCGCCGCG GCGTTGGCCA CGTTTATTCC AGCCGCCACA TCAGCGATGA GCACGCCGAG
CGCGAGTTGC GGGCCTATAT CGGTCCGGCC GGCCACAACC TTCCGGCCAG GAAGATTGCG
ATCCGCTCGG GCCATCGCGA GACGTTCTGG AAACGCAACT GCGTGGCCGT GGGACTCGCC
GCGGGATTTC TCGAACCGCT CGAAGCGTCC GCGATCGTCC TGATCGAACT ATCGGCCAAA
CTGATCGCCG AGCAGATGCC CGCCTGCCGC GAAGTGATGG ACATCGTCGC GGCGCGCTTC
AACGCCACCA CGCATTATCG CTGGGGCCGC ATTATCGATT TCCTGAAGCT GCATTACGTT
CTGAGCCAAC GGTCGGACAG CGCCTTCTGG CGAGACAATC GCGCGCGCGA AACCATCCCC
GATCGGCTGG CCGACCTGCT CTTGCTGTGG CGTCATCAAC CGCCCTGGCT ACACGACGAG
TTCGACCGCG CCGACGAGAT CTTTCCGGCG GCCAGCTACC AATACGTGCT CTATGGCATG
GGCTTTCGCA CGCAGATCGA ACCAGAGTCC CTGGCGGACG AGCGAGCGAT CGCCGAGCGG
GCCTGGCGGG AGACGGCGGC TCAAACCGAG AGGCTGCGCG CCACCCTGCC CCACCATCGC
GACCTGATCC GCAAGATTGT CGAACACGGC TTGCAGCCCG TATGA
 
Protein sequence
MVRPVKNIVI VGGGTAGWLT AGLIAAKHKA RQATGFTVTL VESPNTPIIG VGEGTWPTLR 
TSLDKIGVSE TDFFRECDAA FKQGAKFARW TTGAADDAYY HPLMLPQSFS QVNLVPHWLV
GGAGRSFCDA VTPQGRLCDE GLAPKTITAA PYQGAANYAY HLDAGKFAPF LQRHCCDKLG
VRHVLADVES VAMTEDGDIR GVVTEQHGEI QGDLFVDCTG FRALLLGETL GVPFRGCGDV
LFCDTALAIQ VPYETETSPI SSHTISTAQS AGWIWDIGLP TRRGVGHVYS SRHISDEHAE
RELRAYIGPA GHNLPARKIA IRSGHRETFW KRNCVAVGLA AGFLEPLEAS AIVLIELSAK
LIAEQMPACR EVMDIVAARF NATTHYRWGR IIDFLKLHYV LSQRSDSAFW RDNRARETIP
DRLADLLLLW RHQPPWLHDE FDRADEIFPA ASYQYVLYGM GFRTQIEPES LADERAIAER
AWRETAAQTE RLRATLPHHR DLIRKIVEHG LQPV