Gene Caul_2064 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_2064 
Symbol 
ID5899519 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp2204006 
End bp2205508 
Gene Length1503 bp 
Protein Length500 aa 
Translation table11 
GC content66% 
IMG OID641562553 
Producttryptophan halogenase 
Protein accessionYP_001683690 
Protein GI167646027 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0665] Glycine/D-amino acid oxidases (deaminating) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.0180513 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAAGCCGC TGAAGAAAAT CCTCATCGCC GGCGGCGGCT CGGCCGGCTG GATGACGGCG 
GCCCTGTGCG CCAAGCTGTT CCAGGGCCTC TACGAAATCG TCCTGATCGA GTCCGAGGAG
ATCGGCACGG TCGGGGTGGG GGAGGCGACC ATCCCCGCGA TCAAGAGGTT CAACGAACTT
CTGGGCCTGG ACGAGGACGA CTTCCTGCGC CGCACCCAGG GCAGCTTCAA GCTGGGCATC
CAGTTCAAGG ACTGGTCGCG GCTCGGCTCC AGCTACGTCC ACGGTTTCGG GGTGATCGGC
CAGGACCTGG GATGGCTGCG CTGCCATCAG TACTGGCTGC GCATGAACGC CTTGGGCCAC
GGCGGGGATT TCGCCCAGCT GTCGATCAAC ACGGCCGCGG CGCTCGACAA CAGGTTCATG
CGCGCCAAGC CGGAGATGGG CGACTCGCCC ATCGCCCACA TCGCCCACGC CTTCCATTTC
GACGCCGGCC TCTATGCCCG CTACCTCAGC GGCTACGCCC AGGAGCGCGG GGTGCGCCGG
CGCGAGGGCA AGATCGTCGA TGTCGCCCTG CGAAGCGACG ACGGGTTCGT GCAGTCGGTG
ACCATGGACG ACGGCGAGGT GATCGCCGCC GATCTGTTTG TCGACTGCTC GGGCTTCCGC
GGCCTGATCA TCGAGCAGGC CATGAAGACC GGCTACGAGG CGTGGAAGCA CTGGCTGCCG
TGCGACCGCG CCATCGCCGT CCCGTGCGAG CGCTCGGCGA ACTTCACGCC CTACACCCGC
TCGACGGCCC GCGAAGCCGG CTGGCAGTGG CGCATCCCCC TGCAGCACCG CACCGGCAAC
GGCCACGTCT ATTCCAGCGA GCACATCGAC GACGACGAGG CCGAACGGGT GCTGCTCGCC
AACCTCGACG GCGCCCAGCG GGCCGATCCG TTGCGCATCC GCTTCGTCAC CGGCAAGCGC
AAGAAGATCT GGAACCGCAA TTGCGTAGCC ATCGGCCTGG CCAGCGGCTT CCTGGAGCCG
CTGGAATCCA CCAGCCTGCA CCTGATCCAG TCGGCGATCA TCCGCATGGT GCGCCTGCTG
CCGGACGCCG GCTTCGATCA GGCGGGGATC GACGAGTTCA ATCGCCAGAG CGACTTCGAA
TACGAGCGCA TCCGCGACTT CATCATCCTC CACTACAAGG CCACCCAGCG CGACGATACC
GCCTTCTGGC GCTATTGCCG CGACATGGAG GTCCCCGCGA CCCTGCAGCG GAAGATCGAC
CTGTTCTCGG CCAACGGCCG GGTCTTCCGG GAAGACGACG AACTGTTCAC CGAGGAGAGC
TGGATCCAGG TGTTCCTCGG GCAGGGGATC ATCCCGCGAG GCTACGATCC GCTGGTTCAG
GTCCAGAGCG ACGCCCAGAT CGCCCAGTAT CTCGCCAATA TCGAGACGGT CATCGGCAAG
TGCGTGAAGG TGATGCCGAC CCACGCCGAT TTCGTCGCCA AGACCTGCCA GGCACCGGGA
TGA
 
Protein sequence
MKPLKKILIA GGGSAGWMTA ALCAKLFQGL YEIVLIESEE IGTVGVGEAT IPAIKRFNEL 
LGLDEDDFLR RTQGSFKLGI QFKDWSRLGS SYVHGFGVIG QDLGWLRCHQ YWLRMNALGH
GGDFAQLSIN TAAALDNRFM RAKPEMGDSP IAHIAHAFHF DAGLYARYLS GYAQERGVRR
REGKIVDVAL RSDDGFVQSV TMDDGEVIAA DLFVDCSGFR GLIIEQAMKT GYEAWKHWLP
CDRAIAVPCE RSANFTPYTR STAREAGWQW RIPLQHRTGN GHVYSSEHID DDEAERVLLA
NLDGAQRADP LRIRFVTGKR KKIWNRNCVA IGLASGFLEP LESTSLHLIQ SAIIRMVRLL
PDAGFDQAGI DEFNRQSDFE YERIRDFIIL HYKATQRDDT AFWRYCRDME VPATLQRKID
LFSANGRVFR EDDELFTEES WIQVFLGQGI IPRGYDPLVQ VQSDAQIAQY LANIETVIGK
CVKVMPTHAD FVAKTCQAPG