Gene Caul_2138 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_2138 
Symbol 
ID5899593 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp2311507 
End bp2313012 
Gene Length1506 bp 
Protein Length501 aa 
Translation table11 
GC content67% 
IMG OID641562627 
Producttryptophan halogenase 
Protein accessionYP_001683764 
Protein GI167646101 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value0.903949 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0562186 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCGCGAA CCAACCCCAT TCGTTCGATC CTGATCGTCG GCGGCGGCAC GGCCGGTTGG 
ATGGCCGCGA CCTGGCTGGC CGGACGGTTG GCGCGTCAAG ACATCCAGAT CACCGTCGTG
GAGTCGCCCG ACATTCGCAC CATCGGGGTG GGCGAGGCGA CCGTGCCGGC CATTCGCGGC
TATTTCCGAG ACATCGGCGT CAGCGAAGCC GAAGTCATGG CCGCCACCCA GGGCACGGTG
AAGCTGGGCA TCGAATTTCG CGACTGGAAG CGGGACGGGG AGAGCTTCTT CCACCCGTTT
GGTCTCTACG GCATGGCCTC GCGCGGCGTG CCCTTCCACC AGTTCTGGCT CAAGCGCCAA
GCCGAGGGCG ACACCGCGCC GTTGGCCGCC TACAGCCTGT GCACCCAGTT GGCCATGGCC
AACCAGATGA TGGAGCCGCC GGCTTCACCG CCCAACGATC TGGGCGTGTT CAACTGGGCG
GTCCATTTCG ACGCCGGCCT CTATGCGCAG TTCCTGAGGC GCAAGGCCAC GTCCGAGCTG
GGTGTCACCC ATGTCGACGG CACGGTCGTC GAAGTGGCCA AGAACGGCGA GAACGGCTTC
CTGACCGGCG TGGCCCTGGC GGACGGCCGC GTTCTCGAGG CCGACCTGTT CATTGATTGC
TCGGGTTTCC GCAGCCTGCT GCTCGGCCAG GCGCTGGGCG TGGACTATGA GGATTGGACC
CATCTGCTGC CCTGCGACCG GGCGGTGGCC CTGCCCTGCG AGCGTGACGG TCCGCTGACG
CCCTACACCC GCAGCACCGC GCTGGCGGCC GGCTGGCAGT GGCGCATCCC GCTGCAACAT
CGGGTCGGCA ACGGCTATGT CTATTCCAGT CGCCATATTT CGGACGACGA GGCCACGGCC
GTTCTAATGT CGCGCTTGGA GGGGCCGGCT CTGGCCGAGC CCAACCTGCT GCGTTTCCAG
ACCGGGCGCC GGCGCCGCTT CTGGGAGAAG AACTGCATCG CCCTGGGCTT GGCCGCCGGC
TTCATGGAGC CGCTGGAATC CACCAGCATC GTGCTGATCC AAAGCGGGTT GGAGCGCCTG
GGCGCGCTGT TTCCCGATCG CGGTTTCGAC CCGGCCCTGG CCGACGAGTA CAACCGCATC
ACCACGCTCG AGTACGAGCG CATCCGCGAC TTCCTGCTGC TGCATTACGT CGCCAACCGT
CGCGAGGGCG AAGCGATGTG GGACCATGTG CGTCAGCTGG CCTTGCCCGA ACCCCTGGTC
CACAAGATGC GAATGTTCGC CAGCCGTGGA ACCATGGTCC GCTATGAGTG GGAGTCGTTC
CACGACCCCA GCTGGCTGTC GATGTACGCC GGCTTCGACA TCGCGCCGCG CGCCCACGAT
CCGATGGCGG ACTATTTCAC CAAGCCCGAG CTCGACAGCG CCTTGCGCCG GATGCGCGAA
GCGATCGCTC GCGCTCAGGC CTTGGCCGTT CCTCACGAGG CCTTCCTGGC GGCGCAACGC
CCTTGA
 
Protein sequence
MARTNPIRSI LIVGGGTAGW MAATWLAGRL ARQDIQITVV ESPDIRTIGV GEATVPAIRG 
YFRDIGVSEA EVMAATQGTV KLGIEFRDWK RDGESFFHPF GLYGMASRGV PFHQFWLKRQ
AEGDTAPLAA YSLCTQLAMA NQMMEPPASP PNDLGVFNWA VHFDAGLYAQ FLRRKATSEL
GVTHVDGTVV EVAKNGENGF LTGVALADGR VLEADLFIDC SGFRSLLLGQ ALGVDYEDWT
HLLPCDRAVA LPCERDGPLT PYTRSTALAA GWQWRIPLQH RVGNGYVYSS RHISDDEATA
VLMSRLEGPA LAEPNLLRFQ TGRRRRFWEK NCIALGLAAG FMEPLESTSI VLIQSGLERL
GALFPDRGFD PALADEYNRI TTLEYERIRD FLLLHYVANR REGEAMWDHV RQLALPEPLV
HKMRMFASRG TMVRYEWESF HDPSWLSMYA GFDIAPRAHD PMADYFTKPE LDSALRRMRE
AIARAQALAV PHEAFLAAQR P