Gene Caul_3561 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3561 
Symbol 
ID5901016 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp3844978 
End bp3846546 
Gene Length1569 bp 
Protein Length522 aa 
Translation table11 
GC content69% 
IMG OID641564069 
Producttryptophan halogenase 
Protein accessionYP_001685186 
Protein GI167647523 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0665] Glycine/D-amino acid oxidases (deaminating) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATCGCA GTCGAAGAAT CCTCATCGTC GGGGGCGGCA CGGCCGGGTG GCTGACGGCC 
GCCTATCTGG CCAAGTCCCT GCGGATCGCC GAGCAGGCGC ACCTGGAGAT CACGCTGCTG
GAGTCGCCCG ACATCGGCGT CATCGGCGTC GGCGAGGCGA CGTTCCCGAC CATCCGCACG
ACGCTGCGGT TTCTCGGCGT CGACGAGGCG AGGTTCATCC GCGAGACCTC GGCCACCTTC
AAGCAGGGCA TCCGCTTCAA CGACTGGGCG TGGGCGCAGG GCGAGGGCGG CGATGGCCCC
CAGCGTCATC AGTACTTCCA TCCGTTCGAG GCGCCGTTCT CGACCGACGG CGCCAGCCTG
GCGCCCTACT GGCTGCTGCA GAGCGAGGCG ACGCGCGCGC CGTTCGCCGA GGCCATGACC
ATCCAGGCCC GGGTCGCCGA CGCCCAGCGC GCGCCCAAGC GTCCGCACGA GGGCGACTTC
TCCGGGCCCC TGAACTACGC CTATCATTTC GACGCGGCCA AGCTGGCCGT GGTGCTGGCC
GAGCGCGCCG TCGAGCTTGG CGTGCGCCGT CTGCCGGGCC TGCTGACGGG CGTGGAGCTC
GACGCGACCG GCGCCATCGA CCACGTGATC TCGCAGGAGC ATGGCCGCCT GGAGGCCGAT
CTCTACATCG ACTGCACGGG ATTCCGGGCC GAGCTGATCG GCCAGGCCCT GAAGGCCCCG
TTCAAGTCGG CGCGGCCCAT CCTGTTCGCC GACCGGGCCC TGGCCTGCAA GATCCCCTAC
GACCGCCCCG ACGCGCCGAT CCAGAGCTTC ACCGTCGCCA CCGCCCACGA GGCCGGCTGG
ACCTGGGACA TCGGCCTGAA TGGCGCGCGC GGCGTCGGCT GCGTCTATGC CAGCGACCAC
ATGGATGACG ACCGGGCCGA GGCCATCCTG CGCGGCTATG TCGGGGAAGG CGTCGAGATC
GCGCCCCGGT CGCTGTCGTT CGAGGCGGGC TATCGCCAGA AGCAGTGGGT CAAGAACTGC
GTGGCCGTCG GCCTGTCAGC CGGGTTCCTG GAGCCGCTGG AATCGACGGG CGTGGTGCTG
ATCGAGGCGG CGGTGGCGAT CATCGCCGAG CTGTTCCCGC ACAACGGTCC GATCAGCGCC
CCGGCCTTGC GCTTCAACGA GCTGATGACC GCCCGCTACG ACAACATCAT CACCTTCCTG
AAGCTGCACT ACTGCCTGAG CCAGCGCACC GAGCCGTTCT GGCGCGCGAA CGCCGACCCG
GCCTCGATTC CGGAACGGCT GGCCGACCTG CTGGAGCAGT GGCGCTGGCG CCCGCCCACC
CGCTACGACT TCATCCTGGA TCTCGAGACC TTCGCCTTCT TCAACTACCA GTACATCCTG
TACGGCATGG GCTTCAAAAC CGACCTGTCG CCAGGGCGCG GCGAGTTTCC CGACGTGGCG
GCGGCCGACA AGCTGTTCGC CAAGATCAAG ACCTTCGGCG ACCGCGCCAC CCAGGACCTG
CCCAGCCACC GCGACCTGAT CTCGCGGATC AACCGGTTTG GCTTTGATCG GGCGGCGGAG
CACGCTTGA
 
Protein sequence
MDRSRRILIV GGGTAGWLTA AYLAKSLRIA EQAHLEITLL ESPDIGVIGV GEATFPTIRT 
TLRFLGVDEA RFIRETSATF KQGIRFNDWA WAQGEGGDGP QRHQYFHPFE APFSTDGASL
APYWLLQSEA TRAPFAEAMT IQARVADAQR APKRPHEGDF SGPLNYAYHF DAAKLAVVLA
ERAVELGVRR LPGLLTGVEL DATGAIDHVI SQEHGRLEAD LYIDCTGFRA ELIGQALKAP
FKSARPILFA DRALACKIPY DRPDAPIQSF TVATAHEAGW TWDIGLNGAR GVGCVYASDH
MDDDRAEAIL RGYVGEGVEI APRSLSFEAG YRQKQWVKNC VAVGLSAGFL EPLESTGVVL
IEAAVAIIAE LFPHNGPISA PALRFNELMT ARYDNIITFL KLHYCLSQRT EPFWRANADP
ASIPERLADL LEQWRWRPPT RYDFILDLET FAFFNYQYIL YGMGFKTDLS PGRGEFPDVA
AADKLFAKIK TFGDRATQDL PSHRDLISRI NRFGFDRAAE HA