Gene Caul_3278 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3278 
Symbol 
ID5900733 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp3543259 
End bp3544815 
Gene Length1557 bp 
Protein Length518 aa 
Translation table11 
GC content65% 
IMG OID641563784 
Producttryptophan halogenase 
Protein accessionYP_001684903 
Protein GI167647240 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0665] Glycine/D-amino acid oxidases (deaminating) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.943732 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCGTA TCGAAAAGGT CGTCATCCTG GGCGGCGGAA CCGCCGGATG GATGACCGCG 
GCGGCGCTCT CGCGCCGCCT TGGCCGATCC CTTCGCATCG ACCTGGTGGA GTCGGATGCG
ATCGGCACGG TGGGCGTGGG CGAAGCGACG ATACCGACGA TCCACTGGTT CAACGACTTG
ATCGGTCTGG ACGAGGCGGC GTTCGTGCGC GAGACCCAGG CCAGTTTCAA ACTCGGCATC
GAGTTCGTCG ATTGGCGGCG TCCCGGGCAT CGCTACTTCC ATCCGTTCGG GCGCCACGGC
GTGGAACTGG ACCAGATCCC CTTCCATCAG CACTGGCTGA AGGCGCGTGC GGACGGCGGT
CAACATCCTC TTGCGGCTTT CTCGCTCGCC ACGACTCTGG CCGAGGCCAA CCGCTTCGCC
AAGCCCGTCG CCGATCCTCG GTCGATCCTC TCCACCCTGG GATACGCCTA TCACTTCGAC
GCCACGCTCT ACGCCGCGCA TCTGCGTCGG CTGGCCGAGG CTGGCGGGGT GGTCCGTCAT
GAAGGCAAGG TCGCGACGGT GGAGCGTGAT CCGCAAAGCG GCTTTGTAAC CGCGCTGGTG
ACCGACACGG GCATAAGGGT CGAGGGCGAG CTGTTCATCG ACTGCTCGGG TTTCAGGGCG
ATGCTGATCG GCGAGACGAT GGGCGCCGAG TTCCAGGACT GGTCACATTG GTTGCCCTGC
GACCGCGCCG TGGCCGCGCC CTGCGCCCGT GTCGCCGAGA CCACGCCCTA CACCCGGTCG
ACCCTGCGTC CGGCAGGCTG GCAATGGCGC ATCCCCCTGC AGCATCGGAC CGGCAACGGT
TATGTCTATG CCAGCGCCCT GGTGTCCGAT GACGAGGCGG CCGCGACGCT GTTGCGAAAC
CTTGACGGCG ATCTGTTGGC CGACCCTCGC TTCCTGCGCT TCCAGGCCGG ATTCCGGCGC
GAAAGCTGGC GGGGCAATGT TGTCGCCATT GGCCTGTCGT CGGGCTTCCT CGAACCCCTG
GAGTCGACCA GCATCCATCT GATCCAGAGC GGCGTTGCGA AACTGATCAC CCTGTTTCCG
GACCGCGACT GCGATCCTCG CCTGGCGCAT CAGTTCAACA GCCTGTTCGC CCGCGACATG
GATGGCATAC GCGATTTTCT GATCCTGCAT TATCACGCGA CCGAAGGTCA CAACGCGCCG
CTCTGGCGGC AAGCCCGCGC CATGGCCCTG CCCGACAGCT TGACCGACAA ACTGGCGCAC
TACCGCCGCT CCGGTCGCTT GATGCTGACG CCCGACGAGT TGTTTCGCGA AGCAAGCTGG
CTAGCCGTGC TTGAAGGGCA GGGGGTGTCC GCACAGGGGT TCGCGCCCTT GGCCGATACG
CTCGACTCCG CGCAGAACCT GCGCCAATTG AACGACATCG CGTCGCTCAT CGCTCGGGTG
GCGCCGACCC TTCCTCACCA TGACGCCGCG ATTAGCGAAC TGATCCGATC GGCTGGCGCG
CCGCTGACTT CGGAGACGGC TGCGCCAAAC TCAACAGATC GGACATCCGA GCGATAA
 
Protein sequence
MNRIEKVVIL GGGTAGWMTA AALSRRLGRS LRIDLVESDA IGTVGVGEAT IPTIHWFNDL 
IGLDEAAFVR ETQASFKLGI EFVDWRRPGH RYFHPFGRHG VELDQIPFHQ HWLKARADGG
QHPLAAFSLA TTLAEANRFA KPVADPRSIL STLGYAYHFD ATLYAAHLRR LAEAGGVVRH
EGKVATVERD PQSGFVTALV TDTGIRVEGE LFIDCSGFRA MLIGETMGAE FQDWSHWLPC
DRAVAAPCAR VAETTPYTRS TLRPAGWQWR IPLQHRTGNG YVYASALVSD DEAAATLLRN
LDGDLLADPR FLRFQAGFRR ESWRGNVVAI GLSSGFLEPL ESTSIHLIQS GVAKLITLFP
DRDCDPRLAH QFNSLFARDM DGIRDFLILH YHATEGHNAP LWRQARAMAL PDSLTDKLAH
YRRSGRLMLT PDELFREASW LAVLEGQGVS AQGFAPLADT LDSAQNLRQL NDIASLIARV
APTLPHHDAA ISELIRSAGA PLTSETAAPN STDRTSER