Gene Caul_1840 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_1840 
Symbol 
ID5899295 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp1955413 
End bp1956933 
Gene Length1521 bp 
Protein Length506 aa 
Translation table11 
GC content68% 
IMG OID641562330 
Producttryptophan halogenase 
Protein accessionYP_001683467 
Protein GI167645804 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0665] Glycine/D-amino acid oxidases (deaminating) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.105646 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00205784 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACCCACG CTCCCGTCCG CAAGGTGCTG GTTCTGGGCG GCGGCACGGC CGGCTGGATG 
ACGGCGGCGG CCCTGGCTAA GGTGCTGCGC GGCCAGGTCG AGGTCACGCT GATCGAGTCC
GACCAGATCG CCACGGTCGG GGTCGGCGAG GCCACCATCC CGCCGATCCT GACCTTCAAC
GCCATGCTGG GCCTCGACGA GCGCGAGTTC ATGCGCGCCA CCAAGGCCAG CTTCAAGCTG
GGCATCGAGT TTGTCGACTG GACCCGCCTG GGCGACCGCT ACATGCATCC GTTCGGAACT
TTCGGCCTGG ATATCGAGGC CATCAAGTTC CATCAGGTCT GGCGCAAGCT GCGCGACCAG
GTCGGGCCGA TCGAGGACTT CAACCTAGCC GCCGTCGCCG CCAAGCAGAA CCGCTTCGCC
ATGCCCGACC GCGATCCGGC CAAGGTGCTG TCGAGCCTGA AATACGCCTT CCACTTCGAC
GCCGGCCTCT ATGCGCGGTT CCTGCGCGGC TTCGCCGAGG CCCGGGGCGC GACGCGGATC
GAGGGCAAGG TGGCCGACGT CGCCCTGCGC GGCGAGGACG GCTTCATCCA GTCGGTGACC
CTGGAGGACG GACGGACCTT CGAGGCCGAC CTGTTCATCG ACTGCACGGG CTTCCGCGCC
CTGCTGATCG GCCAGACCCT GGGCGGCGGC TATAAGGACT GGAGCCACTG GTTGCCCAAC
GACCGGGCCG TGGCGATCCC TTGCGGGGCC GGCGGCGACG GCCTGACGCC CTATACCCGC
GCCACGGCCG ACAAGGCCGG CTGGCGCTGG CGCATTCCGC TGCAGCACCG CACTGGCAAC
GGCTATGTCT ATTCCAGCGC CCACATCAGC GACGACGACG CCCTGGCGGC CCTGATCGCC
GGCCTCGACG GCCCAGCCCA GGCCGAGCCG AACTTCCTGC GCTTCCAGGC CGGCCGCCGC
GACAGGGCCT GGATCAAGAA CTGCGTCGCC ATCGGCCTGT CGTCCGGCTT CCTCGAGCCG
CTGGAGAGCA CCAGCATCCA CCTGATCCAG GCGGGGATCA CCAAGCTCCT GGCCCTGTTT
CCGGACAAGG GTTTCGATTC CCTGGAGATC GACGAATACA ATCGCCTGAC CGCCCTGCAG
GTCGAGTTGG TGCGCGACTT CATCATCCTG CACTTCAAGG CCACGGAACG CTCGGACACG
CCCTATTGGG ACTATGTCCG GACCATGGAC ATTCCCGAGA GCCTGCGACG CAAGATCGAG
CTGTTCGCCG GTCGTGGGCG CTTGTTCCAG TCCGACTACG ACCTGTTCGC CGAGCCCAGC
TGGATCGCGG TGCTGATGGG CCAGGGAATC ACGCCGCGCC AATACGACCC CCTGGTCGAC
GCCCTGCCCG AGCCGGCCCT CGTCCAGCGC CTGCAACGCA TGTCCGACCT GATCGGCCAG
ACCGCCCAGG CCATGCCCAG CCATCAGGCC TTCATCGCCC GCTATTGCGC CGCCGACGCG
GTCGCCAACA TTCCAGCATG A
 
Protein sequence
MTHAPVRKVL VLGGGTAGWM TAAALAKVLR GQVEVTLIES DQIATVGVGE ATIPPILTFN 
AMLGLDEREF MRATKASFKL GIEFVDWTRL GDRYMHPFGT FGLDIEAIKF HQVWRKLRDQ
VGPIEDFNLA AVAAKQNRFA MPDRDPAKVL SSLKYAFHFD AGLYARFLRG FAEARGATRI
EGKVADVALR GEDGFIQSVT LEDGRTFEAD LFIDCTGFRA LLIGQTLGGG YKDWSHWLPN
DRAVAIPCGA GGDGLTPYTR ATADKAGWRW RIPLQHRTGN GYVYSSAHIS DDDALAALIA
GLDGPAQAEP NFLRFQAGRR DRAWIKNCVA IGLSSGFLEP LESTSIHLIQ AGITKLLALF
PDKGFDSLEI DEYNRLTALQ VELVRDFIIL HFKATERSDT PYWDYVRTMD IPESLRRKIE
LFAGRGRLFQ SDYDLFAEPS WIAVLMGQGI TPRQYDPLVD ALPEPALVQR LQRMSDLIGQ
TAQAMPSHQA FIARYCAADA VANIPA