Gene Information |
Locus tag | Caul_1840 |
Symbol | |
ID | 5899295 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 1955413 |
End bp | 1956933 |
Gene Length | 1521 bp |
Protein Length | 506 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641562330 |
Product | tryptophan halogenase |
Protein accession | YP_001683467 |
Protein GI | 167645804 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0665] Glycine/D-amino acid oxidases (deaminating) |
TIGRFAM ID | |
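The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and rests on two assumptions that the record does not state: that Start bp and End bp are 1-based inclusive coordinates, and that the gene length includes a single terminal stop codon that is not counted in the protein length. The variable names are made up for this example.

```python
# Values copied from the Gene Information table.
start_bp = 1955413
end_bp = 1956933
gene_length_bp = 1521
protein_length_aa = 506

# Assumption: coordinates are 1-based and inclusive, so span = end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the unspliced CDS ends in one stop codon that is not translated,
# so the expected protein length is the number of codons minus one.
codons, remainder = divmod(gene_length_bp, 3)
expected_aa = codons - 1

print(span, remainder, expected_aa)  # prints: 1521 0 506
```

Under those assumptions the three fields agree: the span equals the stated gene length, the length is a whole number of codons, and the codon count minus the stop codon matches the stated protein length.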
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.105646 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.00205784 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACCCACG CTCCCGTCCG CAAGGTGCTG GTTCTGGGCG GCGGCACGGC CGGCTGGATG ACGGCGGCGG CCCTGGCTAA GGTGCTGCGC GGCCAGGTCG AGGTCACGCT GATCGAGTCC GACCAGATCG CCACGGTCGG GGTCGGCGAG GCCACCATCC CGCCGATCCT GACCTTCAAC GCCATGCTGG GCCTCGACGA GCGCGAGTTC ATGCGCGCCA CCAAGGCCAG CTTCAAGCTG GGCATCGAGT TTGTCGACTG GACCCGCCTG GGCGACCGCT ACATGCATCC GTTCGGAACT TTCGGCCTGG ATATCGAGGC CATCAAGTTC CATCAGGTCT GGCGCAAGCT GCGCGACCAG GTCGGGCCGA TCGAGGACTT CAACCTAGCC GCCGTCGCCG CCAAGCAGAA CCGCTTCGCC ATGCCCGACC GCGATCCGGC CAAGGTGCTG TCGAGCCTGA AATACGCCTT CCACTTCGAC GCCGGCCTCT ATGCGCGGTT CCTGCGCGGC TTCGCCGAGG CCCGGGGCGC GACGCGGATC GAGGGCAAGG TGGCCGACGT CGCCCTGCGC GGCGAGGACG GCTTCATCCA GTCGGTGACC CTGGAGGACG GACGGACCTT CGAGGCCGAC CTGTTCATCG ACTGCACGGG CTTCCGCGCC CTGCTGATCG GCCAGACCCT GGGCGGCGGC TATAAGGACT GGAGCCACTG GTTGCCCAAC GACCGGGCCG TGGCGATCCC TTGCGGGGCC GGCGGCGACG GCCTGACGCC CTATACCCGC GCCACGGCCG ACAAGGCCGG CTGGCGCTGG CGCATTCCGC TGCAGCACCG CACTGGCAAC GGCTATGTCT ATTCCAGCGC CCACATCAGC GACGACGACG CCCTGGCGGC CCTGATCGCC GGCCTCGACG GCCCAGCCCA GGCCGAGCCG AACTTCCTGC GCTTCCAGGC CGGCCGCCGC GACAGGGCCT GGATCAAGAA CTGCGTCGCC ATCGGCCTGT CGTCCGGCTT CCTCGAGCCG CTGGAGAGCA CCAGCATCCA CCTGATCCAG GCGGGGATCA CCAAGCTCCT GGCCCTGTTT CCGGACAAGG GTTTCGATTC CCTGGAGATC GACGAATACA ATCGCCTGAC CGCCCTGCAG GTCGAGTTGG TGCGCGACTT CATCATCCTG CACTTCAAGG CCACGGAACG CTCGGACACG CCCTATTGGG ACTATGTCCG GACCATGGAC ATTCCCGAGA GCCTGCGACG CAAGATCGAG CTGTTCGCCG GTCGTGGGCG CTTGTTCCAG TCCGACTACG ACCTGTTCGC CGAGCCCAGC TGGATCGCGG TGCTGATGGG CCAGGGAATC ACGCCGCGCC AATACGACCC CCTGGTCGAC GCCCTGCCCG AGCCGGCCCT CGTCCAGCGC CTGCAACGCA TGTCCGACCT GATCGGCCAG ACCGCCCAGG CCATGCCCAG CCATCAGGCC TTCATCGCCC GCTATTGCGC CGCCGACGCG GTCGCCAACA TTCCAGCATG A
|
Protein sequence | MTHAPVRKVL VLGGGTAGWM TAAALAKVLR GQVEVTLIES DQIATVGVGE ATIPPILTFN AMLGLDEREF MRATKASFKL GIEFVDWTRL GDRYMHPFGT FGLDIEAIKF HQVWRKLRDQ VGPIEDFNLA AVAAKQNRFA MPDRDPAKVL SSLKYAFHFD AGLYARFLRG FAEARGATRI EGKVADVALR GEDGFIQSVT LEDGRTFEAD LFIDCTGFRA LLIGQTLGGG YKDWSHWLPN DRAVAIPCGA GGDGLTPYTR ATADKAGWRW RIPLQHRTGN GYVYSSAHIS DDDALAALIA GLDGPAQAEP NFLRFQAGRR DRAWIKNCVA IGLSSGFLEP LESTSIHLIQ AGITKLLALF PDKGFDSLEI DEYNRLTALQ VELVRDFIIL HFKATERSDT PYWDYVRTMD IPESLRRKIE LFAGRGRLFQ SDYDLFAEPS WIAVLMGQGI TPRQYDPLVD ALPEPALVQR LQRMSDLIGQ TAQAMPSHQA FIARYCAADA VANIPA
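Both sequences are printed in space-separated blocks of ten letters, so the spaces must be stripped before any calculation. The following is a minimal sketch of how the sequence rows could be cleaned and checked against the GC content and length fields; the helper names (`clean_sequence`, `gc_percent`, `lengths_consistent`) are hypothetical, and the check again assumes a single untranslated stop codon at the end of the gene.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the whitespace that splits a sequence into 10-letter blocks."""
    return "".join(blocked.split()).upper()


def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA sequence."""
    dna = clean_sequence(dna)
    if not dna:
        raise ValueError("empty sequence")
    gc = dna.count("G") + dna.count("C")
    return 100.0 * gc / len(dna)


def lengths_consistent(dna: str, protein: str) -> bool:
    """Check that the DNA is a whole number of codons and that the protein
    has one residue per codon, excluding one assumed terminal stop codon."""
    dna = clean_sequence(dna)
    protein = clean_sequence(protein)
    return len(dna) % 3 == 0 and len(dna) // 3 - 1 == len(protein)


# Toy inputs; paste the full "Gene sequence" and "Protein sequence" rows
# in place of these strings to check the record itself.
print(clean_sequence("atg acc tga"))           # prints: ATGACCTGA
print(gc_percent("GGCC AATT"))                 # prints: 50.0
print(lengths_consistent("ATGACC TGA", "MT"))  # prints: True
```

Run on the full rows, `gc_percent` should land near the GC content listed in the table if that value was computed over the gene sequence, which the record does not say explicitly.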