Gene Information |
Locus tag | Caul_3278 |
Symbol | |
ID | 5900733 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 3543259 |
End bp | 3544815 |
Gene Length | 1557 bp |
Protein Length | 518 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 641563784 |
Product | tryptophan halogenase |
Protein accession | YP_001684903 |
Protein GI | 167647240 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0665] Glycine/D-amino acid oxidases (deaminating) |
TIGRFAM ID | |
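The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length; the function names are hypothetical and are not part of the database.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3543259
END_BP = 3544815
REPORTED_GENE_LENGTH = 1557      # bp
REPORTED_PROTEIN_LENGTH = 518    # aa


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(n_bp: int) -> int:
    """Protein length in aa for an unspliced CDS.

    Assumes the CDS length is a multiple of 3 and that the final codon
    is a stop codon, which does not contribute a residue.
    """
    codons, remainder = divmod(n_bp, 3)
    if remainder != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1


if __name__ == "__main__":
    n_bp = gene_length(START_BP, END_BP)
    print(n_bp == REPORTED_GENE_LENGTH)
    print(protein_length(n_bp) == REPORTED_PROTEIN_LENGTH)
```

The strand field (-) does not change the length calculation; it only determines which strand of the replicon is read.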
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.943732 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACCGTA TCGAAAAGGT CGTCATCCTG GGCGGCGGAA CCGCCGGATG GATGACCGCG GCGGCGCTCT CGCGCCGCCT TGGCCGATCC CTTCGCATCG ACCTGGTGGA GTCGGATGCG ATCGGCACGG TGGGCGTGGG CGAAGCGACG ATACCGACGA TCCACTGGTT CAACGACTTG ATCGGTCTGG ACGAGGCGGC GTTCGTGCGC GAGACCCAGG CCAGTTTCAA ACTCGGCATC GAGTTCGTCG ATTGGCGGCG TCCCGGGCAT CGCTACTTCC ATCCGTTCGG GCGCCACGGC GTGGAACTGG ACCAGATCCC CTTCCATCAG CACTGGCTGA AGGCGCGTGC GGACGGCGGT CAACATCCTC TTGCGGCTTT CTCGCTCGCC ACGACTCTGG CCGAGGCCAA CCGCTTCGCC AAGCCCGTCG CCGATCCTCG GTCGATCCTC TCCACCCTGG GATACGCCTA TCACTTCGAC GCCACGCTCT ACGCCGCGCA TCTGCGTCGG CTGGCCGAGG CTGGCGGGGT GGTCCGTCAT GAAGGCAAGG TCGCGACGGT GGAGCGTGAT CCGCAAAGCG GCTTTGTAAC CGCGCTGGTG ACCGACACGG GCATAAGGGT CGAGGGCGAG CTGTTCATCG ACTGCTCGGG TTTCAGGGCG ATGCTGATCG GCGAGACGAT GGGCGCCGAG TTCCAGGACT GGTCACATTG GTTGCCCTGC GACCGCGCCG TGGCCGCGCC CTGCGCCCGT GTCGCCGAGA CCACGCCCTA CACCCGGTCG ACCCTGCGTC CGGCAGGCTG GCAATGGCGC ATCCCCCTGC AGCATCGGAC CGGCAACGGT TATGTCTATG CCAGCGCCCT GGTGTCCGAT GACGAGGCGG CCGCGACGCT GTTGCGAAAC CTTGACGGCG ATCTGTTGGC CGACCCTCGC TTCCTGCGCT TCCAGGCCGG ATTCCGGCGC GAAAGCTGGC GGGGCAATGT TGTCGCCATT GGCCTGTCGT CGGGCTTCCT CGAACCCCTG GAGTCGACCA GCATCCATCT GATCCAGAGC GGCGTTGCGA AACTGATCAC CCTGTTTCCG GACCGCGACT GCGATCCTCG CCTGGCGCAT CAGTTCAACA GCCTGTTCGC CCGCGACATG GATGGCATAC GCGATTTTCT GATCCTGCAT TATCACGCGA CCGAAGGTCA CAACGCGCCG CTCTGGCGGC AAGCCCGCGC CATGGCCCTG CCCGACAGCT TGACCGACAA ACTGGCGCAC TACCGCCGCT CCGGTCGCTT GATGCTGACG CCCGACGAGT TGTTTCGCGA AGCAAGCTGG CTAGCCGTGC TTGAAGGGCA GGGGGTGTCC GCACAGGGGT TCGCGCCCTT GGCCGATACG CTCGACTCCG CGCAGAACCT GCGCCAATTG AACGACATCG CGTCGCTCAT CGCTCGGGTG GCGCCGACCC TTCCTCACCA TGACGCCGCG ATTAGCGAAC TGATCCGATC GGCTGGCGCG CCGCTGACTT CGGAGACGGC TGCGCCAAAC TCAACAGATC GGACATCCGA GCGATAA
|
Protein sequence | MNRIEKVVIL GGGTAGWMTA AALSRRLGRS LRIDLVESDA IGTVGVGEAT IPTIHWFNDL IGLDEAAFVR ETQASFKLGI EFVDWRRPGH RYFHPFGRHG VELDQIPFHQ HWLKARADGG QHPLAAFSLA TTLAEANRFA KPVADPRSIL STLGYAYHFD ATLYAAHLRR LAEAGGVVRH EGKVATVERD PQSGFVTALV TDTGIRVEGE LFIDCSGFRA MLIGETMGAE FQDWSHWLPC DRAVAAPCAR VAETTPYTRS TLRPAGWQWR IPLQHRTGNG YVYASALVSD DEAAATLLRN LDGDLLADPR FLRFQAGFRR ESWRGNVVAI GLSSGFLEPL ESTSIHLIQS GVAKLITLFP DRDCDPRLAH QFNSLFARDM DGIRDFLILH YHATEGHNAP LWRQARAMAL PDSLTDKLAH YRRSGRLMLT PDELFREASW LAVLEGQGVS AQGFAPLADT LDSAQNLRQL NDIASLIARV APTLPHHDAA ISELIRSAGA PLTSETAAPN STDRTSER
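The sequences above are printed in blocks of 10 characters separated by spaces, so the spacing has to be removed before any downstream use. The following is a minimal sketch of that cleanup together with a GC percentage calculation; the helper names are hypothetical, and the example uses only the first three blocks of the gene sequence, so its result is not the whole-gene GC content reported in the record.

```python
def clean_sequence(raw: str) -> str:
    """Remove the block spacing and any line breaks, and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


if __name__ == "__main__":
    # First three 10-base blocks of the gene sequence, copied from the record.
    fragment = "ATGAACCGTA TCGAAAAGGT CGTCATCCTG"
    cleaned = clean_sequence(fragment)
    print(len(cleaned), cleaned[:3])   # length and start codon of the fragment
    print(round(gc_percent(fragment), 2))
```

Passing the full gene sequence to `gc_percent` in the same way gives a value that can be compared with the GC content field.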