Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_2064 |
Symbol | |
ID | 5899519 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 2204006 |
End bp | 2205508 |
Gene Length | 1503 bp |
Protein Length | 500 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641562553 |
Product | tryptophan halogenase |
Protein accession | YP_001683690 |
Protein GI | 167646027 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0665] Glycine/D-amino acid oxidases (deaminating) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.0180513 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGAAGCCGC TGAAGAAAAT CCTCATCGCC GGCGGCGGCT CGGCCGGCTG GATGACGGCG GCCCTGTGCG CCAAGCTGTT CCAGGGCCTC TACGAAATCG TCCTGATCGA GTCCGAGGAG ATCGGCACGG TCGGGGTGGG GGAGGCGACC ATCCCCGCGA TCAAGAGGTT CAACGAACTT CTGGGCCTGG ACGAGGACGA CTTCCTGCGC CGCACCCAGG GCAGCTTCAA GCTGGGCATC CAGTTCAAGG ACTGGTCGCG GCTCGGCTCC AGCTACGTCC ACGGTTTCGG GGTGATCGGC CAGGACCTGG GATGGCTGCG CTGCCATCAG TACTGGCTGC GCATGAACGC CTTGGGCCAC GGCGGGGATT TCGCCCAGCT GTCGATCAAC ACGGCCGCGG CGCTCGACAA CAGGTTCATG CGCGCCAAGC CGGAGATGGG CGACTCGCCC ATCGCCCACA TCGCCCACGC CTTCCATTTC GACGCCGGCC TCTATGCCCG CTACCTCAGC GGCTACGCCC AGGAGCGCGG GGTGCGCCGG CGCGAGGGCA AGATCGTCGA TGTCGCCCTG CGAAGCGACG ACGGGTTCGT GCAGTCGGTG ACCATGGACG ACGGCGAGGT GATCGCCGCC GATCTGTTTG TCGACTGCTC GGGCTTCCGC GGCCTGATCA TCGAGCAGGC CATGAAGACC GGCTACGAGG CGTGGAAGCA CTGGCTGCCG TGCGACCGCG CCATCGCCGT CCCGTGCGAG CGCTCGGCGA ACTTCACGCC CTACACCCGC TCGACGGCCC GCGAAGCCGG CTGGCAGTGG CGCATCCCCC TGCAGCACCG CACCGGCAAC GGCCACGTCT ATTCCAGCGA GCACATCGAC GACGACGAGG CCGAACGGGT GCTGCTCGCC AACCTCGACG GCGCCCAGCG GGCCGATCCG TTGCGCATCC GCTTCGTCAC CGGCAAGCGC AAGAAGATCT GGAACCGCAA TTGCGTAGCC ATCGGCCTGG CCAGCGGCTT CCTGGAGCCG CTGGAATCCA CCAGCCTGCA CCTGATCCAG TCGGCGATCA TCCGCATGGT GCGCCTGCTG CCGGACGCCG GCTTCGATCA GGCGGGGATC GACGAGTTCA ATCGCCAGAG CGACTTCGAA TACGAGCGCA TCCGCGACTT CATCATCCTC CACTACAAGG CCACCCAGCG CGACGATACC GCCTTCTGGC GCTATTGCCG CGACATGGAG GTCCCCGCGA CCCTGCAGCG GAAGATCGAC CTGTTCTCGG CCAACGGCCG GGTCTTCCGG GAAGACGACG AACTGTTCAC CGAGGAGAGC TGGATCCAGG TGTTCCTCGG GCAGGGGATC ATCCCGCGAG GCTACGATCC GCTGGTTCAG GTCCAGAGCG ACGCCCAGAT CGCCCAGTAT CTCGCCAATA TCGAGACGGT CATCGGCAAG TGCGTGAAGG TGATGCCGAC CCACGCCGAT TTCGTCGCCA AGACCTGCCA GGCACCGGGA TGA
|
Protein sequence | MKPLKKILIA GGGSAGWMTA ALCAKLFQGL YEIVLIESEE IGTVGVGEAT IPAIKRFNEL LGLDEDDFLR RTQGSFKLGI QFKDWSRLGS SYVHGFGVIG QDLGWLRCHQ YWLRMNALGH GGDFAQLSIN TAAALDNRFM RAKPEMGDSP IAHIAHAFHF DAGLYARYLS GYAQERGVRR REGKIVDVAL RSDDGFVQSV TMDDGEVIAA DLFVDCSGFR GLIIEQAMKT GYEAWKHWLP CDRAIAVPCE RSANFTPYTR STAREAGWQW RIPLQHRTGN GHVYSSEHID DDEAERVLLA NLDGAQRADP LRIRFVTGKR KKIWNRNCVA IGLASGFLEP LESTSLHLIQ SAIIRMVRLL PDAGFDQAGI DEFNRQSDFE YERIRDFIIL HYKATQRDDT AFWRYCRDME VPATLQRKID LFSANGRVFR EDDELFTEES WIQVFLGQGI IPRGYDPLVQ VQSDAQIAQY LANIETVIGK CVKVMPTHAD FVAKTCQAPG
|
| |