Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_5244 |
Symbol | |
ID | 5897362 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010335 |
Strand | + |
Start bp | 175668 |
End bp | 176603 |
Gene Length | 936 bp |
Protein Length | 311 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 641555347 |
Product | tryptophan 23-dioxygenase |
Protein accession | YP_001676678 |
Protein GI | 167621893 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3483] Tryptophan 2,3-dioxygenase (vermilion) |
TIGRFAM ID | [TIGR03036] tryptophan 2,3-dioxygenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.637424 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACGCGTA AACGCATCCT GCCCCAGATC CGCTCGCCGG AGCTTGAGCT CTCCCTCGTG CGAGACATCG ACGCGTCCAC CGACGCCACG CGTCGCGCCA ATGTCGTTCA GACCGGCGGC GAACCCATCG TGGCCTTCGC CGAGACAAGC AATCCTTACA TCGACTTCCA CCGCAACGAC GTTCTTCACA GCCTGCAGCA CATGCGCACC GAGGCCTATG ACGAGTTTCC CTTCATCGTC ATGACCCAGG TGAAGGAGCT GATCTTCCGT TCGATCCACT ACGAATTGGC GAACATTCAG CAGCGCATCC GCGACGACGA CTTGACGGGT GCGCTTGTCC TGGCCCCGCG CCTTCGGCGT TGCCTTGAGC TCCTGGTCAA GACCTGGGAC GTGTTGTCGA CGATCACGAC GCAAGGGTTC AACGCGTTCC GCGACCAGCT GGGAACATCG TCCGGTCAGC AGTCGTACGC CTATCGGCAC GTTGAGTTCA TTCTGGGCAA CAAGTCGCGC CGCCTCGCCG CGGCCCACGC CAATAATCCC GATGTCTATC CAGCCATCGC CGAGGCGCTC AACAGCCCCA GCCTCTACGA CGACGCCATC GCCTTGCTGC ACCGTCGCGG CTTCGTTATC CCGCAGGATC GGCTGGAGCG CGATTGGGCC GAGGACTACG CGCCCAGCCC CGCCGTCGCC GCCGCCTGGT TGGCCGTCTA TGACACGCCA ACCCCGGAGA ACGATCTCTA CCAGCTCGGC GAAATGCTGA TCGAGATCGA CGATCTCTTC TCGCAATTTC GCTGGCGGCA CTTTGTTTCG GTGCAACGCA TTCTTGGCCT CAAGCCGGGC ACAGGCGGCT CAGCGGGAGT GGGGTGGCTG CGCGCGGCCG TCGACCTGAG GTTCTTCCCC GAACTGTGGT CCATCCGCAC CGAAATGGGG GCCTAG
|
Protein sequence | MTRKRILPQI RSPELELSLV RDIDASTDAT RRANVVQTGG EPIVAFAETS NPYIDFHRND VLHSLQHMRT EAYDEFPFIV MTQVKELIFR SIHYELANIQ QRIRDDDLTG ALVLAPRLRR CLELLVKTWD VLSTITTQGF NAFRDQLGTS SGQQSYAYRH VEFILGNKSR RLAAAHANNP DVYPAIAEAL NSPSLYDDAI ALLHRRGFVI PQDRLERDWA EDYAPSPAVA AAWLAVYDTP TPENDLYQLG EMLIEIDDLF SQFRWRHFVS VQRILGLKPG TGGSAGVGWL RAAVDLRFFP ELWSIRTEMG A
|
| |