Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_2778 |
Symbol | trpC |
ID | 5900233 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 3015919 |
End bp | 3016707 |
Gene Length | 789 bp |
Protein Length | 262 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641563270 |
Product | indole-3-glycerol-phosphate synthase |
Protein accession | YP_001684403 |
Protein GI | 167646740 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0134] Indole-3-glycerol phosphate synthase |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.000331448 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.0000255859 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACCGACA TCCTCGCCAA GATCGCCGCC TACAAGCGCG AGGACGTGGC CGATCGGAAA CGTCGCCGGT CCATTGCTCA GCTCGAGGCC GCCGCCAAGG CCGCCAACAG CCCGCGCGGC TTCAAGGCCG CGCTCGAGGC CGACCACGCG CCAGGCAAGC TGGCTCTGAT CGCCGAAATC AAGAAGGCCT CGCCGTCCAA GGGCCTGATC CGCGCCGACT TCGACCCGCC AGCTCTGGCC CGCGCCTATG CCGCGGGCGG CGCATCGTGC CTGTCGGTGC TGACCGACGG CCCCAGCTTC CAGGGGGCTG ACGGCTATCT GATCGACGTC CGCGCCGCCG TCAGCCTGCC CTGCATCCGC AAGGACTTCC TCGTCGACCC CTGGCAGGTG GCCGAGAGCC GCGCCCTGGG CGCCGACGCC ATCCTGGTGA TCCTGGCGAT GATCGACGAC GCCGTCGCCG CCGACCTGAT GAGCGAGGCC GCCCGCCTGG GCATGGACGC CCTGGTCGAG GTGCACGACG AGGCCGAGAT GGAGCGAGCC GGCAAGCTGG GCTCGACCCT GGTCGGGATC AACAACCGCG ACCTGAAAAG CTTCGTCGTC GACCTGGCCG TCACCGAACG CCTGGCCGTC CAGGCGCCCA GCGACGCCCT GCTGGTTACC GAAAGCGGCC TGTTCGTCGC CGCCGACGTG GCGCGCATGG AAGCGGCTGG CGCCAAGGCC ATGCTGGTCG GCGAGAGCCT GATGCGCCAG GCGGATGTGG CCGCCGCGAC GCGAGCCTTG CTGGGCTAG
|
Protein sequence | MTDILAKIAA YKREDVADRK RRRSIAQLEA AAKAANSPRG FKAALEADHA PGKLALIAEI KKASPSKGLI RADFDPPALA RAYAAGGASC LSVLTDGPSF QGADGYLIDV RAAVSLPCIR KDFLVDPWQV AESRALGADA ILVILAMIDD AVAADLMSEA ARLGMDALVE VHDEAEMERA GKLGSTLVGI NNRDLKSFVV DLAVTERLAV QAPSDALLVT ESGLFVAADV ARMEAAGAKA MLVGESLMRQ ADVAAATRAL LG
|
| |