Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_3961 |
Symbol | |
ID | 5901423 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 4290654 |
End bp | 4291919 |
Gene Length | 1266 bp |
Protein Length | 421 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641564482 |
Product | chorismate synthase |
Protein accession | YP_001685584 |
Protein GI | 167647921 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0082] Chorismate synthase |
TIGRFAM ID | [TIGR00033] chorismate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.168181 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.429407 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAACTT TTTCCGCAAC AGACCCCTAT GCGATAGCCC TGCCCCCGGT GGGGGAAGAC GATCGTGAAG CGATCCGCAG GGGGCGGGTG TTCACCACGC CGACCTTTGC CCACCGCCCG CCTTTCCGCT ACACCGCCCC CATGTCGCAC AACACCTTCG GCCACCTGTT CCGCGTCACC ACCTGGGGCG AAAGCCACGG CCCGGCCCTG GGCTGCGTGA TCGACGGCGT CCCGCCGGGC GTCGCCGTCA CCGCCGAACA GATCCAGGCC TTCCTCGACA AGCGCCGCCC CGGCAATGGC AAATTCGTCA CCCAGCGCCA GGAGCCCGAC GCCGTGCGCA TCCTGTCGGG GGTGTTCGAG GACGCGCGCA GCGACGGCCA GCGGACCACC GGCACGCCGA TCAGCCTGAT GATCGACAAC ACCGACCAGC GCTCCAAGGA CTATGGCGAG ATCGCCCAGG CCTTCCGGCC AGGCCACGCC GACTATCCCT ATTTCGCCAA GTACGGCGTG CGCGACTATC GCGGCGGCGG GCGCAGCTCG GCGCGCGAGA CCGCCGCGCG GGTGGCGGCC GGGGCGGTGG CGCGCCTGGT GATCCCGGGC GTGACGGTGC GCGCGGCCCT GGTGCAGATC GGCCCGCACA GGATCGATCG CGGCAACTGG GACTGGGACC AGACGAACGA GAACCCCTAC TGGTCGCCCG ACGCGGCGAT CATCCCGGTC TGGGAAGAAC ATCTGGAAAA GGTCCGCAAG GCGGGCTCCT CGACCGGCGC CGTCGTCGAG GTCGAGGCCA CGGGCGTGCC GGCCGGCTGG GGCGCGCCGC TCTACGGCAA GCTCGACGCC GAGCTGGCGG CGGCCCTGAT GTCGATCAAC GCCGCCAAGG GCGTGGAGAT CGGCGAGGGC TTCGCCAGCG CCGCCCTGTC GGGGGAAGAG AACGCCGACC AGATGCGGAT GGGCGATGAC GGGCCGATGT TCCTGAGCAA CCATGCCGGC GGCGTGCTGG GCGGGCTGTC GACCGGCCAG CCGGTGGTGG CCCGGGTGGC CTTCAAGCCG ACTTCGTCGA TCCTGACCCC GCGCCAGAGC CTCAACGAGG CCGGCGAGGA GATCGACCTG CGCACCAAGG GCCGTCACGA CCCCTGCGTG GGCATCCGCG GCGTGCCGGT GGTCGAGGCC ATGACCGCCT GCGTGCTGGC CGACGCCTTC CTGCGCCACC GCGCCCAGAC CGGCGGCGGG GCTTTCGTTC CGGGGATGAG CGGGCGGCAG GGCTAA
|
Protein sequence | MTTFSATDPY AIALPPVGED DREAIRRGRV FTTPTFAHRP PFRYTAPMSH NTFGHLFRVT TWGESHGPAL GCVIDGVPPG VAVTAEQIQA FLDKRRPGNG KFVTQRQEPD AVRILSGVFE DARSDGQRTT GTPISLMIDN TDQRSKDYGE IAQAFRPGHA DYPYFAKYGV RDYRGGGRSS ARETAARVAA GAVARLVIPG VTVRAALVQI GPHRIDRGNW DWDQTNENPY WSPDAAIIPV WEEHLEKVRK AGSSTGAVVE VEATGVPAGW GAPLYGKLDA ELAAALMSIN AAKGVEIGEG FASAALSGEE NADQMRMGDD GPMFLSNHAG GVLGGLSTGQ PVVARVAFKP TSSILTPRQS LNEAGEEIDL RTKGRHDPCV GIRGVPVVEA MTACVLADAF LRHRAQTGGG AFVPGMSGRQ G
|
| |