Gene Information |
Locus tag | Caul_0101 |
Symbol | |
ID | 5897813 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 110491 |
End bp | 111810 |
Gene Length | 1320 bp |
Protein Length | 439 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641560585 |
Product | 3-phosphoshikimate 1-carboxyvinyltransferase |
Protein accession | YP_001681737 |
Protein GI | 167644074 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0128] 5-enolpyruvylshikimate-3-phosphate synthase |
TIGRFAM ID | [TIGR01356] 3-phosphoshikimate 1-carboxyvinyltransferase |
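The coordinate and length fields above agree with one another, which can be checked with a little arithmetic. The following is a minimal Python sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the final codon is a stop codon not counted in Protein Length. The function names are hypothetical and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record for Caul_0101.
length = gene_length(110491, 111810)
print(length)                  # 1320
print(protein_length(length))  # 439
```

Both results match the Gene Length (1320 bp) and Protein Length (439 aa) fields listed in the record.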
|
|
Plasmid Coverage information |
Num covering plasmid clones | 38 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.249045 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGGCGG CTGGATTGAA GAGCACCCCC GGTGGACCCC TCCGCGGGAC GGTCCGCGCG CCCGGCGACA AGTCGATTTC GCACCGGTCG ATGATCCTCG GCGCGCTGGC TTCTGGGACC ACCACGGTGG AGGGCCTGCT GGAGGGCGCC GACGTTCTGG CGACCGCCCA GGCCATGCGG TCGTTCGGCG CGCGGGTCGA GCAGGAAGGC GTCGGCCGCT GGCGGATCGA GGGCCAGGGC GGCTTCCTGG AGCCGTCGGA CGTCGTCGAC TGCGGCAACG CCGGCACCGG CGTGCGGCTG ATCATGGGCG CGGCGGCGGG CTTTCCCCTC TGCGCCACCT TCACCGGCGA CGGATCCCTG CGCAGCCGGC CGATGAGCCG TGTGCTGGAC CCGCTGGCCC GCATGGGCGC CACCTGGCTG GGCCGCGACA AGGGCCGCCT GCCGCTGACC CTGAAGGGCG GCAACCTGCG CGGACTGCAA TACACGCTGC CGATGGCCTC GGCCCAGGTG AAGTCCGCCG TGCTGCTGGC CGGCCTGCAC GCCGAGGGCG GGGTCGAGGT GATCGAGCCG GAAGCCACCC GCGACCACAC CGAGCGCATG CTGCGCGCCT TCGGGGCCGA GGTGATCGTC GAGGATCAGG GCGGCGTGCG GCATATCCGC CTGCCGGCTG GCCAGAAGCT GACCGGAACC CACGTGGCGG TGCCGGGCGA CCCGTCGTCG GCGGCCTTCC CGCTGGTGGC CGGGCTGATC GTTCCCGGCT CGGAAGTGAC GGTCGAGGGC GTGATGCTCA ACGAACTGCG CACCGGCCTG TTCACCACCC TGCGGGAAAT GGGCGCGGAT CTGGTGATCT CGAACGTCCG TGAAAGCAGC GGCGAGGAGG TCGGCGACAT CACCGCCCGC TACTCGCGGA TGCATGGCGT CGTCGTGCCG CCCGAACGGG CCCCGGCGAT GATCGACGAA TATCCGATCC TGGCCGTCGC CGCCGCCTTC GCGACCGGCG ACACCGTGAT GCGCGGCGTC GGCGAGATGC GGGTCAAGGA AAGCGACCGC ATCGCCCTGA CGGCCGCCGG CCTGGAGGCC TGCGGCGTCG ATGTCGAGGA GGAGCCGGAG GGCTTCATCG TCCATGGGAC CGGCCAGGCG CCGCGCGGCG GGGCCATGGT CGAGACCCAT GGCGACCATC GCATCGCCAT GAGCCACCTG ATCCTCGGCC TGGCCGCCCA GTCGGCGGTG TCGATCGACG AGCCGGGCAT GATCGCCACC AGCTTCCCGG GCTTCGCCGA GATGATGCGC GGCCTGGGCG GCGACCTGGT CGAGGCCTAG
|
Protein sequence | MTAAGLKSTP GGPLRGTVRA PGDKSISHRS MILGALASGT TTVEGLLEGA DVLATAQAMR SFGARVEQEG VGRWRIEGQG GFLEPSDVVD CGNAGTGVRL IMGAAAGFPL CATFTGDGSL RSRPMSRVLD PLARMGATWL GRDKGRLPLT LKGGNLRGLQ YTLPMASAQV KSAVLLAGLH AEGGVEVIEP EATRDHTERM LRAFGAEVIV EDQGGVRHIR LPAGQKLTGT HVAVPGDPSS AAFPLVAGLI VPGSEVTVEG VMLNELRTGL FTTLREMGAD LVISNVRESS GEEVGDITAR YSRMHGVVVP PERAPAMIDE YPILAVAAAF ATGDTVMRGV GEMRVKESDR IALTAAGLEA CGVDVEEEPE GFIVHGTGQA PRGGAMVETH GDHRIAMSHL ILGLAAQSAV SIDEPGMIAT SFPGFAEMMR GLGGDLVEA
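The relationship between the gene sequence and the protein sequence can be illustrated by translating a few codons by hand. The following is a minimal Python sketch using only the standard library; the helper names are hypothetical. It assumes that translation table 11 assigns codons to amino acids in the same way as the standard code (the two differ in which start codons are permitted, which does not matter here because the gene begins with ATG). Only the first and last 30 bases of the gene are used, copied from the record; pasting the full sequence into the same functions would give the whole-gene translation and GC content.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the 10-base block spacing used in the record and upper-case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Raises KeyError on codons containing characters other than A, C, G, T.
    """
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


first_30 = "ATGACGGCGG CTGGATTGAA GAGCACCCCC"
last_30 = "GGCCTGGGCG GCGACCTGGT CGAGGCCTAG"

print(translate(first_30))  # MTAAGLKSTP
print(translate(last_30))   # GLGGDLVEA
```

The two outputs correspond to the first ten and last nine residues of the protein sequence shown above; the final codon, TAG, is the stop codon and contributes no residue.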