Gene Caul_0101 details


Gene Information

Locus tag: Caul_0101
Symbol:
ID: 5897813
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 110491
End bp: 111810
Gene Length: 1320 bp
Protein Length: 439 aa
Translation table: 11
GC content: 72%
IMG OID: 641560585
Product: 3-phosphoshikimate 1-carboxyvinyltransferase
Protein accession: YP_001681737
Protein GI: 167644074
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0128] 5-enolpyruvylshikimate-3-phosphate synthase
TIGRFAM ID: [TIGR01356] 3-phosphoshikimate 1-carboxyvinyltransferase
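
The start and end coordinates, gene length, and protein length listed above are mutually consistent. The following is a minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the gene length includes the stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with inclusive 1-based coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # subtract the stop codon


if __name__ == "__main__":
    length = gene_length(110491, 111810)
    print(length, protein_length(length))
```

With the values from this record, the sketch yields 1320 bp and 439 aa, matching the Gene Length and Protein Length fields.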


Plasmid Coverage information

Num covering plasmid clones: 38
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.249045
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGGCGG CTGGATTGAA GAGCACCCCC GGTGGACCCC TCCGCGGGAC GGTCCGCGCG 
CCCGGCGACA AGTCGATTTC GCACCGGTCG ATGATCCTCG GCGCGCTGGC TTCTGGGACC
ACCACGGTGG AGGGCCTGCT GGAGGGCGCC GACGTTCTGG CGACCGCCCA GGCCATGCGG
TCGTTCGGCG CGCGGGTCGA GCAGGAAGGC GTCGGCCGCT GGCGGATCGA GGGCCAGGGC
GGCTTCCTGG AGCCGTCGGA CGTCGTCGAC TGCGGCAACG CCGGCACCGG CGTGCGGCTG
ATCATGGGCG CGGCGGCGGG CTTTCCCCTC TGCGCCACCT TCACCGGCGA CGGATCCCTG
CGCAGCCGGC CGATGAGCCG TGTGCTGGAC CCGCTGGCCC GCATGGGCGC CACCTGGCTG
GGCCGCGACA AGGGCCGCCT GCCGCTGACC CTGAAGGGCG GCAACCTGCG CGGACTGCAA
TACACGCTGC CGATGGCCTC GGCCCAGGTG AAGTCCGCCG TGCTGCTGGC CGGCCTGCAC
GCCGAGGGCG GGGTCGAGGT GATCGAGCCG GAAGCCACCC GCGACCACAC CGAGCGCATG
CTGCGCGCCT TCGGGGCCGA GGTGATCGTC GAGGATCAGG GCGGCGTGCG GCATATCCGC
CTGCCGGCTG GCCAGAAGCT GACCGGAACC CACGTGGCGG TGCCGGGCGA CCCGTCGTCG
GCGGCCTTCC CGCTGGTGGC CGGGCTGATC GTTCCCGGCT CGGAAGTGAC GGTCGAGGGC
GTGATGCTCA ACGAACTGCG CACCGGCCTG TTCACCACCC TGCGGGAAAT GGGCGCGGAT
CTGGTGATCT CGAACGTCCG TGAAAGCAGC GGCGAGGAGG TCGGCGACAT CACCGCCCGC
TACTCGCGGA TGCATGGCGT CGTCGTGCCG CCCGAACGGG CCCCGGCGAT GATCGACGAA
TATCCGATCC TGGCCGTCGC CGCCGCCTTC GCGACCGGCG ACACCGTGAT GCGCGGCGTC
GGCGAGATGC GGGTCAAGGA AAGCGACCGC ATCGCCCTGA CGGCCGCCGG CCTGGAGGCC
TGCGGCGTCG ATGTCGAGGA GGAGCCGGAG GGCTTCATCG TCCATGGGAC CGGCCAGGCG
CCGCGCGGCG GGGCCATGGT CGAGACCCAT GGCGACCATC GCATCGCCAT GAGCCACCTG
ATCCTCGGCC TGGCCGCCCA GTCGGCGGTG TCGATCGACG AGCCGGGCAT GATCGCCACC
AGCTTCCCGG GCTTCGCCGA GATGATGCGC GGCCTGGGCG GCGACCTGGT CGAGGCCTAG
 
Protein sequence
MTAAGLKSTP GGPLRGTVRA PGDKSISHRS MILGALASGT TTVEGLLEGA DVLATAQAMR 
SFGARVEQEG VGRWRIEGQG GFLEPSDVVD CGNAGTGVRL IMGAAAGFPL CATFTGDGSL
RSRPMSRVLD PLARMGATWL GRDKGRLPLT LKGGNLRGLQ YTLPMASAQV KSAVLLAGLH
AEGGVEVIEP EATRDHTERM LRAFGAEVIV EDQGGVRHIR LPAGQKLTGT HVAVPGDPSS
AAFPLVAGLI VPGSEVTVEG VMLNELRTGL FTTLREMGAD LVISNVRESS GEEVGDITAR
YSRMHGVVVP PERAPAMIDE YPILAVAAAF ATGDTVMRGV GEMRVKESDR IALTAAGLEA
CGVDVEEEPE GFIVHGTGQA PRGGAMVETH GDHRIAMSHL ILGLAAQSAV SIDEPGMIAT
SFPGFAEMMR GLGGDLVEA
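
The protein sequence above is the translation of the gene sequence under translation table 11. The following is a minimal sketch of how that correspondence could be checked, along with a GC fraction helper. It assumes the sequence is pasted as space-separated blocks as shown in this record; the names `translate`, `gc_fraction`, and `clean` are illustrative. Table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in which start codons are allowed, which does not matter here because this gene starts with ATG.

```python
# Codon table built from the conventional TCAG ordering.
# Amino acid assignments are the same for translation tables 1 and 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate an in-frame DNA sequence, dropping one trailing stop codon if present."""
    dna = clean(seq)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))
    return protein[:-1] if protein.endswith("*") else protein


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # First 30 bases of the gene sequence in this record.
    print(translate("ATGACGGCGG CTGGATTGAA GAGCACCCCC"))  # MTAAGLKSTP
```

Passing the full gene sequence to `translate` and comparing the result with the cleaned protein sequence would confirm the two listings agree; `gc_fraction` on the full gene sequence can be compared with the GC content field.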