Gene Caul_3961 details


Gene Information

Locus tag: Caul_3961
Symbol:
ID: 5901423
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 4290654
End bp: 4291919
Gene Length: 1266 bp
Protein Length: 421 aa
Translation table: 11
GC content: 71%
IMG OID: 641564482
Product: chorismate synthase
Protein accession: YP_001685584
Protein GI: 167647921
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0082] Chorismate synthase
TIGRFAM ID: [TIGR00033] chorismate synthase
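
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes the start and end coordinates are 1-based and inclusive (an assumption, though it reproduces the stated gene length) and that the final codon is a stop codon not counted in the protein length. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon except the stop codon encodes one residue.
    return gene_len_bp // 3 - 1


length = gene_length(4290654, 4291919)
print(length)                  # 1266
print(protein_length(length))  # 421
```

Both values agree with the Gene Length and Protein Length fields of this record.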


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value: 0.168181
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.429407
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACAACTT TTTCCGCAAC AGACCCCTAT GCGATAGCCC TGCCCCCGGT GGGGGAAGAC 
GATCGTGAAG CGATCCGCAG GGGGCGGGTG TTCACCACGC CGACCTTTGC CCACCGCCCG
CCTTTCCGCT ACACCGCCCC CATGTCGCAC AACACCTTCG GCCACCTGTT CCGCGTCACC
ACCTGGGGCG AAAGCCACGG CCCGGCCCTG GGCTGCGTGA TCGACGGCGT CCCGCCGGGC
GTCGCCGTCA CCGCCGAACA GATCCAGGCC TTCCTCGACA AGCGCCGCCC CGGCAATGGC
AAATTCGTCA CCCAGCGCCA GGAGCCCGAC GCCGTGCGCA TCCTGTCGGG GGTGTTCGAG
GACGCGCGCA GCGACGGCCA GCGGACCACC GGCACGCCGA TCAGCCTGAT GATCGACAAC
ACCGACCAGC GCTCCAAGGA CTATGGCGAG ATCGCCCAGG CCTTCCGGCC AGGCCACGCC
GACTATCCCT ATTTCGCCAA GTACGGCGTG CGCGACTATC GCGGCGGCGG GCGCAGCTCG
GCGCGCGAGA CCGCCGCGCG GGTGGCGGCC GGGGCGGTGG CGCGCCTGGT GATCCCGGGC
GTGACGGTGC GCGCGGCCCT GGTGCAGATC GGCCCGCACA GGATCGATCG CGGCAACTGG
GACTGGGACC AGACGAACGA GAACCCCTAC TGGTCGCCCG ACGCGGCGAT CATCCCGGTC
TGGGAAGAAC ATCTGGAAAA GGTCCGCAAG GCGGGCTCCT CGACCGGCGC CGTCGTCGAG
GTCGAGGCCA CGGGCGTGCC GGCCGGCTGG GGCGCGCCGC TCTACGGCAA GCTCGACGCC
GAGCTGGCGG CGGCCCTGAT GTCGATCAAC GCCGCCAAGG GCGTGGAGAT CGGCGAGGGC
TTCGCCAGCG CCGCCCTGTC GGGGGAAGAG AACGCCGACC AGATGCGGAT GGGCGATGAC
GGGCCGATGT TCCTGAGCAA CCATGCCGGC GGCGTGCTGG GCGGGCTGTC GACCGGCCAG
CCGGTGGTGG CCCGGGTGGC CTTCAAGCCG ACTTCGTCGA TCCTGACCCC GCGCCAGAGC
CTCAACGAGG CCGGCGAGGA GATCGACCTG CGCACCAAGG GCCGTCACGA CCCCTGCGTG
GGCATCCGCG GCGTGCCGGT GGTCGAGGCC ATGACCGCCT GCGTGCTGGC CGACGCCTTC
CTGCGCCACC GCGCCCAGAC CGGCGGCGGG GCTTTCGTTC CGGGGATGAG CGGGCGGCAG
GGCTAA
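
The GC content field of this record (71%) is conventionally the fraction of bases that are G or C. A minimal sketch of that calculation follows, assuming the spaces between the 10-base blocks are formatting only and that the sequence contains no ambiguity codes. How the originating database treats edge cases is not stated in the record, so this is an illustration rather than its actual method.

```python
def gc_content(sequence: str) -> float:
    """Fraction of G and C bases in a DNA sequence, ignoring whitespace."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First line of the gene sequence above, used only as sample input.
sample = "ATGACAACTT TTTCCGCAAC AGACCCCTAT GCGATAGCCC TGCCCCCGGT GGGGGAAGAC"
print(f"{gc_content(sample):.1%}")
```

Passing the full gene sequence instead of the single sample line gives the value for the whole gene.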
 
Protein sequence
MTTFSATDPY AIALPPVGED DREAIRRGRV FTTPTFAHRP PFRYTAPMSH NTFGHLFRVT 
TWGESHGPAL GCVIDGVPPG VAVTAEQIQA FLDKRRPGNG KFVTQRQEPD AVRILSGVFE
DARSDGQRTT GTPISLMIDN TDQRSKDYGE IAQAFRPGHA DYPYFAKYGV RDYRGGGRSS
ARETAARVAA GAVARLVIPG VTVRAALVQI GPHRIDRGNW DWDQTNENPY WSPDAAIIPV
WEEHLEKVRK AGSSTGAVVE VEATGVPAGW GAPLYGKLDA ELAAALMSIN AAKGVEIGEG
FASAALSGEE NADQMRMGDD GPMFLSNHAG GVLGGLSTGQ PVVARVAFKP TSSILTPRQS
LNEAGEEIDL RTKGRHDPCV GIRGVPVVEA MTACVLADAF LRHRAQTGGG AFVPGMSGRQ
G
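
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below is a simplified illustration, not the pipeline that produced this record. It builds the codon table from the standard compact 64-letter string, whose amino acid assignments are shared by the standard code and translation table 11 (table 11 differs in which codons may act as start codons, which this sketch does not handle). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGACAACTT TTTCCGCAAC AGACCCCTAT"))  # MTTFSATDPY
```

The output matches the first ten residues of the protein sequence, and the final six bases of the gene (GGCTAA) translate to the terminal G followed by a stop codon.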