Gene Caul_0311 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_0311 
Symbol 
ID5897585 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp349911 
End bp351008 
Gene Length1098 bp 
Protein Length365 aa 
Translation table11 
GC content71% 
IMG OID641560795 
Productphospho-2-dehydro-3-deoxyheptonate aldolase 
Protein accessionYP_001681946 
Protein GI167644283 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase 
TIGRFAM ID[TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value0.494385 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGCCCT CGACCGCCGC CAATCTCGCC TCGCTGCCCG TCCCCACCGA CGACCTGCGA 
ATCCGACAGC TTCAGACCCT CAGTCCGCCG GCCCAGGTGA TCGGCGAGGC GCCGGCCACC
TCATCGACGG CCGAGGTCGT CGGCGACGCG CGCCGCGCGG TGCATGAGAT CCTGGAAGGC
CGCGACGACC GCCTGGTGGT GGTGATCGGG CCATGCTCGA TCCACGATCC CAAGGCCGCC
CTCGACTACG CGCGCCGCCT GGCCGTCGAG CGCGAACGTC ACGCCGGCGA GTTGGAAGTG
ATCATGCGGG TGTATTTCGA GAAGCCGCGC ACCACGGTCG GCTGGAAGGG CCTGATCAAC
GACCCGGACA TGGACGGCGG CTTCCGGATC AATGAAGGCC TGCGGCTGGC GCGGCGGGTG
CTGCTCGACA TCAGCGCCCA GGGCCTGCCG GCGGCCTGCG AGTTCCTGGA CGTCACCACC
CCGCAATACA TCGCCGACCT GGTGGCCTGG GGCGCGATCG GAGCCCGCAC CACCGAAAGC
CAGATCCACC GCGAGATGGC GTCGGGCCTG TCCTGCCCGG TCGGATTCAA GAACGGCACC
AACGGCGACG TCAAGGTCGC GGTCGACGCG GTCCTGGCCG CCGCCCAGCC GCACCATTTC
CTGGCCGTGA CCAAGGAAGG GCGCGGCGCC ATCGCCACCA CCACGGGCAA CGCCGACTGC
CACGTCGTGC TGCGCGGCGG CAAGACGCCC AACTACGACG CCGCCAGCGT CGCGGCGGTG
GCCCAGGCCC TGACCGCCGC CGGCCTGCCG CCACGCGTCA TGGTGGACGC CAGCCACGCC
AACAGCGGCA AGGACCACGA GAACCAGCCC GGCGTGGTCG CCGATCTCTG CGCCCAGGTG
GCGACCGGCC ATTCGCCGAT CATGGGGGTG ATGATCGAGA GCCATCTCGT CGCCGGCCGG
CAGGACATCG TTCCAGGCCG GCCCCTGACC TACGGTCAGT CGGTCACCGA CGCCTGCATC
GACTGGGAGA CCTCGGTGCG CCTGCTGGAT CAGCTGGCGG CGGCGGTGCG CGCCGGACGC
CGGTCCATCG ACCGATAG
 
Protein sequence
MPPSTAANLA SLPVPTDDLR IRQLQTLSPP AQVIGEAPAT SSTAEVVGDA RRAVHEILEG 
RDDRLVVVIG PCSIHDPKAA LDYARRLAVE RERHAGELEV IMRVYFEKPR TTVGWKGLIN
DPDMDGGFRI NEGLRLARRV LLDISAQGLP AACEFLDVTT PQYIADLVAW GAIGARTTES
QIHREMASGL SCPVGFKNGT NGDVKVAVDA VLAAAQPHHF LAVTKEGRGA IATTTGNADC
HVVLRGGKTP NYDAASVAAV AQALTAAGLP PRVMVDASHA NSGKDHENQP GVVADLCAQV
ATGHSPIMGV MIESHLVAGR QDIVPGRPLT YGQSVTDACI DWETSVRLLD QLAAAVRAGR
RSIDR