Gene Caul_0721 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_0721 
SymbolaroB 
ID5898176 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp780163 
End bp781272 
Gene Length1110 bp 
Protein Length369 aa 
Translation table11 
GC content71% 
IMG OID641561203 
Product3-dehydroquinate synthase 
Protein accessionYP_001682352 
Protein GI167644689 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0337] 3-dehydroquinate synthetase 
TIGRFAM ID[TIGR01357] 3-dehydroquinate synthase 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value0.879879 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGATCACCA CCATCCCCGT GGGCCTGGGC GCGCGCGCCT ATGACGTGGT CATCGGAACC 
GGCCTGATCG ACCGGGCGGG CGAGCACATC GCGCCGCTGC TCAAGCGCAA GCGCGTGGCC
ATCGTCACCG ACACCATCGT CGGCGAGCAC CACGGCGAGC GGCTGGCCAA TGCGCTGGAG
CACGCCGGCG TGGCGACCGA CGTGATCCTG GTGCCGCCCG GCGAGGAGAC TAAGAGCTTC
GAGGGCCTGG CCGACCTCAG CGATCGCCTG CTGGCCCTGG GCCTGGAGCG CGGCGACATG
GTCATCGCGT TCGGCGGCGG GGTGGTCGGC GACCTGACCG GCTTCGCGGC GGCGATCTAC
AAGCGCGGGA TCGACTTCAT CCAGATCCCC ACCACCCTGC TGGCCCAGGT GGACTCGTCG
GTGGGCGGAA AGACCGCCAT CGACACCCCG CGCGGCAAGA ACCTGATCGG CGCCTTCCAC
CAGCCGCGCC TTGTGCTGGC CGACCTCGAC ATCCTGGCCA CCCTGCCCGC CCGCGAGCTG
GCCTGCGGCT ATGCCGAGGT CATCAAGTAC GGCCTGCTGG GCGATTTCGC CTTCTTCGAA
TGGCTGGAGG CCAACGTCCA CGCCGTGCTG GAGCGCGACA CCGCCGCCCT GGTGAAGGCC
GTGGGCCGCT CGGTGGAGAT GAAGGCGCAG ATCGTCGCCG AGGATGAGCG GGAGGTCGGC
CGCCGGGCGC TGCTGAACCT GGGTCACACC TTCGGCCACG CGGTCGAGGG CGAGATGGGC
TTTGGCGACG CGCTCAAGCA CGGCGAGGCC GTCGGTCTGG GCATGGCGCA GGCCTTCCGG
TTCTCGGTCC GCCAGGGCCT ATGCTCGGCC CAGGACGCCG CCCGCGCCGA GGCCGCGATC
AAGGCCGCCG GCCTGCCGAC CAAGCTGTCG GACATCCGCC CCGAACCGTT CAGCGCCGAC
GCCCTGATCG CCCACACCGC CCAGGACAAG AAGGCGCAGG GCGGGACCTT GACCTTCGTC
CTGGTCCGCG CGATCGGCGA CGCCTTCGTG GCCAAGGACG TGGACCGGCA AGCACTGCGG
GCGTTCCTGG TGGAGGAAGG CGCGGTTTAA
 
Protein sequence
MITTIPVGLG ARAYDVVIGT GLIDRAGEHI APLLKRKRVA IVTDTIVGEH HGERLANALE 
HAGVATDVIL VPPGEETKSF EGLADLSDRL LALGLERGDM VIAFGGGVVG DLTGFAAAIY
KRGIDFIQIP TTLLAQVDSS VGGKTAIDTP RGKNLIGAFH QPRLVLADLD ILATLPAREL
ACGYAEVIKY GLLGDFAFFE WLEANVHAVL ERDTAALVKA VGRSVEMKAQ IVAEDEREVG
RRALLNLGHT FGHAVEGEMG FGDALKHGEA VGLGMAQAFR FSVRQGLCSA QDAARAEAAI
KAAGLPTKLS DIRPEPFSAD ALIAHTAQDK KAQGGTLTFV LVRAIGDAFV AKDVDRQALR
AFLVEEGAV