Gene Caul_0731 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_0731 
Symbol 
ID5898185 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp789041 
End bp790090 
Gene Length1050 bp 
Protein Length349 aa 
Translation table11 
GC content68% 
IMG OID641561211 
ProductHpcH/HpaI aldolase 
Protein accessionYP_001682360 
Protein GI167644697 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2301] Citrate lyase beta subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.629928 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGACCC CCAGAGGCTT TTTCAAACCC CTGGCCATCG GAGCCCCCAC GCCCTGGCGC 
GAGCCGCCGG CCCGGGTGGA GCGGATGATC CATTTCGTGC CGCCGCACCT GGACAAGGTC
CGCGCCAAGG TTCCAGAGAT CGCCGCCACG GTCGACGTCA TCCTGGCCAA TCTGGAGGAC
GCCATCCCCG CCGACGCCAA GGGCGCGGCC CTGGCCGGAA CGATCGCCAT GGCGCGCGAG
ACCGACTTCA AGGCCCTGGG CGTGGGCCTG TGGGTGCGGA TCAACTGCCT CAACTCGCCC
TGGCATCTGG ACGAGGTGGC GACCCTGGTC GAGAAGGCGG GCAACCAGAT CGACGTGATC
ATGGTCCCCA AGGTCGAGGG GCCGTGGGAC ATCTTCTACA TGGACCAACT GCTGGCCTCG
CTGGAGGCCA AGCACGGCGT CGTCCGGCCG ATCCTGCTGC ACGCCATCCT GGAGACCGCC
GAAGGGGTGA TGAACGTCGA GCAGATCGCC GGCGCCAGTT CACGCATGCA AGGCATCAGC
CTGGGTCCGG CGGATCTCGC CGCCAGCCGC GCCATGAAGA CCACCCGCGT GGGCGGCGGT
CATCCCGGCT ATCGGGTGAT CGAGGACCCG CACGCTGACG GCTCCCCCCG CGTCTCGGTG
CAGCAGGATC TTTGGCACTA CACCTTCGCC AAGATGGTCG ACGCCTGCGC CGCCCACGGC
ATCAAGCCGT TCTACGGCCC GTTCGGGGCC ATCGACGACC CGGTCGCCTG CGAGCAGCAG
TTCCGCAACG CCTTCCTGAT GGGCTGCGCC GGGGCCTGGA GCCTGCACCC CAGCCAGATC
GAGATCGCCA AGCGGGTGTT CTCGCCGGCC CCCGACGAGG TGATCTTCGC CAAGCGCATC
CTGGAGGCCA TGCCCGACGG CACGGGCGTG GCCATGCTGG ACGGCAAGAT GCAGGACGAT
GCGACCTGGA AGCAGGCCAA GGTCATGGTC GATTGCGCGC GGCAGATCGC GGCCAAGGAT
GCGGAGTATG CGGCGCTGTA TGGGTTTTAG
 
Protein sequence
MKTPRGFFKP LAIGAPTPWR EPPARVERMI HFVPPHLDKV RAKVPEIAAT VDVILANLED 
AIPADAKGAA LAGTIAMARE TDFKALGVGL WVRINCLNSP WHLDEVATLV EKAGNQIDVI
MVPKVEGPWD IFYMDQLLAS LEAKHGVVRP ILLHAILETA EGVMNVEQIA GASSRMQGIS
LGPADLAASR AMKTTRVGGG HPGYRVIEDP HADGSPRVSV QQDLWHYTFA KMVDACAAHG
IKPFYGPFGA IDDPVACEQQ FRNAFLMGCA GAWSLHPSQI EIAKRVFSPA PDEVIFAKRI
LEAMPDGTGV AMLDGKMQDD ATWKQAKVMV DCARQIAAKD AEYAALYGF