Gene Caul_1962 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_1962 
Symbol 
ID5899417 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp2104660 
End bp2105871 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content67% 
IMG OID641562452 
Productpatatin 
Protein accessionYP_001683589 
Protein GI167645926 
COG category[R] General function prediction only 
COG ID[COG1752] Predicted esterase of the alpha-beta hydrolase superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.308138 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.105403 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATGGA CATCGACTTG GCTCGCCCTA ACGCTGGCGA CGGTGTCGCT GGGGTTGGCG 
GCCTGCGGAA CGATCTCTCG ACCGGAGGAT GGCGTCCTGC AGACGCCGGT CCCGCTGAGG
GCGGTGACCG ATCCGCGCAT CAACGCCAGA GACAGCGTGC GTCTGCGGGC TCTCGAGGGC
GAGATTGTCG GCCGGATGTC GGCCTCTGGA GACGCGTCGA TCCTGTCGAT TTCGGGAGGC
GGCGCCAACG GGGCCTACGG CGCGGGCGTC ATCGTTGGCT GGACGAAGGC GGGCGATCGA
CCCTCATTTC CCATTGTCAC CGGTGTGAGC ACGGGCGCAT TGACGGCGCC CTTCGCCTTC
CTGGGCCCTG ACTGGGACGA CGAGCTCGCA GCAGCCTATG CTGGCGGACA AGCCCATCAG
CTCCTGAACT GGCGGCGCTT GGCCGCGCTG GTGGCGCCCA GCCTGTACAG CCCGACCACC
CTGCGCGACT TGATCCAGCA CAGCGTGACG CCGCAGATGT TGTCGCAGAT CGCCGCCGAG
CACGCAAAGG GACGGCGTTT GCTGGTGGTC ACCACCAATC TCGACACGGA AGAGACCATC
ATCTGGGACA TGGGCCTGAT CGCCACTCAA GGCGGCCCCC AGGGTCTTCG CCTTTTTCGC
GATGTGCTGC TGGCGTCGGC GAGCATTCCG GGGGTTTTTC CGCCGGTGAT CATCGGCGCT
CGGTCGTCGG ACGGCCGCGT GGTCGGCGAG ATGCATGTCG ACGGCGGCGT CAACACGCCC
TTCCTCGCCG TGCCCGAGGG TCTCCTGCTG TGGACCGCGC CAAGCTCGCT GGCCACCGGT
AGCGGCCTCT ATGTCCTGGT CAACAGCAAG GTCGCGCCTG ACCGGCAGAT CACCCGCGGG
CGCTTGCCTG ATATTCTCAG GCGCAGCTAC GACAGCGGCA GCAAGGCGTC GCTTCGCGCC
CACTTGGCCG TCAACGTCGC CTTCGCCAAA CGCAACGGCA TGGCGATCTA CGTGGCGTCG
ATACCCAGCG ATCTGCAGGC CAGCAGCCTC GATTTCAACC AGAACGCCAT GCGCGCCTTG
TTCGAGGCCG GCCGCAACAG CGGGATGTCC GGGCAAGCTT GGCGCTCGGT CGCCAATCTC
GCAGAGCCTT CATCGCCGTC GCCATCGGCG CCGGGACCGT CAGCGACGCC ACCCGCCCGC
GCCGTCCCCT GA
 
Protein sequence
MKWTSTWLAL TLATVSLGLA ACGTISRPED GVLQTPVPLR AVTDPRINAR DSVRLRALEG 
EIVGRMSASG DASILSISGG GANGAYGAGV IVGWTKAGDR PSFPIVTGVS TGALTAPFAF
LGPDWDDELA AAYAGGQAHQ LLNWRRLAAL VAPSLYSPTT LRDLIQHSVT PQMLSQIAAE
HAKGRRLLVV TTNLDTEETI IWDMGLIATQ GGPQGLRLFR DVLLASASIP GVFPPVIIGA
RSSDGRVVGE MHVDGGVNTP FLAVPEGLLL WTAPSSLATG SGLYVLVNSK VAPDRQITRG
RLPDILRRSY DSGSKASLRA HLAVNVAFAK RNGMAIYVAS IPSDLQASSL DFNQNAMRAL
FEAGRNSGMS GQAWRSVANL AEPSSPSPSA PGPSATPPAR AVP