Gene Caul_2447 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_2447 
Symbol 
ID5899902 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp2669859 
End bp2670986 
Gene Length1128 bp 
Protein Length375 aa 
Translation table11 
GC content67% 
IMG OID641562938 
Productpatatin 
Protein accessionYP_001684072 
Protein GI167646409 
COG category[R] General function prediction only 
COG ID[COG1752] Predicted esterase of the alpha-beta hydrolase superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.191371 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.648479 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCGAAGT CTTGGATGAA GCAGCTTGGC CGGACCGATG CGCCGCCACC CGGCGTCGCC 
GCGGAACCGG CCGGGTTTCC AAAAGTGCGT TTCATCAATG GCGCGGCGCA GGATGATCCG
GTTTGGTTGT TCGGCGGCTT GTCGCCGGGC GACCCGTCGG CGCCCGCGCA AAACCTCAAC
GTCCTGGCGC TCTCCGGCGG CGGGGCTGGC GGGGCGTTCG GGGCCGGCGC CCTGGTCGGC
CTCACCGAGA CTGGAACGCG TCCGATCTTC GACTTGGTGA CTGGCGTCAG CACCGGAGCG
TTCATCGCGC CGTTCGCCTT CCTCGGATCG ACCTGGGACC ATCGCCTGGC CGACGCCTAT
TGCGATGGGC ACGCCGCCGA CCTCCTGGCG CTCAAGGGCC TAAGGCCAGG AGCCAGCCTG
TTCGGCGCCG AGCCCCTCAC GAACCTCGTC GAACGCCACA TCGACGCGCC GTTGCTGGAG
GCCGTCGGCG CGGCCCACCT CGCTGGGCGG CGCCTCTTTG TCGCCACCGC CAATCTCGAC
ACCGAGGCCA CATCGATCTG GGACATGGGC GCGATCGCCA GCCAGGGCGG CGAGGCGGGT
CTGACCCTGT TTCGCGACAT CCTGGTGGCC TCGGCGAGTC TTCCGGGACT GTTCCCGCCC
AAGATGATCG CGGTGGAGAG CGAGGGGCGC CGCTATGAAG AAATGCATGT GGATGGCGGC
ACGATCAGTC CGCTGTTCGT GACGCCAGAA CCCCTGACAT TTGCGCGCCC GTCAGGGTGG
TCAGACCGGG CCGTCGATGT CTATGCCTTG GTCAACACCA CGCTCAATGG CGGGGCGACG
ACAACGTCCA TGAACGTGAT TCCCATCCTG ATGCGCAGCT TCGAGCTGAT GCTCAAGACC
TCGTATCGCA ACGCTCTGAG GACCGTGGCC GCCTTTTGCG AGATCAATGG CTTCGCGCTC
CACACCGCCT GCATACCCGC TGAACTTGGC GGGGTCAGCA TGCTGCGCTT CGAAGAGCCG
GCGATGATCG ACATGTTCGA GCGTGGCGTT CGGGCCGCCC GCGAGGGCCA GCTATGGTCG
ACCGTGGCCG CCCCCGCCGA GTCCTCCCGG CCGGCGGCGG CGTCCTGA
 
Protein sequence
MSKSWMKQLG RTDAPPPGVA AEPAGFPKVR FINGAAQDDP VWLFGGLSPG DPSAPAQNLN 
VLALSGGGAG GAFGAGALVG LTETGTRPIF DLVTGVSTGA FIAPFAFLGS TWDHRLADAY
CDGHAADLLA LKGLRPGASL FGAEPLTNLV ERHIDAPLLE AVGAAHLAGR RLFVATANLD
TEATSIWDMG AIASQGGEAG LTLFRDILVA SASLPGLFPP KMIAVESEGR RYEEMHVDGG
TISPLFVTPE PLTFARPSGW SDRAVDVYAL VNTTLNGGAT TTSMNVIPIL MRSFELMLKT
SYRNALRTVA AFCEINGFAL HTACIPAELG GVSMLRFEEP AMIDMFERGV RAAREGQLWS
TVAAPAESSR PAAAS