Gene Caul_5134 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_5134 
Symbol 
ID5897360 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010335 
Strand
Start bp53506 
End bp54777 
Gene Length1272 bp 
Protein Length423 aa 
Translation table11 
GC content68% 
IMG OID641555237 
Productepocide hydrolase domain-containing protein 
Protein accessionYP_001676568 
Protein GI167621783 
COG category[R] General function prediction only 
COG ID[COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily)  
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGATC TATCCGGCGC CCTGAACGCG CCCAAGCCCA CACGCCGCCG CCTGCTGACC 
AGCGCCGCGG GCCTGACCGT TCTGGCCACC GCGACCCACG GCGTCCAAGC CCTGGCCGCG
CCCGCCACCG AAGCGGTCGC GCCTTTCAAG GTCCAGGTCG ATCCGGACGT CATCGCCGAT
CTGCGACGTC GCTTGACCGC CACGCGGTGG CCCGACGCCG GCGGCGCGGT CGATTGGAGC
CAAGGCGTGC CGCTGGCCAA GGCCAAGGCC CTGACCGAGT ACTGGCGGAC AACCTACGAC
ATGACGCGGC TGGAACGGCG GCTCAACGCC TTCCCGCAGT TTCGCACGGC GATCGACGGC
CTGGGCGTGC ACTTCATCCA CGTCAAATCC AAGCACGCCG ACGCCATGCC GATGATCCTG
ACCCATGGTT GGCCAGGTTC GGTCATCGAG TTCCTGGATG TGATCGATCT GCTGACCGAT
CCCACGGCGC ATGGCGGCTC GGCCGAGGAC GCCTTCCATG TGGTGATCCC CTCGTTGCCC
GGCTACGGCT TCTCCGACAA GCCGGCGGTC CTGGGCTGGG GACTCCCAAA GATCGCCAAG
GCTTGGGACA CCCTGATGAA GCGCCTGGGT TATGGCCGCT ACGTGGCCCA GGGCGGGGAC
TTGGGCGCTG GCGTCGCCAG CTGGATGTCC AAGCAGGCGC CGCAGGGTCT GGCCGCCATT
CACTTGAACC TGCCCATCCT GTTCCCGCCG CCGCCGCCCG GGCCCTCCGG CTACAGCGCC
GAGGAGCAAG CGGCCGTGAG CCAGCTGGTG CGCTATGGCT CTGACCTGTC GGCCTACGCC
GCCATCCAGG GCACCCGCCC GCAGACGCTC GGCTACGGCC TGGCGGACTC GCCGGTCGGC
CAGGCGATGT GGATCTACGA GAAGTTCCAG GCCTGGAGCG ACAACAAGGG CGACCCGGCA
GACGCGATCG CCGTCGACAA GATGCTCGAC GACATCATGC TTTACTGGGT GACCGATACG
GCCGCCTCGG CCGCGCGCCT CTACAAGGAA AGCTTCTTCA CCGACTTCGC CCGCTTCGAG
CTGACCGGGC CGGTCGCTGT GACGATCTTC AAGGGCGACA TCTTCACCCC GCCCAAGAGC
TGGGGCGAGC AGACCTACAA GGGCCTGGCC TACTGGAGCG AGCAGGACAA GGGCGGCCAC
TTCGCCGCTC TGGAGCAACC GCGCGCCTTC GCCGAGGAAG TCCGCAAGGC CTTCAAGCCC
TACCGAGCCT GA
 
Protein sequence
MTDLSGALNA PKPTRRRLLT SAAGLTVLAT ATHGVQALAA PATEAVAPFK VQVDPDVIAD 
LRRRLTATRW PDAGGAVDWS QGVPLAKAKA LTEYWRTTYD MTRLERRLNA FPQFRTAIDG
LGVHFIHVKS KHADAMPMIL THGWPGSVIE FLDVIDLLTD PTAHGGSAED AFHVVIPSLP
GYGFSDKPAV LGWGLPKIAK AWDTLMKRLG YGRYVAQGGD LGAGVASWMS KQAPQGLAAI
HLNLPILFPP PPPGPSGYSA EEQAAVSQLV RYGSDLSAYA AIQGTRPQTL GYGLADSPVG
QAMWIYEKFQ AWSDNKGDPA DAIAVDKMLD DIMLYWVTDT AASAARLYKE SFFTDFARFE
LTGPVAVTIF KGDIFTPPKS WGEQTYKGLA YWSEQDKGGH FAALEQPRAF AEEVRKAFKP
YRA