Gene Caul_4358 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_4358 
Symbol 
ID5901819 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp4736427 
End bp4737512 
Gene Length1086 bp 
Protein Length361 aa 
Translation table11 
GC content68% 
IMG OID641564876 
Productproline iminopeptidase 
Protein accessionYP_001685976 
Protein GI167648313 
COG category[R] General function prediction only 
COG ID[COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily)  
TIGRFAM ID[TIGR01249] proline iminopeptidase, Neisseria-type subfamily 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.0513684 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.622811 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACTTTT CCAGACTCCA GGCCACCAGC AAGGTCACGA GCGAGTGGAG GTACCCGCAA 
GCGGAGCCTA ACCGGACCGG AATGCTGAAG GTCGATCAGG CGCCTGACCA CACGCTCTAC
TGGGAGGAAT ACGGCGCGCC AGACGGCGAG CCGGTGATGT TCCTGCATGG CGGACCGGGC
GGAGCCTGCG CGCCGGTGAT GGCGCGGTTC TTCGACCCGG CGCGCTACCG CGTGATCCTG
TTCGACCAGC GCGGCTGCGG CAAGAGCACG CCGACCGTGG CCTCGCATGG CCCGGCCGTC
GCCCTGGTCC GCAACGACAC CGACCACCTA GTGGCCGACA TCAACCGCCT GCGCGAGGCG
CTGAACATCA CCGGCAAGAT GCACGTGTTC GGCGGCAGCT GGGGCAGCAC CCTGGCGCTC
GTCTACGCCA TCCGCCATCC CGAGCACGTC GCCTCGCTGA TCCTGCGCGG CATCTTCCTG
GGGACGCGCG AGGACCTGCT TTACATGTAC CAGGGCAACG CGGCGGTGTT CGACAAGACG
CCCTACGCCC TGAGCGAGCC GGGCGCCTAT GTGACCTATC CCGACGAGTG GAAGGCCTTC
GTCGAGGTCA TACCGGCCGA CAGGCGCGGC GACATAATGG GCGCCTACAA GGCGATCTTC
GACGGCAAGC CCGACGACGC GGCCGGGCGC GAGGCCCAAC TTCAAGCCGC CCTGGCCTGG
TCGGTCTGGG AGGGCGCGAT CTCGAACATG ATCCCCGAGC AGGGCGATCC GGGGAAGTTC
GGCGAGGCCG ATTTCGCCCT GTGCTTCGCC CAGATCGAGG CCCACTTCTT CGCCAACAAT
CTGTTCCTGG AGCCGGACGA GATCACGCGC GACATCGCGC GGATCGCCAA GCTGCCGATC
CACATCGTCC ACGGCCGCTT CGACCAGGTT TGCCCGCTGA CCCAGGCCTC GCGTCTGGTG
GCGGCCCTAG CGGCGGTGGG CGCTACGCCG GCTAGCTATG TGCGCACCAA CGCCGGCCAC
AGCGCCATGG AGGCTCAGAC GGTGCTGGCC CTGACGGCGA TCATGGACGG GTTGCCGAGG
CTCTGA
 
Protein sequence
MDFSRLQATS KVTSEWRYPQ AEPNRTGMLK VDQAPDHTLY WEEYGAPDGE PVMFLHGGPG 
GACAPVMARF FDPARYRVIL FDQRGCGKST PTVASHGPAV ALVRNDTDHL VADINRLREA
LNITGKMHVF GGSWGSTLAL VYAIRHPEHV ASLILRGIFL GTREDLLYMY QGNAAVFDKT
PYALSEPGAY VTYPDEWKAF VEVIPADRRG DIMGAYKAIF DGKPDDAAGR EAQLQAALAW
SVWEGAISNM IPEQGDPGKF GEADFALCFA QIEAHFFANN LFLEPDEITR DIARIAKLPI
HIVHGRFDQV CPLTQASRLV AALAAVGATP ASYVRTNAGH SAMEAQTVLA LTAIMDGLPR
L