Gene Caul_1195 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_1195 
Symbol 
ID5898650 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp1257199 
End bp1259136 
Gene Length1938 bp 
Protein Length645 aa 
Translation table11 
GC content66% 
IMG OID641561678 
Productpeptidase S9 prolyl oligopeptidase 
Protein accessionYP_001682823 
Protein GI167645160 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.430413 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTTGAGAA CCATTCTGGC CGCCGTGGCG GCGAGTTTTG CCATGGCCGG CGCCGCATTG 
GCCGAGGTCA CGCCGCTGTC GGTCTATGGC GGCTTGCCCA ACCTTGAGCA GGTCGAAATC
TCGCCGGACG GCAAGTTCCT GGCGATCGCG GTCACCGACG GCGAAAAGCG GATGCTGGTC
GTGCGCGAGG CCGGCGAGAA CGGTAAGCTG CTGAGCGCCA TGAACTTCGG CGACACCAAA
CTTCGCGCCG TGCAATGGGC GGGTCCAGAG CACGTGCTGA TCACCACCTC AAGCACCGCC
GAGGTCTATG GCCTGAGCGG ACCCAAGCGC GAATACCTGA TGGGCTTCGA CTTCAACCTG
GTCACCAAGA AGCAGATTCC GCTGCTCAAG AACCAGGAAG ACGCGATGAA CGTCATCCTG
GAGACCCCGG ATGTGCGGTT CATTGAAGGC GTGCCTCACG CTTTCGTCGA GGGGATCCAT
TTTCACGACG GCCGGGGCTG GAACACCCTT TACCGGATCA ATCTGAAAAC CGGCGCCACC
CGGATGCTGG ACAAGGGCGG CACGGAGAAC ACCGACGACT GGCTGGTCTC GCCTGAAGGT
CAGCCTCTGG CCCAGTCGAT CTACGACGAA AAGCAGGGCG CCTGGAGCCT GAAGATCAGG
GGGGCGGACG GCTGGACCAC GGTCGAGAAG ACCGTCTCGA AAATGGGCTC CTTCGGGCTG
AGGGGCATGG GGCGCGACGG CCAAAGCGTG GTGGTCTGGG CCTATGACGA AGACACTGAC
AAGACCCTGC TACGCGAATA CGCCCTGGAC GGCAGCCACG TCGACGTGCC GGGCAGCGGC
GACTACGACC GCCCGATCCA TGCGCCGGAC GGGTCGCGCC TGCTGGGCGG CTACAGCCTG
GTGGGCGACG AGAACCGCTA CGCCTTCTTC GACGCCAAGA CCCAGGCCAG CTGGAACGCC
GTGCGCAAGG CCTTCCCAGG CGACCAGGTG TCGCTGGCGT CCTGGTCGGA TGATCGGCGC
AAGGTGGTCG TCCAAGTCGA CTCGCCGACC CTGGGTCCGG CCTTCGCCCT GGTCGATCTC
GACGCCAAGA GCGCGCGCTG GTTGGGCGAG ATCTACCGCG CCCTGACCGC CGACGGCGTC
TCCGAGGTCC GCCCGATCAG GTACAAGGCC GCCGACGGTC TGGAGATCAC CGGCTACCTG
ACCGTGCCGC GCGGCAAGGA CGCCAAGAAC CTGCCGCTGG TGGTGCTGCC GCACGGCGGT
CCGGCGGCCC GCGACAAGCC GGGCTTCGAC TGGTGGTCCC AGGCGCTCGC CTCGCGCGGC
TACGCGGTGC TGCAACCCAA TTTCCGTGGC TCCGACGGCT TTGGCCAAGC CTTCCTCGAA
AAGGGCTATG GCCAGTGGGG CAAGGCCATG CAGACCGACC TGTCGGACGG TGTGCGCCAC
CTGGCCAAGC AGGGCGTGAT CGATCCCAAA AGGGTCTGCA TCGTCGGCGC CAGCTATGGC
GGCTATGCCG CCCTGGCCGG GGCGACGCTG GATCACGGCG TCTATCGCTG CGCCGTCTCG
GTCGCCGGCC CCTCGGAGCT CAAGCGGTTC GTGTTCGACA GCAGCAAGCG CTACGAGACG
GGCCGCAACT CGGCCCAGCG CTACTGGCTG CAGTTCATGG GCGCCGACGG CCTTAAGGAC
CCCGACCTGG CCCTGATCTC GCCGGCCAAG CTGGCCGACA AGGTCGAGAT CCCGATCCTG
TTGATCCATG GCAAGGACGA CACCGTCGTC CCCTACGTCC AGAGCACCCT GATGGCCGAC
GCCCTGAAGA AAGCCGGCAA ACCGGTGGAG TTGGTCAGCC TGGACGGCGA GGATCACTTC
CTGTCGCGCG GCGCCACCCG TCTGCGGATG CTGACCTCGG TGGTCGGCTT CCTCGAAAAG
AACAACCCGC CGAACTGA
 
Protein sequence
MLRTILAAVA ASFAMAGAAL AEVTPLSVYG GLPNLEQVEI SPDGKFLAIA VTDGEKRMLV 
VREAGENGKL LSAMNFGDTK LRAVQWAGPE HVLITTSSTA EVYGLSGPKR EYLMGFDFNL
VTKKQIPLLK NQEDAMNVIL ETPDVRFIEG VPHAFVEGIH FHDGRGWNTL YRINLKTGAT
RMLDKGGTEN TDDWLVSPEG QPLAQSIYDE KQGAWSLKIR GADGWTTVEK TVSKMGSFGL
RGMGRDGQSV VVWAYDEDTD KTLLREYALD GSHVDVPGSG DYDRPIHAPD GSRLLGGYSL
VGDENRYAFF DAKTQASWNA VRKAFPGDQV SLASWSDDRR KVVVQVDSPT LGPAFALVDL
DAKSARWLGE IYRALTADGV SEVRPIRYKA ADGLEITGYL TVPRGKDAKN LPLVVLPHGG
PAARDKPGFD WWSQALASRG YAVLQPNFRG SDGFGQAFLE KGYGQWGKAM QTDLSDGVRH
LAKQGVIDPK RVCIVGASYG GYAALAGATL DHGVYRCAVS VAGPSELKRF VFDSSKRYET
GRNSAQRYWL QFMGADGLKD PDLALISPAK LADKVEIPIL LIHGKDDTVV PYVQSTLMAD
ALKKAGKPVE LVSLDGEDHF LSRGATRLRM LTSVVGFLEK NNPPN