Gene Caul_1987 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_1987 
Symbol 
ID5899442 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp2133707 
End bp2136148 
Gene Length2442 bp 
Protein Length813 aa 
Translation table11 
GC content71% 
IMG OID641562476 
Productpeptidase S9 prolyl oligopeptidase 
Protein accessionYP_001683613 
Protein GI167645950 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCCGCT TCGCGCCGCG CCTGCTCGCC TGCGCCCTGA TGGGCGCGGG CGCCGCCCAG 
GCCTCGCCCT TCACGGTCGA TCAGCTGCTG GCCCAGCAAC GCCTGGGTCC GGTCAGCGTC
GATCCCTCGC AACGCTGGCT GGTCGTGCCC ACCACGGGGC CTTACGATTC CGCCCCGCGG
TGGGACCTGG AGGACCTGAC GCCGATGACC ATCACTCGGT TGTCGGTGTT CGACCTGCGC
AAGGGCGGCG CGCCCCGAGT CTTTCCGTCC AACGCGGGGT CGCCGGAGGC CTGGGGGTAC
AACCCCGGGC CCTACGCGCC CTCGGGCGCG AAGATGGCGG TGACGCGGGC GCGGAGCGGG
AGCCTGGAGG TGGGGATCTT GAACCTCACC GACGGTGCGG TAGTCTGGAC AGGGCTTTTC
CCGGTCACCG ACGTGATGGG ACGCTCGCTG CAATGGCGCT CGGACGACGT GCTGCTGGTC
CTGGCCAAGG ACCCGGCGAC CCCGACCACG GCCGGGGTGG TCGGTCCGCC CGCCGAGCAA
CGCCTGATCG ACCGATGGCG GGCGACCCGC GAGAACCGTG TGGGCGTGAC CGTGATGGGC
AGCGGCCGGT ATCTGGACCA GACGCCGGCG CTGCCTCCCA GCCGACTGGT GGCGGTCGAT
GCGACCCGGG GCCTTGGCCA GGTGCTGGCC CAAGGTGATT TCTTCGACCT GGAGGTCGCG
CCGGGCGGGC GGTTCGTGGG CCTGCTGTCG AACGGCCCGC CGCTGGCGCC GGACCCTTCG
GTTCCCGCCA TGACCACCGA CCCTTTCCGC CGTCACCGTC TGACCGTGGT GGATCTGGTG
GCCGGCGGCG CCTGGTCGCC CTGCCCCAGC TGCGACGTCG CCGTGGATCT CCTGGCCTGG
TCGCCTGGCG GCGCGGGCCT GCTGGTCTAC GCGCGGGGCG ACAACGATCC ATGGTCGGCG
GCCCGCTATT GGCGGATCGA CGCCCTGACG CGCCGCGCCG CGCCGCTCCA TGACCGCGAC
ATCACCCCCA TGGCCGAAAC AACCGGCTAT GGCAGCCGTG TCCCCCGCGC CGACTGGATG
AGCGAAACCC CCGTGGTTCT GGGACGGCGC GGCGACGCGT CGCAAGACGG GGCAGACTGG
TTCGCATGGG GCGCCAAGGC CCCCCTCAAT CTCACCGCGG CCCTGCCACC AGGGCCGCGC
CGCCTCGAAG CCACGGGACC GGCGGGAATT GTCGCCTCGC AAGGCGGCCG TCTCTGGCGC
ATCGACCCGC TGGGCCGCGC CACGCTGCTG GGGGCCGGCC GAAGCCTGCA GGGCGCGGGC
CTGCCCGGCG GCGAACGGCT GGTCTTCAAC AATCGGCCCC GCCCCGAAGA CTTGGCGCTG
ATGCTCGACG ACGGCCCGCG CCCCACCCCC GTCGTTCTCG ACAAGGCCCG GCTTCGCCCG
CTCGGTCTGG ACGCTCCCAC CGACGAAACC CTTCTCTTGG TCGCGGGTCA GGCGCGGGCG
GCGGTCACCC TCAAACAGGA CGCGCACGGC GTGGAGACGG TTCTGCTGCG TCGCGCCGGC
CAGGCGCCGC AAGCCTTGGC GACGCTGAAT GCGCATCTGG CGCAGGTGGA CTTTTCGGCG
CCGCGCGCGG TCAGCCACCT CGGCCCTCGG GGCGAGACAC TGACCAGTTG GCTCTACATG
CCCACCACGC CGCCTTCCGG CGCCAAGGTC CCGCTGGTCG TCATTCCCTA TCCAGGCAAG
GTCTATCCCA CCGCCCCGAC CAGCCAGGGT CCGCCCGCGC GCCAGCTTTA TCTCAACCTC
CAGATCCTGG CCGGGGCCGG CTACGCCGTC TTGCTGCCCA GCTTGCCCGT CGATACGCGC
CGCGAGCCGG CCGAAGGTCT CGCCGACCGC ATTCTGGCGG CGGCCGACGC GGCCGCCACT
GTCGAGCCGC GGCTGGATCT CAACCGCATG GGCTTGTGGG GCCATAGCTA TGGGGGATAC
GCCGTGCTCA GCGCCGCCGC CCAAAGCCGC CAGTTCAAGG CGGTGATCGC CGGGGCCTTC
GCCGCCGATC TGGCCAGCCA CTATACCCGC AAGAGCCTGC TGGCGACGGT CGCCCCCGAC
GCCGCTGTCG AGATCATGGT CGGGGCGGGC TGGATGGAAC AGGGCCAGGG ACGGATGGGC
GCGCCGCCCT GGGTCGATCC TGATCGCTAT GTCCGCAACA GCCCCCTGCT CCATGCCGAC
AGGATTACCG CCCCGGTCAT GCTGGTGATG GGCGACCTGG ACAGCGATCC CGGCCAGGCC
CTGACCATGT TCGGCGCCCT GTTCCGCCAG AACAAGGACG CGGTCTCCTT GCAGTATCAT
GGCGAGACCC ACGTGATCAT GACGGCCGCC AATGTCGCCG ATTTCCACCG CCGGCTTCTG
GCTTTCCTGC GCGATAATCT CGGGACCTCC CCAGGATCAT GA
 
Protein sequence
MRRFAPRLLA CALMGAGAAQ ASPFTVDQLL AQQRLGPVSV DPSQRWLVVP TTGPYDSAPR 
WDLEDLTPMT ITRLSVFDLR KGGAPRVFPS NAGSPEAWGY NPGPYAPSGA KMAVTRARSG
SLEVGILNLT DGAVVWTGLF PVTDVMGRSL QWRSDDVLLV LAKDPATPTT AGVVGPPAEQ
RLIDRWRATR ENRVGVTVMG SGRYLDQTPA LPPSRLVAVD ATRGLGQVLA QGDFFDLEVA
PGGRFVGLLS NGPPLAPDPS VPAMTTDPFR RHRLTVVDLV AGGAWSPCPS CDVAVDLLAW
SPGGAGLLVY ARGDNDPWSA ARYWRIDALT RRAAPLHDRD ITPMAETTGY GSRVPRADWM
SETPVVLGRR GDASQDGADW FAWGAKAPLN LTAALPPGPR RLEATGPAGI VASQGGRLWR
IDPLGRATLL GAGRSLQGAG LPGGERLVFN NRPRPEDLAL MLDDGPRPTP VVLDKARLRP
LGLDAPTDET LLLVAGQARA AVTLKQDAHG VETVLLRRAG QAPQALATLN AHLAQVDFSA
PRAVSHLGPR GETLTSWLYM PTTPPSGAKV PLVVIPYPGK VYPTAPTSQG PPARQLYLNL
QILAGAGYAV LLPSLPVDTR REPAEGLADR ILAAADAAAT VEPRLDLNRM GLWGHSYGGY
AVLSAAAQSR QFKAVIAGAF AADLASHYTR KSLLATVAPD AAVEIMVGAG WMEQGQGRMG
APPWVDPDRY VRNSPLLHAD RITAPVMLVM GDLDSDPGQA LTMFGALFRQ NKDAVSLQYH
GETHVIMTAA NVADFHRRLL AFLRDNLGTS PGS