Gene Caul_0184 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_0184 
Symbol 
ID5897896 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp200833 
End bp202662 
Gene Length1830 bp 
Protein Length609 aa 
Translation table11 
GC content68% 
IMG OID641560668 
Productpeptidyl-dipeptidase A 
Protein accessionYP_001681819 
Protein GI167644156 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.984755 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGACCC GATGGCTGGC CGCCTGCGCG GCCGTGGCGC TATGCGCTGG CGTCCTGCCC 
ACGCCCGCCG CCCGCGCCCA GACCACCGCC GCCGCCGCGA CGCCGAGCGC CGACCAGGCC
AAGGCCTTCG TCGACGGGGC CGAAAAGGAC CTGATGGCCA TGGGCGAATA CGCCGCCCGC
ATGGCCTGGG TGCAAGCCAC CTACATCACC GACGACACCA ACTGGCTGAA CGCCAAGATC
GACGCCGAGT CCAACGCCCT ATCGGTCAAG TACGCCAAGG AGGCCGCCCG GTTCGACAAG
ACCGACGTCG ACCCGGTCAC CCGCCGCAAG CTGAAACTTC TGAAGGCCGC CCTGGTGCTG
CCAGCCTCCA GTCGGCCCGG CGCGGCCCAG GAACTGGCCG ACGTGCAGTC GCGGCTGCGG
GCGCTCTATT CCACCGGCAA GGTGACGATC GACGGCGAGA CCCTGACGCT CGACGATCTG
GACGACCGGC TGCGCACCGA GCGCGACCCG GCCAAGATCA AGGCCATGTG GGAGGCCTGG
CACGCCGTGG CCAAGCCGAT GGCCGGCGAC TATCCCAAGC TGGTCCAGCT GGCCAACGAG
GGCTCGGTCG AACTGGGCTA CAAGGACACC GGGGCGCTAT GGCGCTCGTG GTACGACATG
GAGCCCGACG CTTTCGCGAC CAAGACCGAC CAGCTGTGGG CCCAGGTCGC GCCGTTCTAC
AGGAACCTGC ACTGCTATGT GCGCGGACGC CTGAACGCCA AGTATGGCGA CGCGGTGCAG
CCGAAAACGG GCCCGATCCG CGCCGACCTG ACCGGCAACA TGTGGGCCCA GAGCTGGGGC
AACATCTACG ACATCGCCGC GCCAGAGGGA CTGGCCGGAC CGGGCTACGA CCTGACCCAG
TCGCTGGTCG CCAAGGGCTA CGACGCCACC AAGATGATGA AGACCGGCGA GGGCTTCTAC
ACCTCGCTGG GCATGGCCCC GCTGCCGGAG ACGTTCTGGA CGCGGTCGAT GATCGTCCGC
CCCCGCGACC GCGAGGTGGT CTGCCACGCC TCGGCCTGGG ACGTCGACAA CGACCAGGAC
CTTCGGGTCA AGATGTGCAC CCGGGTCAAT GCCGACGACT TCTACACGGT GCACCACGAG
CTGGGGCACA ACTTCTACCA GCGGGCCTAT GCGGGCCAGC CCTACTTGTT CAGGGGCGGC
GCCAATGACG GTTTCCACGA GGCGATCGGC GACTTCGTCG GCCTGTCGGC CCTAACCCCG
ACCTATCTCA AGCAGATCGG CGTGATCGAC ACGGCGCCCG GCGCGGAGGC CGACATCCCC
TATCTGCTGG ACATGGCGAT GGACAAGATC GCCTTCCTGC CGTTCGGCCT GCTGGTCGAC
AAGTGGCGCT GGCAGGTCTT TTCGGGCCAG GTCGACCCGG CCCACTACAA CGAGGCCTGG
TGGAAGCTGC GCACCCAGTA CCAGGGCGTC GCGCCGCCTG GCCCGCGCCC GGCCGACGCC
TTCGACCCCG GGGCCAAGTT CCACGTCGCC GACTCGACGC CCTACACCCG CTACTTCCTG
GCCCAGGTCT ATCAGTTCCA GTTCTACCGC GCCGCCTGCC GCCAGGCCGG CTGGAAGGGT
CCGCTGAACC GCTGCTCGGT CTATGGCGAC AAGGCGGTGG GCGAGACGTT CGAGAAGATG
CTGGCCATGG GCCAGTCCAA GCCCTGGCCC GAGGCGCTGG AGGCCTTCAC TGGCGAGAAG
GACCTGGACG CCAGCGCCAT CGCCGACTAT TTCGCGCCGC TGAACACCTG GCTGATCAAG
CAGAACAAGG GCCAATCCTG CGGCTGGTAG
 
Protein sequence
MKTRWLAACA AVALCAGVLP TPAARAQTTA AAATPSADQA KAFVDGAEKD LMAMGEYAAR 
MAWVQATYIT DDTNWLNAKI DAESNALSVK YAKEAARFDK TDVDPVTRRK LKLLKAALVL
PASSRPGAAQ ELADVQSRLR ALYSTGKVTI DGETLTLDDL DDRLRTERDP AKIKAMWEAW
HAVAKPMAGD YPKLVQLANE GSVELGYKDT GALWRSWYDM EPDAFATKTD QLWAQVAPFY
RNLHCYVRGR LNAKYGDAVQ PKTGPIRADL TGNMWAQSWG NIYDIAAPEG LAGPGYDLTQ
SLVAKGYDAT KMMKTGEGFY TSLGMAPLPE TFWTRSMIVR PRDREVVCHA SAWDVDNDQD
LRVKMCTRVN ADDFYTVHHE LGHNFYQRAY AGQPYLFRGG ANDGFHEAIG DFVGLSALTP
TYLKQIGVID TAPGAEADIP YLLDMAMDKI AFLPFGLLVD KWRWQVFSGQ VDPAHYNEAW
WKLRTQYQGV APPGPRPADA FDPGAKFHVA DSTPYTRYFL AQVYQFQFYR AACRQAGWKG
PLNRCSVYGD KAVGETFEKM LAMGQSKPWP EALEAFTGEK DLDASAIADY FAPLNTWLIK
QNKGQSCGW