Gene Caul_2242 details


Gene Information

Locus tag: Caul_2242
Symbol:
ID: 5899697
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 2435158
End bp: 2437581
Gene Length: 2424 bp
Protein Length: 807 aa
Translation table: 11
GC content: 70%
IMG OID: 641562733
Product: peptidase S9 prolyl oligopeptidase
Protein accession: YP_001683867
Protein GI: 167646204
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases
TIGRFAM ID:
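
The coordinate and length fields above are consistent with each other, and the relationship can be checked with a short sketch. It assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


length = gene_length(2435158, 2437581)
print(length)                  # 2424
print(protein_length(length))  # 807
```

With the values from this record, 2437581 - 2435158 + 1 gives 2424 bp, which is 808 codons, or 807 amino acids once the stop codon is excluded.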


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.140093
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGACCCAAT TGCTCAAGCG CCGCGCCGCG GCGATCGCCC TGGCATGCCT GGTCGCTGCC 
GCGCCTCCCG CGCTGGCGAC CACCAGACCC TACGCTGTCG AGGATCTCCT TGAATTACGC
ACCCTGGGTC CGGCGATGAT CGACCCCAGC CAGCGATGGC TGGTCGTCGG CACAACCGCG
CCGTGGTCAC AGGCCCCGCG CTACGATCTG GACTGGGCGA CTTTCGAAGC GCTCGGTGAG
CTGACGGTAG TGGATTTGGA GGCGCCCGGT TCCCCTCGGC CGCTGCTGCC GACTGAGCGG
GGGGTCGGCT ACATCGCCGG CCCCTTCTCG CCCTCGGGAG CGAAGATGGT TGTGTTCCGC
CTTCGGGGTC ACAGCCGAGA GATCGGCGTG GTCGTGGTGG CGACCGGGGC GGTTCGGTGG
CTCGGCCTGG AGGTCGAGCC CGCGCCTTGG GGCCGATCGG TGCAATGGCG CGACGATGAG
TCGGTGGTCG CCATCGTAAG GCCTCCAGGC GCGGCCTCCC GAGGCGTCGG CTACGGATGG
CAGGTCCAGG CCCGCCTGGA GGCGCTCTGG GCGCGAGCGG CGCGGGGCGA GGCGGCCGTG
ACCGCCTTGG GCGCGGGCCG TTACGCCACA ATCAACCCGC GCCTGCAGGG CGAGCGGCTG
GTGGCGGTCG CGGTCTCCAC TGGCGCGGTC CGCCCCCTCG TCGAGGGGGA GATTGTCGAT
CTGGAGATCG CCCCAGGCGG CCGCACGGCG GCGATCGTGG TCGAGGGCGA GCCCATCGCC
GTCACGGCGA CCGACACCGT CACGCCCTCC ACGTCCTGGC GCCGCCGGCG ACTGCGATTG
GTTGATTTGG CCAGCGGCGA CGTCCGCGAC CCCTTCGCCC GTGCCGACCT CCTGCCCACC
AGCCTCGTCT GGTCGCCCAG CGGCCGGCGG CTGCTTGCGT TCGCCAGGGC GGACGGCGCG
GATTGGGCCT CCGGATCGCT GCGCGAGATC GCCGTCGACG GCGCGGTCAA GGACCTGGCC
GGCTCGGGTG TCGCGCCCAA GCTGACCACG GCGCGCGATG GCCAGGTTTC GGTGCGGGCG
GGCTTCGTTG GCGAGCGGGC GGCGATCTTC GGCGCGCCGA TCGGCGGCGG GACGGACGTA
ACCGAAGCCT GGCGCGGGTG GGACGGCCTG CCTTTGCCGG TGTCGCAGAG CGTTCGGCTC
GAGGTCAGCG ATCGCGATGG CGCGGTCTTC TCCGGTCCCG AGGGAGTCGT TCGTCTGAAG
GCGAGCGGCG GCCTGCAGCG CCTGGCCTTG CCTGGATCGA GATTGCAGCG CCCGGCCGAA
CCTTCGGTGG GCGTTCGGCC GCTGGTCACG CCAACTCTGG GCGGGCAGCT CTGCTGCGCC
CTGGTCTCTG CCACGAACTT GAGGGTGGGC GGCAAGACCT TGGCCTTGAA GCCGGATGAG
ACCGTTTTGG CCTATGCGCC GACCAAGGGC ATGGTGGTCG TCGAGCAGCG CGCCTCCAGC
GGGGTCTCGA CGGTAGCGCT TCGGACGTCG GCCGGCGATC GCCTCCTGCT GACGCTCAAC
CCAGCGCTGG CGCAGGTCGA TCGGCCTGTA ATCCAGGCGA TCGGGCATCG TGGCCCCCAG
GGCGAGACCT TGCCCAGTTG GCTGTTTCGG CCCGCCGATG CGCCCCCTGA CAAACGGTTG
CCGGTGATCA TCGTGCCTTA TCCTGGCTCC ATCTATCCCG CGCCGCCGGC GATGACGCAG
CCGCAGCATC CGCAATTTTC GGCGAGCATC CAGGCGATGG TGGGTCAGGG CTACGCCGTC
ATCGCGCCCA GCCTGCCCTT GTCGGCGCAG TCGGAGCCCG GCGCGTCCCT CGCCCAGTCA
ATGCTGGACA TCGTCGACAA GGCCGCCGTG AGCGGCGGCG TGGATCCGGA CCGGGTCGCG
ATATGGGGAC AAAGCTTTGG TGGCTACGCC GCTCTGCTGG CGGCCGTCCA GAGCGAGCGC
TTCTCCGCCG TGATCGCTTC GGCGCCGGTC TCTGACCTCG CCAGCTTCTG GGCGGCGGTG
CCGCCCCAGG TTTCCCTGAT CGCCGAGCCC GGCCTGCCGG TGGGCGCCTT GGCCGGTTGG
GCCGAGGCTG GTCAGGGGCG GATGCTGGGT CCGCCCTGGC AGGACCCCGA ACGCTGGCGG
CGCAACAGCC CCCTGTGGTC GGCCCAGCGC GTCAAAGCGC CGGTTCTGCT GATCCAGGGC
GACATCGACG CCGATCCAAC CCAGTCGGCC ATGATGTTCC AAGCCCTGGC GCGCCAGAAC
AAGGATGTCC TCTGGTTGAC CTATCACGGT GAGGGCCACG TGGTGATCGG GCCGGGCAAT
CTGCGTGACC TCTATTCGCG CGCGTTCGCG TTCCTGGCCG ACAGCTTCGC CGCGAAGCGG
GCGACGATAG AAGACGCGCC TTAG
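
The GC content field can be recomputed from a gene sequence laid out as above. The following is a minimal sketch, assuming the sequence is passed as a string in which the spaces and line breaks between the 10-base blocks are only formatting; `gc_content` is an illustrative helper name.

```python
def gc_content(sequence: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence.

    Whitespace (block separators and line breaks) is ignored.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First line of the gene sequence above, used here only as sample input.
first_line = "ATGACCCAAT TGCTCAAGCG CCGCGCCGCG GCGATCGCCC TGGCATGCCT GGTCGCTGCC"
print(f"{gc_content(first_line):.1%}")
```

Passing the full 2424 bp sequence instead of a single line would give the value to compare against the GC content field.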
 
Protein sequence
MTQLLKRRAA AIALACLVAA APPALATTRP YAVEDLLELR TLGPAMIDPS QRWLVVGTTA 
PWSQAPRYDL DWATFEALGE LTVVDLEAPG SPRPLLPTER GVGYIAGPFS PSGAKMVVFR
LRGHSREIGV VVVATGAVRW LGLEVEPAPW GRSVQWRDDE SVVAIVRPPG AASRGVGYGW
QVQARLEALW ARAARGEAAV TALGAGRYAT INPRLQGERL VAVAVSTGAV RPLVEGEIVD
LEIAPGGRTA AIVVEGEPIA VTATDTVTPS TSWRRRRLRL VDLASGDVRD PFARADLLPT
SLVWSPSGRR LLAFARADGA DWASGSLREI AVDGAVKDLA GSGVAPKLTT ARDGQVSVRA
GFVGERAAIF GAPIGGGTDV TEAWRGWDGL PLPVSQSVRL EVSDRDGAVF SGPEGVVRLK
ASGGLQRLAL PGSRLQRPAE PSVGVRPLVT PTLGGQLCCA LVSATNLRVG GKTLALKPDE
TVLAYAPTKG MVVVEQRASS GVSTVALRTS AGDRLLLTLN PALAQVDRPV IQAIGHRGPQ
GETLPSWLFR PADAPPDKRL PVIIVPYPGS IYPAPPAMTQ PQHPQFSASI QAMVGQGYAV
IAPSLPLSAQ SEPGASLAQS MLDIVDKAAV SGGVDPDRVA IWGQSFGGYA ALLAAVQSER
FSAVIASAPV SDLASFWAAV PPQVSLIAEP GLPVGALAGW AEAGQGRMLG PPWQDPERWR
RNSPLWSAQR VKAPVLLIQG DIDADPTQSA MMFQALARQN KDVLWLTYHG EGHVVIGPGN
LRDLYSRAFA FLADSFAAKR ATIEDAP
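
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard code, differing only in the permitted start codons; since this gene begins with ATG, the alternative start codons are not handled. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str, to_stop: bool = True) -> str:
    """Translate a coding sequence codon by codon, ignoring whitespace."""
    bases = "".join(sequence.split()).upper()
    if len(bases) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(bases), 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]
        if amino_acid == "*" and to_stop:
            break  # stop codon ends the protein
        protein.append(amino_acid)
    return "".join(protein)


print(translate("ATGACCCAAT TGCTCAAGCG CCGCGCCGCG"))  # MTQLLKRRAA
print(translate("GCGACGATAG AAGACGCGCC TTAG"))        # ATIEDAP
```

The two sample inputs are the first 30 bases and the last line of the gene sequence; their translations match the start and end of the protein sequence listed above.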