Gene Caul_2371 details

Gene Information

Locus tag: Caul_2371
Symbol:
ID: 5899826
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 2574026
End bp: 2576098
Gene Length: 2073 bp
Protein Length: 690 aa
Translation table: 11
GC content: 66%
IMG OID: 641562862
Product: prolyl oligopeptidase
Protein accession: YP_001683996
Protein GI: 167646333
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1505] Serine proteases of the peptidase family S9A
TIGRFAM ID:
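
The start, end, gene length, and protein length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the gene length counts the terminal stop codon; both assumptions are consistent with the values in this record but are not stated by it. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # drop the stop codon


# Values copied from the record above.
length_bp = gene_length(2574026, 2576098)
print(length_bp)                  # 2073
print(protein_length(length_bp))  # 690
```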


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACGCCC ATTCCCAAGA AGAACCTCTC AGCCCTGGCT ATCCGCCCGC GGAGGCGCGT 
CCGGAAGCGG TCGCGTTCGG CGACGGCGCC GTGCGCTATG TCGATCCGTT TCGCTGGCTG
GAGGAGGGGG ATCGCGACGT CCTCGCCTGG CAACGCGCTC AGGACGACCT GGCGCGCGCC
CATCTCGCTA GCCTGCCGCG CTATCAGGAG TTTCTCGCGG CCGTCGAGTC GATGGGCGCG
TCCGAGGACT TCATGCTCCC GGCCTTCGTC GGCCATCGCT ATGTCCACAG GTTCGTCCCC
GAAGGCGAGG ACCTGGCGGT GGTCGAACTG TCGGAGACCC CCACTGGCCC TCGCCGGCGG
GTCGTCGATC TGAACGCGAT CCAAGCCGAC GAACCGTTGC AGATGGCGGC CTACGCCCTG
TCTCACAGCG GCAAGCTGGC GGTCGTCGCC TGGACCGCCG GCGGGCACGA AAAGCCGATC
GTTCAACTCG TGGAGGTCGA AACCGGCAGG GTCCTGTCGC AAGGGCTGCC GAGCGAACGG
CTCGGCCCTT TTACCTGGCT GCCAGACGAT AGCGGGTTCT TCTACATGAG CCTGGATCCG
GCGGATATCA GCGCCGGAAA GACCTTGTTC CGAGTGATGG TCGAGGACCC GAAGGTGGCA
AGGCTCGAGG CCGTGGAGCC CAGCCACTTC CACGTGCGCC CGGTCGTGGG CGGTGACCAA
CGTCATGTCC TGCTCTTCGT GAACCATCTC GCGCCCAGGC CAGAATACAT TCTCGACACT
CAGAGCAAGG GGGCCTGGCG ACCTTTCCTG AAGGACGTCG AGGGCATCTT TCGCGGCGAT
ATCGTCGGGG ACCGCTTCTT CGCGATCACC GACGACGGCG CGCCACGCGG AAGGCTGGTG
TCGATCCCGC TCGCCACGCC GACCCGGCGC GAGACGTGGA AGGAGATCAT CCCGCCCTCA
GACAGCGTGC TCGCCAACGT TCTGTCGGTC GGGGATCGCG TTGTCGTCCT CGACTATGTC
GACACCTATT CGCGCCTTCG CGTGTTTTGC AGGGACGGTC AGCTGGAGGG CGAGATCGCG
TTGCCGGGCC AGGGACTGAT CAATCGAACG GGCAGCTTCT ACTCGTTCTT CAATGTCACC
AATACGATGT TGCGAGGCCA GGGCGAGCAG ATCGATTTCC TGTTCGCGTC GCCCACGACC
TCGCCCGCCT ACTACACGGC CGACGTGAAG ACGCGGAAAC TGACCCAGGT CACGCCGTGC
GAACGAACGC TCGACGCCCA GGTGCTGGAT CGCAGGGTGC TGAGCAAGGA CGGCGCCGAG
GTCCTCTATC ATGTGGTCGC ACGCAAGGAC CTGGACCTGT CGGAACCGCA TCCGACGGTG
ATCACCGGCT ACGGCGGCTA TAACGTGGCC GTGCTCCCGG GTTGGTTCGG CAACCGGTGG
GCGGCGTGGA TTGAGGCCGG CGGGATCTTC GTCCTCAGCC ACCTGCGCGG TGGCGGCGAG
TTCGGAACGC CCTGGTGGAA GCAGGGGCGT CTCGAGCACA AGCAGAACAC CTTCGACGAC
CTCTACGCGA CGGCGGAGGA TCTGATCGCC CACGGGATCA CGACGCCCTC GCAGCTCGGC
GTCACCGGCG GATCCAACGG CGGGGTGATG GCGGCGGTCG CTGCGGTTCA GCGGCCGGAT
CTGTTTCGCG CCAGTTCCCC CGAAGCGCCG ATCACCGACC TTCTGGCGCG CTCGCGCGAT
CCGTTCACGA TGGCCGCGAC CTTGGACTAT GGCGATCCGT CGGACCCGGT GATGGCCCAG
ATCCTCAAGG GCTGGTCACC CTACCAGAAC ATCAAGGACG CCACCGACTA CCCCGCGATG
CTGATCGACT GCGGCGCGAA CGATCCGCGT TGTCCGCCCT GGCACGGTCG CAAGCTCGCA
GCGCGACTGC AGCAGGCAAG CACCAGCGGC CTGCCCGTTT GGCTGCGCGT GCGCGAGGGC
GCCGGCCACG GCGCGATAGG CGATGAAGAG CTGGCGCGCC AGTCGGCTGA AGTCCTCGCG
TTCTTTGCGA AAAATCTGGG TCTGGCCGGC TGA
 
Protein sequence
MNAHSQEEPL SPGYPPAEAR PEAVAFGDGA VRYVDPFRWL EEGDRDVLAW QRAQDDLARA 
HLASLPRYQE FLAAVESMGA SEDFMLPAFV GHRYVHRFVP EGEDLAVVEL SETPTGPRRR
VVDLNAIQAD EPLQMAAYAL SHSGKLAVVA WTAGGHEKPI VQLVEVETGR VLSQGLPSER
LGPFTWLPDD SGFFYMSLDP ADISAGKTLF RVMVEDPKVA RLEAVEPSHF HVRPVVGGDQ
RHVLLFVNHL APRPEYILDT QSKGAWRPFL KDVEGIFRGD IVGDRFFAIT DDGAPRGRLV
SIPLATPTRR ETWKEIIPPS DSVLANVLSV GDRVVVLDYV DTYSRLRVFC RDGQLEGEIA
LPGQGLINRT GSFYSFFNVT NTMLRGQGEQ IDFLFASPTT SPAYYTADVK TRKLTQVTPC
ERTLDAQVLD RRVLSKDGAE VLYHVVARKD LDLSEPHPTV ITGYGGYNVA VLPGWFGNRW
AAWIEAGGIF VLSHLRGGGE FGTPWWKQGR LEHKQNTFDD LYATAEDLIA HGITTPSQLG
VTGGSNGGVM AAVAAVQRPD LFRASSPEAP ITDLLARSRD PFTMAATLDY GDPSDPVMAQ
ILKGWSPYQN IKDATDYPAM LIDCGANDPR CPPWHGRKLA ARLQQASTSG LPVWLRVREG
AGHGAIGDEE LARQSAEVLA FFAKNLGLAG
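
The sequence blocks above are printed in groups of 10 characters, 60 per line. As a rough sketch of how the gene sequence relates to the protein sequence and to the GC content field, the Python below strips the spacing, computes the percentage of G and C bases, and translates codons until the first stop. It assumes the standard amino acid assignments, which translation table 11 shares with the standard code (table 11 differs in its permitted start codons; this gene begins with ATG, so no special start handling is attempted). The helper names `clean`, `gc_percent`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (standard amino acid assignments).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence above; it encodes the first 20 residues.
first_line = "ATGAACGCCC ATTCCCAAGA AGAACCTCTC AGCCCTGGCT ATCCGCCCGC GGAGGCGCGT"
print(translate(first_line))  # MNAHSQEEPLSPGYPPAEAR
```

Passing the full gene sequence to `translate` and `gc_percent` in the same way would allow a comparison against the protein sequence and the GC content field of this record.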