Gene Caul_1119 details

Gene Information

Locus tag: Caul_1119
Symbol:
ID: 5898574
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 1184735
End bp: 1186657
Gene Length: 1923 bp
Protein Length: 640 aa
Translation table: 11
GC content: 69%
IMG OID: 641561601
Product: peptidase M61 domain-containing protein
Protein accession: YP_001682747
Protein GI: 167645084
COG category: [R] General function prediction only
COG ID: [COG3975] Predicted protease with the C-terminal PDZ domain
TIGRFAM ID:
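
The start, end, gene length, and protein length listed above agree with one another, which a short sketch can confirm. It assumes 1-based inclusive coordinates (so length is end minus start plus one) and a coding sequence that ends in a single stop codon; the function names are illustrative only and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for a CDS that includes one terminal stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1184735, 1186657)
print(length, protein_length(length))  # 1923 640
```

Under these assumptions the 1923 bp gene corresponds to 641 codons, one of which is the stop codon, giving the listed 640 aa.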


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0159239
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGCCGCGAC CTTTATTCCT GTCCCTTGCC TTGGTTCTGT TGGCCGCCGC CACCGCAGCC 
TTCGCCGAAC CCCGGCCCGC GCCGCCGGCC CCGCCGATCC CGGCCGCCCG CGACGTCGCC
TATCCCGGCG TCATCGACCT GCGGCTCGAC GTCAGCGACA CCACCCGCAA GATCTATCGC
GTCGTCGAGA CCATTCCGGT CCGCCCCGGC CCGCTGGTGC TGTCCCTGCC CAAGTGGATC
CCGGGCGAGC ACTCGCCCAG CGCCCAGATC GCCCTGATGT CGGGCTTCAA GGTCACGGCC
AACGGCAAGC CTCTGGAATG GCGGCGCGAT CCGGTCGAGA TGACCGCGTT CCACCTGGAC
ATCCCGGCCG GTGTCGAGGC CATCGAGGTT TCGCTGCGAC AGCCCACCGC CCGGCCCGAC
GGCCCGGTGC GCATCGCCGT GACGCCCAAC CTGCTGATCG TCAAATGGAC CGCCGTGGCG
CTGTATCCGG CGGGCTACAC TGTCGATCGC ATCCGCGTGC GGCCGTCCCT GACCCTGCCC
AAGGGCTGGC GGCTGGCCAC CGCCCTGGAC GGCGCGGTGG TCGCCGGCGA CACCAGCGCC
TTCCCCGAGA CCGACTTCGA GACCCTGATG GACTCGCCGG TCTATGCCGG CCGCAACCTG
CGAACCTTCG ACCTGGACCC CGGCGGTCGG CGCCCCGTGC GCCTGAACGT CTTCGCCGAC
GCCGCCTCCA GCCTGGCCGC CAGCGACGCC CAGATCGAGA CCCATCGCGA ACTCATCCGC
CAGGTCGACA AGCTGTTCGG CGGCGCGCGC AACTACGACC ACTACGACTT CCTGCTCAGC
CTCAATCCCG ACATCGGCTA TCTGGGCGCC GAGCATCAGC GGTCCAGCGA GAACGGCTAT
AACGTCGCCG GCTATTTCAC CGACTGGGAC AAGGCCTTCA CCGGCCGCGA TATCCTGGCC
CACGAATATG TCCACGCCTG GAACGGCAAG CACCGCCGCC CGGCCGACCT GTGGACGCCG
GACTACACCA CCCCGATGCG CGACAGCCTG CTGTGGGTTT ATGAAGGCCT CACCGAATAC
TGGGGCGACA TGCTGGCGAC GCGTTCGGGC CTGTTCACGC CCGAGCAGAT GCGCCAGCGC
CTGGCGCTGA TCGCCGCCAA CGCCCAGGCC ACGCCGGGCC GCGACTGGCG CTCGCTGCGC
GACACCACCA GCGGCTACAT CATGAACGCC GCCGGCGGCA CGGGCTCGAC CGCCTGGATC
CGCTCGCTGG ACTATTACGA GGAAGGCCAG CTGCTGTGGC TGGACGTCGA CACCCTGATC
CGTGAACGGA CGAACGGGTT GAAATCGCTG GACGACTTCG CCAAGGCGTT CTTCGGCGTC
GATGATGGCG ACATGACCGT GTCGACCTAC ACCTTCGAGG ACGTGGTCGC CGGATTGAAC
GCGGTGACGC CCTACGACTG GGCGGGCTTT CTGAATGCTC GCCTCGACGC CCACGACAAG
GCCCCGCTCG ACGGCCTGGC CCGCGGCGGC TGGACCCTGG CGTTCGGCGA CACGCCGACC
AGCTATTTCA CCGCCTACGA GACGGCCCAG GAAACCCGCC TGTTCACCTT CTCGATCGGT
CTGGACCTGG ACGAGGACGG GACGGTGAAG GAGTCGCTGT GGGACGGCCC GGCGTTCACG
GCGGGCATCG TCGCCGGGGC GAAGATCGTC TCGGTCGGCG GTAAGGCCTA TTCCGCCAAC
CGCCTCGCCT CGGCCATCGC GGCGGCCGCC AAGCCAGGCG CCAGGCCCAT CATCTTGACC
ATCAAGGCCG ACGGGGTCGT CCGCAAGGTG GATGTGCCGT ACCACGGCGG CCCGCGCTAT
CCTCGCCTAG AAAAGCTCGC GGGCGCCCAG GACCGGCTCG GCGCGATACT GGCGCCCAGA
TAG
 
Protein sequence
MPRPLFLSLA LVLLAAATAA FAEPRPAPPA PPIPAARDVA YPGVIDLRLD VSDTTRKIYR 
VVETIPVRPG PLVLSLPKWI PGEHSPSAQI ALMSGFKVTA NGKPLEWRRD PVEMTAFHLD
IPAGVEAIEV SLRQPTARPD GPVRIAVTPN LLIVKWTAVA LYPAGYTVDR IRVRPSLTLP
KGWRLATALD GAVVAGDTSA FPETDFETLM DSPVYAGRNL RTFDLDPGGR RPVRLNVFAD
AASSLAASDA QIETHRELIR QVDKLFGGAR NYDHYDFLLS LNPDIGYLGA EHQRSSENGY
NVAGYFTDWD KAFTGRDILA HEYVHAWNGK HRRPADLWTP DYTTPMRDSL LWVYEGLTEY
WGDMLATRSG LFTPEQMRQR LALIAANAQA TPGRDWRSLR DTTSGYIMNA AGGTGSTAWI
RSLDYYEEGQ LLWLDVDTLI RERTNGLKSL DDFAKAFFGV DDGDMTVSTY TFEDVVAGLN
AVTPYDWAGF LNARLDAHDK APLDGLARGG WTLAFGDTPT SYFTAYETAQ ETRLFTFSIG
LDLDEDGTVK ESLWDGPAFT AGIVAGAKIV SVGGKAYSAN RLASAIAAAA KPGARPIILT
IKADGVVRKV DVPYHGGPRY PRLEKLAGAQ DRLGAILAPR
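
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal sketch. It assumes the sequences are pasted as shown here, in whitespace-separated blocks of ten, and that the elongation codons of translation table 11 map to amino acids exactly as in the standard code (table 11 differs mainly in which start codons it permits). The helper names `clean`, `gc_content`, and `translate` are hypothetical and are not taken from any library.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate from the first base and stop at the first stop codon.

    Alternative start codons (for example GTG or TTG) are not converted to M
    here; this gene starts with ATG, so that case does not arise.
    """
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_row = "ATGCCGCGAC CTTTATTCCT GTCCCTTGCC TTGGTTCTGT TGGCCGCCGC CACCGCAGCC"
print(translate(first_row))  # MPRPLFLSLALVLLAAATAA
```

The first 60 bases of the gene sequence translate to the first 20 residues of the protein sequence listed above. Passing the full gene sequence to `gc_content` gives a value that can be compared with the GC content reported in the Gene Information section.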