Gene Caul_5081 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_5081 
Symbol 
ID5897307 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010335 
Strand
Start bp1415 
End bp2725 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content72% 
IMG OID641555184 
Productpeptidase S10 serine carboxypeptidase 
Protein accessionYP_001676515 
Protein GI167621730 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2939] Carboxypeptidase C (cathepsin A) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000000372495 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0461552 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGCCTACC GGGCCTTCGT TGACCAGATC GTCCTGAGCG ATCCGGCCAC CGGCAGGCCC 
CAGGGGCAGA TCACCTTCAC GGCCTATCGG GTCGCCGCCA AGGGTCCGCC GCGCCCCATC
GCCTTCATCT GGAACGGGGG TCCGGGCGCG CCTTCGACCC TGCTGCACTT CCAGGCCTTT
GGGCCCAAGC GCCTGGAAGA CGGGCGCCTG AGCGACAACG GCGACACCCT GCTGGACCGT
ATGGACCTGG TCTTCGTCGA TCCGATCGGC ACGGGGTTTA GCCGCGCGGT CTCGCCGGAG
GCGGCAATCG GGTTCTACGG AACACGCGGC GATTTCGAGG CGACGGCCCG TTTCGTCCGC
CAGTGGCGCG CGCAATACGC CGACCCCCGG GCTGCGGTGT TCCTGGTCGG CGAAAGCTTT
GGCGTCTGGC GCGCGGCCGG CGCGGCCGAA TTGCTGGCCA AGGCCGGACA GGCGCCGGCG
GGACTGGTGC TGATTTCCGG CGGCGCCGGG GTCGGCGCGG CCGGCGATCC CCCCGCCCTG
GCCGCGGCCC TGCGCGTCCC GGGTCGCGCG GCGGCCGCCT TGCACCACCG ACGGAGTGAT
CCAGCCTTGG GCGCGGATCG CGCAAGCCTA GTCGCGGCCG CCGAAGCCTG GGCCAGGGCC
ACCTATGCGC CGGCCCTGGA GAACGTCGCG CGATTGGATC CCCAAGCCCG CCAGGCCGTA
GCGCGAGACC TCGCGCGCTT CACCGGCTAT CCGCTCGAGC GCATCGACCA GACGACCCTG
GCGCTCACCG CGCGGCAGTA TCGCGAGGGT CTTCTGCTGG ATCAGGGACG CACGCTCTAC
ACCTTCGATA TGCGCCTGAC CGCCGCGCCC AAGGACGAGG CGGATGGATC ACTGGTCGAG
AGCTACTATC GCACGGCGCT GGGCTATGGC GGGCCAGGCC CGTATCTGGG CATGGCCGAC
CCGCCCGCGG CGGGCGAGGC CACGATCAAT AGCCGCTGGC GCTACGACAG CGCCGGTCCG
CCGCGCGGCC AAGCCGGCGA TGGGCCGCCC GGGGCCGAGC CCTGGACCCT GCGCGCCCTG
GCGATCGCGC CAAGGCTGAA GGTCCTGGTG GCCACGGGCC TCTATGACTC GCTCAACAGC
TGCGCGGCGA TGGGGGATCT GGCCCAGCGC CTGGAGCCCG CGCAGGCGGG CGCCTTCACG
TTCCATTGCT ACGCCGGCGG TCACATGATG TACGCCGACG CGCCGCTGCG CGCCCAGCTG
TCCGCCGATG TGAAGGCCTT CGTCGGTCAG GCGATCGGCC CGCGCCCCTA G
 
Protein sequence
MAYRAFVDQI VLSDPATGRP QGQITFTAYR VAAKGPPRPI AFIWNGGPGA PSTLLHFQAF 
GPKRLEDGRL SDNGDTLLDR MDLVFVDPIG TGFSRAVSPE AAIGFYGTRG DFEATARFVR
QWRAQYADPR AAVFLVGESF GVWRAAGAAE LLAKAGQAPA GLVLISGGAG VGAAGDPPAL
AAALRVPGRA AAALHHRRSD PALGADRASL VAAAEAWARA TYAPALENVA RLDPQARQAV
ARDLARFTGY PLERIDQTTL ALTARQYREG LLLDQGRTLY TFDMRLTAAP KDEADGSLVE
SYYRTALGYG GPGPYLGMAD PPAAGEATIN SRWRYDSAGP PRGQAGDGPP GAEPWTLRAL
AIAPRLKVLV ATGLYDSLNS CAAMGDLAQR LEPAQAGAFT FHCYAGGHMM YADAPLRAQL
SADVKAFVGQ AIGPRP