Gene Caul_5080 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_5080 
Symbol 
ID5897428 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010335 
Strand
Start bp184 
End bp1437 
Gene Length1254 bp 
Protein Length417 aa 
Translation table11 
GC content68% 
IMG OID641555183 
Productpeptidase S10 serine carboxypeptidase 
Protein accessionYP_001676514 
Protein GI167621729 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2939] Carboxypeptidase C (cathepsin A) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000000112364 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0461552 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGAGAC CCTGGCGCGC CGGCGCGGCG ATCATGGCCC TGGCGCTTTG TAGCGGCCTG 
GCCTGGGCTG GCGAGCCGAA CAAGCCCGGC GTGGTGGAGG CCGCTGGCCA AACGGTGCGC
TATGTCGCGC GCCTGGACGA AACGGTGCTG ACCAAGGCCG ACGGCGCGCC CGAAGCACGC
CTCATCACGC CAGTCTACGA GCGCGCCGGC CCGCGCGCGC GCTCCCGCCC CGTTCTCTTT
GCCTTCAACG GCGGGCCCGG TTCCTCAAGC GTCTATCTGC ACCTGGGCGT TCTGGGCCCA
CGACGTCTCG ATCTGCCAGG CGACGCCTCG CCGCCCGCCC CGCCCTACCG AATGGTCGCA
AACGCCGAGA CGGTGCTGGA CACCGCGGAT CTGGTGCTGA TCGATCCGGT CGGGACGGGC
TTGAGCCGCC TGCTCGATCC AGGCCAGCAT GCGGCCTACT ACTCCGTGGA CGGCGAGGGC
CGCTATCTGG CGCGTTTCAT TCGCCAGTGG TTGGCCCAGC ATGGCAGGAC CGACGCGCCG
GTGTTTATCC TGGGCGAGAG CTACGGGGCC ATGCGAGCGG TGGCCATCAC CAAGCACCTG
ATCCTGGATC CCGGACCGAC GGTTGATCTT CGGGGGCTGA TGCTGGTGTC GCAAAGCATC
GGCGTGCATG AGACGGTTCA ACGTCGCGGA AACCTTGTGG GCCAGGCCGT AGCGCTGCCC
ACATTGGGCG CGATCGCCTG GTATCATCAT CGCGCCCAGA CCGAGGGGCT GGACCTGCAG
GCCTTTCTCG ACAAGGTCCA GGCCTTCGCC ACCGACGACT ACCTACCTGC GCTCTATGCC
GGCAATGGGC TGCCTGCGGC CCGCCGCGAA GCGATCGCCG ACGTGCTTGC CCGCTTGACG
GGCGTCTCGG CCAAGACCTG GCTGGCCAAT GACCTGCAGA TCTCCAAGGA AGCCTATCGG
CGCCTGATCC TGGCTGACCA GGGCCAGCAG GTTGGCCGCA ACGACGCACG GTTCACCGGG
CCGCTCGGTG GCGAGGACCC GTCTACAAAA GGACTGACCC AGGCGCACGC CTGGGCCATC
GACCAGGTGC TTGGCCAGGA GTTTGGCGCC AGCGCCAAAG ATTATCGCGT CAGTGACGAT
CCCCAGCCCA GCCGATGGAT CTACGCGCGC GACCAAGGCC AGCGCCAAGC CGGCGCAGGC
GATCGCGTCG ATCCGGCTTG GGGCGCGGTC TTTGGCCTAC CGGGCCTTCG TTGA
 
Protein sequence
MTRPWRAGAA IMALALCSGL AWAGEPNKPG VVEAAGQTVR YVARLDETVL TKADGAPEAR 
LITPVYERAG PRARSRPVLF AFNGGPGSSS VYLHLGVLGP RRLDLPGDAS PPAPPYRMVA
NAETVLDTAD LVLIDPVGTG LSRLLDPGQH AAYYSVDGEG RYLARFIRQW LAQHGRTDAP
VFILGESYGA MRAVAITKHL ILDPGPTVDL RGLMLVSQSI GVHETVQRRG NLVGQAVALP
TLGAIAWYHH RAQTEGLDLQ AFLDKVQAFA TDDYLPALYA GNGLPAARRE AIADVLARLT
GVSAKTWLAN DLQISKEAYR RLILADQGQQ VGRNDARFTG PLGGEDPSTK GLTQAHAWAI
DQVLGQEFGA SAKDYRVSDD PQPSRWIYAR DQGQRQAGAG DRVDPAWGAV FGLPGLR