Gene Caul_2159 details

Gene Information

Locus tag: Caul_2159
Symbol:
ID: 5902553
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 2342277
End bp: 2343254
Gene Length: 978 bp
Protein Length: 325 aa
Translation table: 11
GC content: 70%
IMG OID: 641562650
Product: PDZ/DHR/GLGF domain-containing protein
Protein accession: YP_001683785
Protein GI: 167646122
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain
TIGRFAM ID:
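
The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, not part of the original record: the helper names (gene_length, protein_length, gc_percent) are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive positions and that the gene length includes the stop codon. Both assumptions are consistent with the values listed for Caul_2159.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases; whitespace in the sequence is ignored."""
    bases = "".join(seq.split()).upper()
    return 100.0 * (bases.count("G") + bases.count("C")) / len(bases)


# Coordinates taken from the record above.
length = gene_length(2342277, 2343254)
print(length)                  # 978
print(protein_length(length))  # 325
```

gc_percent accepts the gene sequence exactly as it is printed in the Sequence section, including the spaces and line breaks between 10-base blocks.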


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0222521
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGGCTTCAC CGTTTCAAGG TCATGAGGTC GAGGCGCGAC TGCGGCCCGC CGCCAGCGGT 
TACGCCTTCG ACCTGGACCA CGCACTGTCC GCCGTCGTCG CGCTGGAGGC GCGGGTTCCC
GCCGACGCCT TCACCGCCGG GATCCTCGGC ACCGAGCGGC TGGGCAATGG CGTGGTGATC
AGCGAGAACG GCCTGGTGCT GACCATGGGC TACCTGATCA CCGAGGCCAG CCAAGTGGTG
CTGACGCTGA ACGACGGCGC GCGGGTGCAC GCCCACGTCC TGGGGTTCGA CTCGCAGACG
GGCCTGGGCC TGGTGCAGGC GCTGGAGCCC CTGGGCCTGC CGCCGCTGCA CCTGGGGTCT
TCGAAGGACC TGAGGGCGGA AAGCCCGGTC ATCATCGCCG GCGCGGGCGG GCGGGCGCAC
GCGGCCGCCG GCCAGGTGCT GGCGCGCATG CCCTTCGCCG GCTACTGGGA ATATCTGCTG
GACGACGCGA TCATCACCGA ACCGGCTCAC CCGCACTGGA GCGGCGCGGC GCTGATCGAT
TCGACGGGAA ACCTCGTCGG CGTGGGCTCG CTCAGCCTGG CGGGACAGTC GCGCGGCGGC
CAGGCCAAGC CCATGAACAT GTTCGTTCCC GCCGACCTCC TGCCGCCAAT CCTGGACGAT
CTGGCGCGCG GCCGGCCGGC CCATCCGCCT CGCCCCTGGC TGGGGGTCTT CGCCCAGGAG
ACGGATTCCC ACGTCATCGT GGTCGGCGTT TCGCCCAGTA GTCCGGCGGC CCGCGCCGAG
CTTCGAGCCG GCGACCTGAT CCTCGCCGTC GCGGGCGAGC CCGTTTCCGA CCTCGCCGAA
TTCTACACGG GCCTCTGGGA TCAGGGCCTG GCGGGCGCGA CCATCCCGCT ACGCATCCTG
CGCGAACAGG ACGTGTTCGA GGTCGAGGTG CGCTCGGTGG ACCGCAACAC GCTGTTGAAG
AAGCCTCGGT TCAATTAG
 
Protein sequence
MASPFQGHEV EARLRPAASG YAFDLDHALS AVVALEARVP ADAFTAGILG TERLGNGVVI 
SENGLVLTMG YLITEASQVV LTLNDGARVH AHVLGFDSQT GLGLVQALEP LGLPPLHLGS
SKDLRAESPV IIAGAGGRAH AAAGQVLARM PFAGYWEYLL DDAIITEPAH PHWSGAALID
STGNLVGVGS LSLAGQSRGG QAKPMNMFVP ADLLPPILDD LARGRPAHPP RPWLGVFAQE
TDSHVIVVGV SPSSPAARAE LRAGDLILAV AGEPVSDLAE FYTGLWDQGL AGATIPLRIL
REQDVFEVEV RSVDRNTLLK KPRFN