Gene Caul_3900 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3900 
Symbol 
ID5901362 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp4219589 
End bp4220530 
Gene Length942 bp 
Protein Length313 aa 
Translation table11 
GC content72% 
IMG OID641564421 
Productheat shock protein DnaJ domain-containing protein 
Protein accessionYP_001685523 
Protein GI167647860 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0484] DnaJ-class molecular chaperone with C-terminal Zn finger domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.195824 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGCGCGCG ACCCGTATCT GGAGCTTGGC GTTTCCCGCA CCGCGAGCGC GGCGGAAATC 
CGCAAGGCGT TCCACAAGCT CGCCAAGCAG CATCACCCCG ACGCCAACAA GGGCGACAAG
AAGTCCGAGG AGCGCTTCAA GCAGGTCAGC GCCGCCTTCG ACATCCTGGG CGACGCCGAC
AAGCGCAAGA AGTTCGACGC CGGCGAGATC GACGCCGACG GCCGCGAGAC CATGCGGGCC
GGCGGGTTCG GCGGCGGCGG CTCGCCGTTT GGCGGCGGCT TCAACCGCAG CGGCGGCTTT
GGACGCGGGG GCGGCGCGGC CGAGGGACCC GAGATCGACC TCAACGACCT GTTCGGCGAC
ATCCTGGGCC GCAATCGCGG CGCGGGGGCG GGCGCTGGAG GCTTTGGCGG CGGGTTCTCG
CCCAAGGGCG CCGACGTGCG GGCCCGCCTC GACATCGACC TGGAAGAGTC GATCAAGGGC
GGCAAGAAGC GGGTGGCCTT CTCCGACGGC CGCACCATCG ACGTCACCAT CCCGGCCGGC
GCCCAGGAAG GCCAGACGCT TCGCTTGAAG GGACAAGGCA GCCCGGGCCG GGGCGGGCAG
GGCGACGCCC TGATCGAGCT GGCGATCAAG CCGCACGCGA TCTATCGCCG TGAGAACGAC
ACCCTGGTCA TGGACCTGCC GATCTCGGTG CCCGACGCCG TGCTGGGCGG CAAGGTCGAG
GCCCCCACGC CCGACGGCCC GGTGACCCTG TCGATCCCCA AGGGCTCCAA CAGCGGCGCC
AGGCTGCGGC TCAAGGGCCG GGGCCTGTCC GACGGCAAGG GCCACCGCGG CGACCTGTTC
GCCCGGCTGG TGGTGACCCT GCCCGACGCG CCAGACACCG AGCTGGAGGC GTTCGCCGAC
ACCTGGCGCA AGGACCGGCC GTACGCGCCG AAGCGGCGGT AG
 
Protein sequence
MARDPYLELG VSRTASAAEI RKAFHKLAKQ HHPDANKGDK KSEERFKQVS AAFDILGDAD 
KRKKFDAGEI DADGRETMRA GGFGGGGSPF GGGFNRSGGF GRGGGAAEGP EIDLNDLFGD
ILGRNRGAGA GAGGFGGGFS PKGADVRARL DIDLEESIKG GKKRVAFSDG RTIDVTIPAG
AQEGQTLRLK GQGSPGRGGQ GDALIELAIK PHAIYRREND TLVMDLPISV PDAVLGGKVE
APTPDGPVTL SIPKGSNSGA RLRLKGRGLS DGKGHRGDLF ARLVVTLPDA PDTELEAFAD
TWRKDRPYAP KRR