Gene Synpcc7942_1789 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSynpcc7942_1789 
Symbol 
ID3774364 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSynechococcus elongatus PCC 7942 
KingdomBacteria 
Replicon accessionNC_007604 
Strand
Start bp1860032 
End bp1861012 
Gene Length981 bp 
Protein Length326 aa 
Translation table11 
GC content59% 
IMG OID637800230 
Productheat shock protein DnaJ-like 
Protein accessionYP_400806 
Protein GI81300598 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0484] DnaJ-class molecular chaperone with C-terminal Zn finger domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.0101299 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAGCGA CGGACTTCAA AGACTACTAC GCAACCCTCG GAGTGGGGCG TGCTGCCAGT 
GCCGATGAGA TCAAAAAAGC TTTCCGTAAG CTGGCTCGCC AGTACCACCC CGATATGAAT
CCGGGCGACA AGGTTGCTGA GGCACGCTTT AAGGAAATCA ACGAAGCTTA CGAGGTGCTC
TCCGACACCG ATAAGCGCCG CAAGTACGAC CAGTTTGGCC AATATTGGAG CCGAGTTGGT
GGCCCCACAG GTGGCCCAGG GCCTGGGGTC GGCTTCGAGG ACTTTGAGTT TGGCCGCTAT
GGCAGCTTTG ATGACTTCAT CAACGAACTG CTCGGCCGTT TTGGCGGCGG CGCGACGGCT
AGCGCCAGTG CCGGTTATCG CAGTCCTGGT TTTCAGGATT TTGCCGGCGG TTTTGGTAGC
CAGGCCACTG CTGGGGCTCG TGCCGTCAAT TTGGATGCTG AAGCCAGTAT TAGTCTCAGC
CTCAGCGATG CTTTTCGGGG GACGCAAAAG CAGCTCCGCA TCAACAGCGA AATGGTTGAG
GTCAAGGTGC CGGCTGGCAT CAAAGCAGGG AGTAAACTGC GCCTGCGGGG CAAGGGCAAC
ATCATGCCCA ATACGGGCAA GCGCGGCGAT CTCTACCTGA AGATTGAGGT TAAGCCCCAC
GAGTTTTTCC AGCTAGAGGG CGACCAGTTG AGCTGTGAGG TGCCGATCGC ACCGGATGAA
GCAGCCCTCG GTGCCACGAT CGCGGTTCCC ACACCGGATG GCTTGGTCAA CGTCACGATT
CCGGCCGGAG TTCGCACCGG ACAATCCCTG CGGCTGCGGG GTAAGGGCTG GCCAACTCGC
ACGGGCCGCG GGGATCTGCT GGTGAAAGTG GCGATCGCGG TACCGAAAAG CCTGACCGAG
GCAGAACGTC AGGCCTACGA ACAGTTGCAG CGGTCGCGCA GTACCGATCT GCGATCGGCA
CTCATGCAAT ACAGCCTCTA G
 
Protein sequence
MAATDFKDYY ATLGVGRAAS ADEIKKAFRK LARQYHPDMN PGDKVAEARF KEINEAYEVL 
SDTDKRRKYD QFGQYWSRVG GPTGGPGPGV GFEDFEFGRY GSFDDFINEL LGRFGGGATA
SASAGYRSPG FQDFAGGFGS QATAGARAVN LDAEASISLS LSDAFRGTQK QLRINSEMVE
VKVPAGIKAG SKLRLRGKGN IMPNTGKRGD LYLKIEVKPH EFFQLEGDQL SCEVPIAPDE
AALGATIAVP TPDGLVNVTI PAGVRTGQSL RLRGKGWPTR TGRGDLLVKV AIAVPKSLTE
AERQAYEQLQ RSRSTDLRSA LMQYSL