Gene Cyan8802_2871 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCyan8802_2871 
Symbol 
ID8392198 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8802 
KingdomBacteria 
Replicon accessionNC_013161 
Strand
Start bp2907406 
End bp2909673 
Gene Length2268 bp 
Protein Length755 aa 
Translation table11 
GC content46% 
IMG OID644980821 
Productheat shock protein DnaJ domain protein 
Protein accessionYP_003138556 
Protein GI257060668 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0484] DnaJ-class molecular chaperone with C-terminal Zn finger domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.193929 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.104085 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGCATCC CCCTTGATTA CTATAAAATT TTGGGAATTT TTCCCCAAGT AACAGATGAG 
CAGTTAGAGC ACGCTTATCG AGATCGTAGT CTTCAATTAC CTCGTCGAGA GTATAGTGAA
GCAGCGATCG CGGCTCGCAA ACAACTGCTC ACCCAAGCCC ATGACGTTTT ATCAAATTCA
GCGAAAAGAA CCGCATACGA GGCCTTATTT TTAGAAGATA TTTTGCCTAA GGACTCAAAC
TTAGCCCCAG AATCTTCCTC AGAAACCACG GATAGACCCC AAGAAGATTT CCCCCACCCA
GGAACCTCAA CCCTAGACAT TGCCCCTGAG CAATTAGTGG GGGCTTTGAT GATTCTTCAG
GAGTTAGGCG AATACGAATT GGTGATTAAA TTGGGGGAAC CTCATTTGTT GAGTTTTCCC
TCACTAACCC TTTTCAATAG CCCAGAAGAC CAATTATCTT CCACTCGTGC TGACATCATT
CTGACCCTAG CCTTAGCTAA GCTAGAACTC AGTCGAGAAC AATGGCAACT CGAAGAATAC
GAGCAAGCTG CTACTTTAGG AAGCCAAGCA CTAGAGCTAC TGCAAAAAAA TTCCCTATTT
CCTGGCTTAC AAACCGAAAT TTGCCATGAA TTGAACAAGC TTCGCCCCTA CCGCATTCTC
CAATTATTAG CCCAACCCGA GAACAACAAA AGCGATCGCC AGCGAGGCCA ACACCTGCTT
CAAGAAATGT TACAAGAAAG GCAAGGCATT GATGGCTTAG GAAATGATCA CTCAGGCTTA
GATATTGATG ATTTTTTGCG GTTTATCCAG CAACTTCGCC ACTATTTAAC CGCCGAAGAA
CAACAAGAAC TTTTTTTAGC TGAAGCACAC CGTCCCTCGG CCGTTGCTGC CTATCTAGCC
GTTTATGCCT TAATAGCCCA AGGCTTTGCC CAGAAACAGC CCGCCTTACT CCTAGAAGCC
CAAACCATGC TCAGTGGACT GGCCAAACGA CAAGATGTTT CCCTAGAACA AGGCATTTGT
GCGTTATTAT TAGGACAAAC TCAGGCCGCG AGTCAAATTC TCGAAAGCTG TCAAGACACT
GAAGCGTTAG CCTTGATCCG AGAACACTCT CAAGGATCAC CCGATCTCCT ACCTGGGTTA
TGCTGGTATG GAGAATACTG GCTAAAAATA GAAGTATTGG CTCATTTTCG GGATTTACGC
CAATATTCAG TTTCTTTAGC CGACTATTTT GCGGAAGAGG AAGTTCAGAC CTATCTAGAA
CAGTTATCTG GCGAGCCTCA AGAAATTCTG CTTGAGGAGC AACAACAAAG AGGAGTGACG
ATGACGAGAT CAGTTTCTCG ACGTAACCAC AGTTTACAAG AACAAGGAAC CGTCCATAGA
CGGCGAGTCC CCTTAGCTCG TTCTCACCAA GTTGAACCCC AGTTATTGGC CAGTGGGGGG
GGCGGTGTAG CAGCGATCGC TACAAGCGTT CCGACTCCTG TGGCTCCCCA TCGTCGGCGT
TTTCGAGATC AGTCCACTAG ACCCTCTAGA ACCAACCGAG ATAGGAGACT GAACAATACT
CGAACCTATC CTAACCCCCA GGAGAAACCT CCCCTCCCAG TGGGTTCACC AGCCCCCTCA
GAAGGTGTTC AACTAGAGCC AGTTCCCTCT GGTTTATCCA AGCGCAAAAA ACAACAACGC
CCAACAACTT TTAAACCTCG GCCTTGGTTC GTTTGGATCG CCGTATTCGC TGCTTTGGGG
ATGGTAGGAT TATCCGTTAA GTGGATTCAA TATTCTATGT CCCCCTTAGC GGCTTTAGAA
GAAGAACAAT TGGTTCTGAA CTTGGCTCAG CCTCCGATTC AAATTCCCTC AGAGCAATCT
CCCCTAACGA CGAAAGAAGG TCTGTTGAGT CCACAAGGGG CGAAACAGGT GATTCAACTG
TGGTTGTCGA GCAAATCTCA AGCCTTTGGT TCTAATCATG AAATTGAATC ATTAAACCAA
ATTTTAGGGA CTTCTCTATT AGCTCTTTGG AAAGATCGGG CGCAAAAGTT GAAAGAAAAT
AGAAATTATT GGCAATATAC TCATGATTTT AAGATAGAAT CTCTGAAAAC TACTAAAAAC
AGCCCTAAAA CAGCCATTGT CAAGGCTAAA GTCACAGAAA GAGCCAAATT CTATGAGAAA
GGTCAACTCA ATTCAGGTCG CTCCTACAAC GATCAGCTAC GGGTGGAATA TCAGTTAACC
CATCAAGGGG ATAGCTGGCT CATTGAGTCT ATTCGAGTCA TTAATTAA
 
Protein sequence
MRIPLDYYKI LGIFPQVTDE QLEHAYRDRS LQLPRREYSE AAIAARKQLL TQAHDVLSNS 
AKRTAYEALF LEDILPKDSN LAPESSSETT DRPQEDFPHP GTSTLDIAPE QLVGALMILQ
ELGEYELVIK LGEPHLLSFP SLTLFNSPED QLSSTRADII LTLALAKLEL SREQWQLEEY
EQAATLGSQA LELLQKNSLF PGLQTEICHE LNKLRPYRIL QLLAQPENNK SDRQRGQHLL
QEMLQERQGI DGLGNDHSGL DIDDFLRFIQ QLRHYLTAEE QQELFLAEAH RPSAVAAYLA
VYALIAQGFA QKQPALLLEA QTMLSGLAKR QDVSLEQGIC ALLLGQTQAA SQILESCQDT
EALALIREHS QGSPDLLPGL CWYGEYWLKI EVLAHFRDLR QYSVSLADYF AEEEVQTYLE
QLSGEPQEIL LEEQQQRGVT MTRSVSRRNH SLQEQGTVHR RRVPLARSHQ VEPQLLASGG
GGVAAIATSV PTPVAPHRRR FRDQSTRPSR TNRDRRLNNT RTYPNPQEKP PLPVGSPAPS
EGVQLEPVPS GLSKRKKQQR PTTFKPRPWF VWIAVFAALG MVGLSVKWIQ YSMSPLAALE
EEQLVLNLAQ PPIQIPSEQS PLTTKEGLLS PQGAKQVIQL WLSSKSQAFG SNHEIESLNQ
ILGTSLLALW KDRAQKLKEN RNYWQYTHDF KIESLKTTKN SPKTAIVKAK VTERAKFYEK
GQLNSGRSYN DQLRVEYQLT HQGDSWLIES IRVIN