Gene Cyan8802_4356 details


Gene Information

Locus tag: Cyan8802_4356
Symbol:
ID: 8393708
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 8802
Kingdom: Bacteria
Replicon accession: NC_013161
Strand:
Start bp: 4507821
End bp: 4509191
Gene Length: 1371 bp
Protein Length: 456 aa
Translation table: 11
GC content: 30%
IMG OID: 644982266
Product: DNA-cytosine methyltransferase
Protein accession: YP_003139977
Protein GI: 257062089
COG category: [L] Replication, recombination and repair
COG ID: [COG0270] Site-specific DNA methylase
TIGRFAM ID: [TIGR00675] DNA-methyltransferase (dcm)
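
The coordinate and length fields above agree with one another, which a little arithmetic can confirm. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not say so, but the numbers agree under that reading) and that the final codon is a stop codon not counted in the protein length. The variable names are illustrative.

```python
# Values copied from the record above.
start_bp = 4507821
end_bp = 4509191
reported_gene_length = 1371      # bp
reported_protein_length = 456    # aa

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length == reported_gene_length)        # length matches the coordinates
print(remainder == 0)                             # whole number of codons
print(protein_length == reported_protein_length)  # protein length matches
```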


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.593221
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.193587
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAATA AGTCTACTAT TGAACAACTT GAATTATTTA ATCTTGCTAA ACCCCTTAAT 
TTAAGCTTTT GTCAATCGAA ATTTACTTTT ATAGATTTAT TTTCTGGAAT TGGAGGATTT
CGTATCCCCT TAGAACAATT AGGGGGAAAA TGTTTAGGGT ATTCTGAAAT TGATACAGAA
GCAATTAAAG TCTATAGACG TAATTTCATT AGGTATAGTA ATCGGGATGA AACCTATTTA
GGAGATATTA CACAACTTAA TCAAATTCCT TTTAAAGTTG ATGTAATTGT GGGAGGAGTA
CCCTGTCAAC CCTGGTCTAT TGCGGGAAAA TTAAGGGGAT TTGAAGATCC TAGAGGTCAA
CTTTGGTTTG ATGTTATTAG ATTAATTAAG GATAATAAAC CTAAAGGATT TATCTTTGAA
AATGTTAAAG GATTAACCGA TCCCAGAAAT CAAGAAAGTT TTGATTATAT TCTCAATCAA
TTAAAGCAAT CAGGGTATTA TGTACAGCAT AAGGTTCTTA ATTCCTATGA TTTTGGTTTA
TCCCAGGATA GAGATAGAGT ATTTATTGTA GGCATTCATC AACAAATTGA AAATGCTGCT
CAATTTTCTT TTCCAGAACC CTTAAATCTG AGTCCTAAAC TCTATGAATT TATTGAAGGA
ATAGAAAAAC AAGAATTAGT GAAAAAGAAA TTTTCACCAG ACATTTTATT TGAGGGTAAA
ATTCCTGCTT CTAGAGGGAG ATTCCAAAAA AATGATGAAT TAAACGATTT CTTTATCTTC
TCAGATATTA GGGATGGACA CACAACCATT CACTCTTGGG ATTTGATTCA AACTAATTGC
CAAGAAAAGC TAATTTGTCA AACAATTTTA CGAAATAGAA GGAAAAAGAA ATACGGAGGT
AAGGATGGAA ACCCTCTAAA CTTTGACATC CTAGAAACAC TGATACCTAA TTTAGAAAAA
GATGAATTAG AAAAATTAGT TGATAAAAAA ATTCTGCGCT TTATTAAGGA TAGAGATTAT
GAATTTGTTA ATTCAAAAAT ATCATCTGGA ATTAATGGGA TTTCCAAAAT CTTTTTACCT
CATTCTGCTG TTATTGCTAC TTTAACAGCT ACTGGAACTA GGGATTATGT TGCTACAAAA
TTTTTTAATT GCCAAGATCC TATAATTTAT AAAGAAAAGT TTATTAAAGA GATTTATCTT
AAAAACAAAT ATAAACCATT ATCAAGTCGA GATTATGCTA GACTACAAGG ATTTCCTGAT
TGGTTTATTA CCGCAGATAA TGAAAGTAAT GCTAAAAAAC AATTAGGTAA TGCAGTATCC
ATCCCTGTTG TTTATTATTT AGCAGAATCT TTACTAAAAA TAATTGGGTA A
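
The GC content field gives the share of G and C bases in the gene sequence. Below is a minimal sketch of that calculation, assuming the sequence is pasted as text with the 10-base groups and line breaks shown above. `gc_percent` is an illustrative helper, not part of any database tooling, and only the first two groups of the sequence are used here, so the printed value describes that fragment rather than the whole gene.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)

# First two 10-base groups of the gene sequence above.
fragment = "ATGAAAAATA AGTCTACTAT"
print(gc_percent(fragment))
```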
 
Protein sequence
MKNKSTIEQL ELFNLAKPLN LSFCQSKFTF IDLFSGIGGF RIPLEQLGGK CLGYSEIDTE 
AIKVYRRNFI RYSNRDETYL GDITQLNQIP FKVDVIVGGV PCQPWSIAGK LRGFEDPRGQ
LWFDVIRLIK DNKPKGFIFE NVKGLTDPRN QESFDYILNQ LKQSGYYVQH KVLNSYDFGL
SQDRDRVFIV GIHQQIENAA QFSFPEPLNL SPKLYEFIEG IEKQELVKKK FSPDILFEGK
IPASRGRFQK NDELNDFFIF SDIRDGHTTI HSWDLIQTNC QEKLICQTIL RNRRKKKYGG
KDGNPLNFDI LETLIPNLEK DELEKLVDKK ILRFIKDRDY EFVNSKISSG INGISKIFLP
HSAVIATLTA TGTRDYVATK FFNCQDPIIY KEKFIKEIYL KNKYKPLSSR DYARLQGFPD
WFITADNESN AKKQLGNAVS IPVVYYLAES LLKIIG
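
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. Below is a minimal sketch of that relationship, assuming the compact 64-letter amino acid string commonly used to describe the standard code, with bases ordered T, C, A, G. Table 11 assigns the same amino acids to codons as the standard table and differs in the permitted start codons, so the same string is reused here. `translate` is an illustrative helper. Only the first and last lines of the gene sequence are included; the last line is in frame because every preceding line holds 60 bases.

```python
from itertools import product

BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Map each codon (TTT, TTC, TTA, ...) to its one-letter amino acid.
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(seq: str) -> str:
    """Translate DNA in frame 0, ignoring whitespace, stopping at a stop codon."""
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

first_line = "ATGAAAAATA AGTCTACTAT TGAACAACTT GAATTATTTA ATCTTGCTAA ACCCCTTAAT"
last_line = "ATCCCTGTTG TTTATTATTT AGCAGAATCT TTACTAAAAA TAATTGGGTA A"

print(translate(first_line))
print(translate(last_line))
```

The two printed strings can be compared against the start and end of the protein sequence listed above.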