Gene Cyan8802_3846 details

Gene Information

Locus tag: Cyan8802_3846
Symbol:
ID: 8393196
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 8802
Kingdom: Bacteria
Replicon accession: NC_013161
Strand:
Start bp: 3959571
End bp: 3963902
Gene Length: 4332 bp
Protein Length: 1443 aa
Translation table: 11
GC content: 43%
IMG OID: 644981771
Product: pentapeptide repeat protein
Protein accession: YP_003139485
Protein GI: 257061597
COG category: [R] General function prediction only
COG ID: [COG2319] FOG: WD40 repeat
TIGRFAM ID:
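
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes a single terminal stop codon; both assumptions are consistent with the listed values but are not stated in the record. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS ending in one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(3959571, 3963902)
print(length)                     # 4332
print(protein_length_aa(length))  # 1443
```

Both results agree with the Gene Length and Protein Length fields of the record.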


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTTATTA TTGATGATAT TATTGCTGAG TTAGTCTCTC AAGCTGTTAA TGAAGCAGAA 
AAGAAAGCCA ATCATAGTGA GAGAGTTTTA CGAATATTAA AAAAGTTTAA CCTTTCTCCT
GATAGTCCTC CTAATGATAT TGAAGGGATT TATAAATATA CGTTAGTTGA ATATGGAGTA
GATCAACCTA AAGCGGTTTT AGATTTGTTT CGTGAAGCAG CAATTCAAAA AGCTTTTAGT
GATGCTTTTA GCAATAATAA TTATGCTCTT TTGCAACAAC AAGTAGACAA GGGAATTGAT
AGTTATGCTT GGGGAGATGA AATTAAAAGA CTTAATATTG ATTATCAAAA AGAATTAACT
CAATTTAGTA TTTTATTCGT TGAAATAGTC AAGCGAACTA GAACTGCTAC AGAGATTTTA
GAATCGAATA AATTAGAGAG TTTACAACGT CAATTAAATA CAGTTCAACA ACAAATCAAA
ACCCTACCTT TATCTCAACT TTATCAACAA CTTGCTTTAA TTGCTAACAA TACTCAAGCT
TTATTACCCG CAGTAGAAGA AACTAGAGAA ACTCAATTAG CTAAGAATAT TAAACAATGG
TTTAAGACCT TAAATTATAC TTTTGAAGGT TATGAGCAAT ATAATGATGA TTATTTTGAG
TTTGTTATTA ATATTCCCGT AAGGCGAGGA TATGATCGAA TTTTTGTTAG GGGAGTGGAA
GCAGAAGCAA GTATTAAAGA TGTTAACGAA GTAATAAAAT TAGTCAAAAC CCACAGAACT
GATGAAGGAT GGATCGTTGC ACCCCGTCGT ATTGCTTCAG CAGCGAAAAA TATAGCAAAA
ACTAATAATA ACGTTTATTG TTATACTTTT GATGAATTAA TCGATGATAA TGCTAATTTT
ACTCAATATT TTAATTGGTT AGAGGACTAT GTAAAAACCA GAGAGATTGA AAAATTTTAT
GTTCCTTTAG CTTGTCGAAA AGAAGAAATC GATCCTGATA CTAAACATAA ATTAGGAATG
AGTATTTATG ATGAAAAAGA AGGCTGGATC GAGGGCTATA TTGACCGATG GTTAGATGAT
CCCGTCAAAG AACATATCTC AATTTTAGGG GAGTTTGGGA CGGGTAAAAC CTGGTTTGTG
TTTCACTATG CTTGGCAAAA GTTACAGGAG TATCAGAAAG CGAAAGAAAG AGGAACTCAA
CGACCCCGTT TACCTTTAGT GATTCCTTTA AGAGATTATG CAAAAGCAGT TACAGTAGAA
TCTCTTTTAT CTGAGTTTTG TTTTCGTAAA CATGAAATTG GTTTACCTGG ATATACTGCC
TTTGAACAAC TCAACCGCAT GGGTAAATTA TTGATTATTT TTGATGGTTT TGATGAAATG
GCAGATCGAG TTGATCGGCA AAAAATGATT AATAATTTTT GGGAATTGGC TAAGGTTATT
GTTCCTGGTG CAAAAGCCAT TTTAACCTGT CGGAATGAAC ATTTCCCCGA AGCAAAAGAA
GGACGCGCAT TGCTTAATGC AGAATTAAAA GCTTCCGTGG CTAATTTAAC GGGAGAACCT
CCTCAATTCG AGATTTTAGA GTTAGAAAAG TTTAATAATC ATCAGATCAG AACTGTCTTA
GAAAAACGCA CTGACGGGGA TACTATTGAA CGAATTATGG CTAATGCTGA ATTATTAGAC
TTGGCGCGTC GTCCCTTAAT GATAGAACTA ATTTTAGCAG CAATGCCAGA CATTGAAGCA
GGAAAAGCGA TTGATATTTC TCGTATTTAT TGGTATGCGG TTCGACGTAA ATTAGAACAG
GATATTAAAC AAGAACGAAC GTTTACCAGT ATTGCTGATA AACTTTATTT TATGTGTGAA
TTAGCCTGGG AAATGCTATC CACTGATCAA ATGAGTTTGA ATTATCGTCA ATTTCCCGAT
CGCTTGCGTA GGTTATTTAG TCAGGAAGTT CAAGAACAAA AAGATCTCGA TCATTGGCAT
TATGACATGA TGGGTCAAAC GTTATTAACC CGCAATTCTG AAGGGGATTA TATGCCAGCC
CATCGTTCCT TATTGGAGTT TTTTGTGGCT TATAAATTTG CGGTAGAATT GAATATTCTA
GCCCCAGATT TTGCCGAAGA AGAAGATTTA AAGCGGTTTA ATTCTCCAGT CTCCCCCTTA
GATCTCTCCA AAACCTTCGG AAAACAACCC TTAAGCAAAG CAGTCCTAGA TTTATTAATT
CCCATGATGG AACCTGGAGA ACAGACCATC ACTAAATTAA AGGAAGTGGT CATGGCCACC
CAAGGACAAA CCCCCGAAAC TGTGGGGTAT TTGGGGGGAA ATGGGGTCAC TTTACTATTA
AAGCTTAATT GTTCTAGCTT AATTCGGCAG AATCTCAGGG AAACGGTGAT TTTAGGGGCA
GATTTCAGTC AAGCTATCTT ACATCAAGTT GATTTCACGG GGGCAAATTT AACTGAGACG
CGGTTTGCTA ACCTTTTAGG AGGTGTTCTG ACTTTGGCCT TTAGTCCTGA TGGTCAATGG
TTGGCGACGG GGGATCGCCA AGGAGTAGTT CGGGTTTGGG ATGCAGTGAC AGGGAAAGAG
GTCTTAACTT GTCGTGGTCA TCATTACTCG GTATGGTCAG TGGCCTGGAG TGGGGATAGT
CAAACCCTGG CCAGTAGCAG TGATGACAAA ACCATCAAAC TCTGGGATGT CTCCACGGGA
AACTGTCGTC TCACCTTAAC GGGTCATCAT TACTCGGTAT CTTCAGTGGC CTGGAGTGGG
GATAGTCAAG CCCTGGCCAG TTGCAGTTAT GACAAAACCA TCAAACTCTG GGATGTCTCC
ACGGGAAACT GTCGTCTCAC CTTAACGGGT CATGATGCCT GGGTATCTTC AGTGGCCTGG
AATGGGAATA GTCAAACCCT GGCCAGTGGC AGTGGTGACA ATACCATCAA ACTCTGGGAT
CTCTCCACGG GAGAGTGTCA TCTCACCTTG ACCGGTCATG ATGACTCGGT ATCTTCAGTG
GCCTGGAGTG GAGATAGTCA AACCCTGGCC AGTTGCAGTT ATGACAAAAC CATCAAACTC
TGGGATGTCT CCACCGGACT GTGTCGTCTC ACCTTAACCG GTCATCATGG CTGGGTATCT
TCAGTGGCCT GGAGTGGGGA TAGTCAAACC CTGGCCAGTG GCAGTTCAGA CAAAACCATC
AAACTCTGGG ATGTCCAGAC ACGCCAATGT CGTCTCACCT TAACCGGTCA TGATGACTGG
GTATCTTCAG TGGCCTGGAG TGGGGACAGT CAAACCCTGG CCAGTGGCAG TGAAGACAAA
ACCATCAAAC TCTGGGATGT CTCCACGGGA AACTGTCGTC TCACCTTAAC CGGTCATGAT
GCCTCGGTAT CTTCACTGGC CTGGAGTGGG GACAGTCAAA CCCTAGCCAG TGGCAGTTAT
GACCATACCA TCAAACTCTG GGATGTCTCC ACCGGACTGT GTCGTCTCAC CTTAACGGGT
CATCATGGCT CGGTATATTC AGTAGCCTGG AGTGGGGATA GTCAAACTCT GGCCAGTGGC
AGTGAAGACA AAACCATCAA ACTCTGGGAT GTCTCTACCG GAAACTGTCG TCTCACCTTA
ACCGGTCATC ATGGCTGGGT ATCTTCAGTG GCCTGGAGTG GGGATAGTCA AACCCTGGCC
AGTGGCGGCG ACGATACCAT CAAACTCTGG GATGTCTCTA CCGGAAACTG TCGTCTCACC
TTAACCGGTC ATCATGGCTG GGTATATTCA GTGGCCTGGA GTGGGGATAG TCAAACCCTG
GCCAGTGGCG GCGACGATAC CATCAAACTC TGGGATGTCT CTACCGGAAA CTGTCGTCTC
ACCTTAACGG GTCATGATGA CTTGGTATGC TCAGTGGCCT GGAGTAGGGA TAGTCAAACC
CTGGCCAGTG GCAGTTCAGA CAAAACCATC AAACTCTGGG ATGTCTCCAC GGGAGAGTGT
CGTCTCACCT TAACGGGTCA TGATGCCTCG GTATCTTCAG TGGCCTGGAG TGGGGATAGT
CAAACCCTGG CCAGTGGCAG TTCAGACAAA ACCATCAAAC TCTGGGATGT CTCCACGGGA
GAGTGTCGTC TCACCTTAAC GGGTCATGAT GACTTGGTAT GGTCAGTGGC CTGGAGTAGG
GATAGTCAAA CCCTGGCCAG TTGCAGTAGG GACGGAACCA TCAAACTCTG GGATGTCCAG
ACAGGGAAAT GTCTCCAAAC CTTCGATAAC CATCCTTACT GGGGAATGAA TATCACGGGA
GTTCAGGGGT TAAGCGACGC AGAAATAGCC ACTTTGAAGG CATTAGGCGC AGTAGAGGTC
AACGACCATT GA
 
Protein sequence
MFIIDDIIAE LVSQAVNEAE KKANHSERVL RILKKFNLSP DSPPNDIEGI YKYTLVEYGV 
DQPKAVLDLF REAAIQKAFS DAFSNNNYAL LQQQVDKGID SYAWGDEIKR LNIDYQKELT
QFSILFVEIV KRTRTATEIL ESNKLESLQR QLNTVQQQIK TLPLSQLYQQ LALIANNTQA
LLPAVEETRE TQLAKNIKQW FKTLNYTFEG YEQYNDDYFE FVINIPVRRG YDRIFVRGVE
AEASIKDVNE VIKLVKTHRT DEGWIVAPRR IASAAKNIAK TNNNVYCYTF DELIDDNANF
TQYFNWLEDY VKTREIEKFY VPLACRKEEI DPDTKHKLGM SIYDEKEGWI EGYIDRWLDD
PVKEHISILG EFGTGKTWFV FHYAWQKLQE YQKAKERGTQ RPRLPLVIPL RDYAKAVTVE
SLLSEFCFRK HEIGLPGYTA FEQLNRMGKL LIIFDGFDEM ADRVDRQKMI NNFWELAKVI
VPGAKAILTC RNEHFPEAKE GRALLNAELK ASVANLTGEP PQFEILELEK FNNHQIRTVL
EKRTDGDTIE RIMANAELLD LARRPLMIEL ILAAMPDIEA GKAIDISRIY WYAVRRKLEQ
DIKQERTFTS IADKLYFMCE LAWEMLSTDQ MSLNYRQFPD RLRRLFSQEV QEQKDLDHWH
YDMMGQTLLT RNSEGDYMPA HRSLLEFFVA YKFAVELNIL APDFAEEEDL KRFNSPVSPL
DLSKTFGKQP LSKAVLDLLI PMMEPGEQTI TKLKEVVMAT QGQTPETVGY LGGNGVTLLL
KLNCSSLIRQ NLRETVILGA DFSQAILHQV DFTGANLTET RFANLLGGVL TLAFSPDGQW
LATGDRQGVV RVWDAVTGKE VLTCRGHHYS VWSVAWSGDS QTLASSSDDK TIKLWDVSTG
NCRLTLTGHH YSVSSVAWSG DSQALASCSY DKTIKLWDVS TGNCRLTLTG HDAWVSSVAW
NGNSQTLASG SGDNTIKLWD LSTGECHLTL TGHDDSVSSV AWSGDSQTLA SCSYDKTIKL
WDVSTGLCRL TLTGHHGWVS SVAWSGDSQT LASGSSDKTI KLWDVQTRQC RLTLTGHDDW
VSSVAWSGDS QTLASGSEDK TIKLWDVSTG NCRLTLTGHD ASVSSLAWSG DSQTLASGSY
DHTIKLWDVS TGLCRLTLTG HHGSVYSVAW SGDSQTLASG SEDKTIKLWD VSTGNCRLTL
TGHHGWVSSV AWSGDSQTLA SGGDDTIKLW DVSTGNCRLT LTGHHGWVYS VAWSGDSQTL
ASGGDDTIKL WDVSTGNCRL TLTGHDDLVC SVAWSRDSQT LASGSSDKTI KLWDVSTGEC
RLTLTGHDAS VSSVAWSGDS QTLASGSSDK TIKLWDVSTG ECRLTLTGHD DLVWSVAWSR
DSQTLASCSR DGTIKLWDVQ TGKCLQTFDN HPYWGMNITG VQGLSDAEIA TLKALGAVEV
NDH
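
The gene and protein sequences above are printed in 10-character blocks, so the whitespace has to be removed before they can be used. The following is a minimal sketch of how the blocks might be joined and how the translation could be spot-checked against the protein sequence. It assumes translation table 11 (as listed in the record), which assigns the same amino acids to codons as the standard code, and it does not handle alternative start codons or ambiguous bases. The helper names `join_blocks` and `translate` are illustrative and not part of any database tool.

```python
# Amino acids for all 64 codons, with bases ordered T, C, A, G at each
# position (the layout NCBI uses for its genetic code tables).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def join_blocks(text: str) -> str:
    """Remove the spaces and line breaks between sequence blocks."""
    return "".join(text.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence as printed in the record.
first_line = "ATGTTTATTA TTGATGATAT TATTGCTGAG TTAGTCTCTC AAGCTGTTAA TGAAGCAGAA"
print(translate(join_blocks(first_line)))  # MFIIDDIIAELVSQAVNEAE

# Last four codons of the gene sequence, including the stop codon.
last_codons = "AACGACCATT GA"
print(translate(join_blocks(last_codons)))  # NDH
```

The two outputs match the first 20 and last 3 residues of the protein sequence shown in the record.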