Gene PCC8801_2505 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_2505 
Symbol 
ID7101690 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp2596539 
End bp2597654 
Gene Length1116 bp 
Protein Length371 aa 
Translation table11 
GC content42% 
IMG OID643475547 
Productpentapeptide repeat protein 
Protein accessionYP_002372669 
Protein GI218247298 
COG category[S] Function unknown 
COG ID[COG1357] Uncharacterized low-complexity proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCAGACT ATTCGCCTGT TCTTCATCTT GTCACCTTCC ATAACCCCTA TCCTAACTGT 
GTTTCCCTCA AATTAGACAC CATGGGGAGT CAATCCTCAT CGGGCCAGAT AATGCTTCAG
TTACAGGGAC ACTTCAACGA ACAGGAAAAA GACCTCCTCA ATGGTCATCT GAAATTCGGT
TTAAAGGGGG GTATACTATC ACTTGAACTG GAAAATGGAG AAATAATCTA TCCTGAACCC
TTACTAGAAG ATTGGGCACA ACTTCAAACC CAGTCGTCTG TCAATCCAAG TTGGGAATTA
ACCCCAAAAA CTGGGGCATC TATCCTAAAA ATCGATAATA TTACTGTTCC TTTCGCCATT
ATTCAACCTC AAACTGAGCC ATTATATCTA ACAGTCACCT TAAAAGTCAC TCCTCAAAAC
CTTTCTATTA CCAATGCAGA AGGGTTATGG CGGCACGATA TTCACCCCAA CCAACACGCG
ATATTAGAAC GGGTATTAGC CCAATTTTTG TATAAAAATC GCTTATCTTC CCATTTGTGC
CGTTTAGTTT TTAGCTCTAA TAAGGGGACT CATCAGGCAA CCCTAGAAGA CTATCCTAGC
CAAGAACTTG ACTCCCATGA ATTAGCTCAA CTGCATCAAC GCATTGAACA GCTTTACGCT
GCCAATACCC ATAATTTAGC TGAATTGATC AAATTAGCGC ATTTTAACCC TTTAACAGAC
CTAGCAGGAG GCAATTTTTT AGCGGCTGAA TTAAGCGCAG TGGAGTTAAG TGGAGCGAAT
CTGACTCAAA CCAATTTTCG AGGAGCGAAT TTGACCGATG CAGAGTTAAG CGAGGCTATC
CTAAACTATT GTAAATTCAG TGGAGCCGAC TTAAGTGGGG CTTATTTAGG CAATGCTCAA
TTAGTGAAAG CGGATTTTCA TCGCGCGAGT TTAGCCGTTG CTAACCTCAT TGGGGCGAAT
CTAACGGAAG CTAACTTAAG GGAAGCTAAC TTAATTGACA CTAATTTAAG CGGAGCAACC
GTTAAAAACG CAAAATTCGG CGAAAATCCA GGCATGACCC CAGAATTAGA GCAGAGTTTA
CGCGAACGCG GTGCAATTTT TGTCCATAAT CCTTAA
 
Protein sequence
MSDYSPVLHL VTFHNPYPNC VSLKLDTMGS QSSSGQIMLQ LQGHFNEQEK DLLNGHLKFG 
LKGGILSLEL ENGEIIYPEP LLEDWAQLQT QSSVNPSWEL TPKTGASILK IDNITVPFAI
IQPQTEPLYL TVTLKVTPQN LSITNAEGLW RHDIHPNQHA ILERVLAQFL YKNRLSSHLC
RLVFSSNKGT HQATLEDYPS QELDSHELAQ LHQRIEQLYA ANTHNLAELI KLAHFNPLTD
LAGGNFLAAE LSAVELSGAN LTQTNFRGAN LTDAELSEAI LNYCKFSGAD LSGAYLGNAQ
LVKADFHRAS LAVANLIGAN LTEANLREAN LIDTNLSGAT VKNAKFGENP GMTPELEQSL
RERGAIFVHN P