Gene PCC8801_3366 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_3366 
Symbol 
ID7103020 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp3514629 
End bp3515894 
Gene Length1266 bp 
Protein Length421 aa 
Translation table11 
GC content46% 
IMG OID643476381 
Productpentapeptide repeat protein 
Protein accessionYP_002373490 
Protein GI218248119 
COG category[S] Function unknown 
COG ID[COG1357] Uncharacterized low-complexity proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGACCC CAGAGACACC AGACACTCTC CCTCAACAAA ATGGTCAGAA AGCCACTCCA 
TCCCTGCCAG AGAAAATAGA GTCTTTGTAT CCATCGGGTC TACCGTTCTC TAAGACACTA
TTGCCTCCCC CTAACTCAGA ACCCCAACCC CTCTATCTAC GGGACGTAAA ATCTTTTAAT
CCTTGGCTAC TGTTGAGTTC AGCCGTCATC ATGATAGTAG GATTAGAGTT TAATTTCCCG
TGGTTGGGCT TTTCGGCAGC TTTGTTGTCC CTTTTTCTCT CGCTTCAGGT GATTTTACCC
TCACTACGAG GATGGGTCAT TCGCTATTTA ACTCCCCAAG AACGACAAAC CTTGTTAGGA
TTTTTGGTGT TTATTGCAGC GATCGCCGGA TTAGCTTATT ATTTTGGATT CTACGATCGC
CTCAGAATTT GGCTTAATCA GTTCAAATAC GATGAATTTG GCTCTTGGGC TGAATGGGTG
GGCGCATTGG GTCAAATTAT GATTGCCTTA CTCGCGGTTT ATATCGCTTG GGCACAATAC
GTCATTTCTA AGGATTTAAC CCTCCAACAA AACCTGATTA CCCAACAACA AACCATTGAT
ACCTATTTTC AGGGGATCTC CGACCTAGTG TTGGATGGCG AAGGAATGCT CGAAGACTGG
CCTCAAGAAC GATCTATCGC TGAAGGCAGA ACCGCCGCTA TTTTCAGCAG CGTAGATGAA
ACAGGAAAAG CCAAAATTTT GCGTTTTCTG TCCCAGTCTC GATTATTAAC TCCTTTAATG
CGCGATAGTC GCTTAGGAAG ACCTATCCTC GATGGAGCAG GGGGATACGC TGAAGATCGT
CCATCAGGGG TGCGGGTGAT TAACTTAGGG GTGATGTTAG CAGGGGCTAA ACTATCCGGT
CAAGATTTAC GCTGGACAGA TTTAAGCGAA GCCAATATGG TACGCGCTGA TTTAAGTCAC
TGTGACTTGG TTAAAGCCAA TTTATCCCGC ACGGTTCTCT ATGATGGCAA CTTAAAAGGA
GCCGATCTCA AAGGGACTCG TTTGTTCTAT GGCTCAGTGG AAACGGCTAG TCCGCGATCG
CGTAGTGCCC CCCCAGACTA TGAAACGGGA GCCTATACCG GGGTCGTTTT AGAAAATTGT
AATTTAGAAG ACGTACAAAA CCTCAGTGAC GAACAGCGTT ATTATTGCTG TGCTTGGGGA
GGGGAAAAAA CCCGCGCCAC TATTCCAGGG GGATGTTATG GTGTTCCGAA TAAATTGGGA
CGTTAG
 
Protein sequence
MTTPETPDTL PQQNGQKATP SLPEKIESLY PSGLPFSKTL LPPPNSEPQP LYLRDVKSFN 
PWLLLSSAVI MIVGLEFNFP WLGFSAALLS LFLSLQVILP SLRGWVIRYL TPQERQTLLG
FLVFIAAIAG LAYYFGFYDR LRIWLNQFKY DEFGSWAEWV GALGQIMIAL LAVYIAWAQY
VISKDLTLQQ NLITQQQTID TYFQGISDLV LDGEGMLEDW PQERSIAEGR TAAIFSSVDE
TGKAKILRFL SQSRLLTPLM RDSRLGRPIL DGAGGYAEDR PSGVRVINLG VMLAGAKLSG
QDLRWTDLSE ANMVRADLSH CDLVKANLSR TVLYDGNLKG ADLKGTRLFY GSVETASPRS
RSAPPDYETG AYTGVVLENC NLEDVQNLSD EQRYYCCAWG GEKTRATIPG GCYGVPNKLG
R