Gene PCC8801_0920 details

Gene Information

Locus tag: PCC8801_0920
Symbol:
ID: 7102013
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 8801
Kingdom: Bacteria
Replicon accession: NC_011726
Strand:
Start bp: 971272
End bp: 972396
Gene Length: 1125 bp
Protein Length: 374 aa
Translation table: 11
GC content: 39%
IMG OID: 643474013
Product: peptidase M50
Protein accession: YP_002371153
Protein GI: 218245782
COG category: [R] General function prediction only
COG ID: [COG1994] Zn-dependent proteases
TIGRFAM ID:
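
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the record above.
START_BP = 971272
END_BP = 972396
GENE_LENGTH_BP = 1125
PROTEIN_LENGTH_AA = 374


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one for the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Both comparisons hold for this record: 972396 - 971272 + 1 = 1125 bp, and 1125 / 3 - 1 = 374 aa.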


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACAACA ATATAAGGGT TGGCAATTTA TTCGGGATTC CTTTTTATGT AAATCCTTCT 
TGGTTCCTAG TATTAGGGTT AGTTACCTTA AGCTATGGAG GACAGTTAGC CTTGTTCCCC
CAATTAGGAG GAATTACTCC CTGGATTCTG GGTTTTGTTG CTGCATTACT CTTATTTTCT
TCAGTCGTTG CCCATGAATT GGGACATAGT TTTGTAGCGA TGTCCCAAGG GATTGAAGTT
AAATCGATTA GCCTCTTTTT GTTTGGGGGA TTAGCCAATT TAGAAAGAGA ATCTGAGACA
CCTTTTGAAG CCTTTTTAGT GGCGATCGCA GGTCCTGCGG TTAGTTTAAT TCTCTTTCTT
TTTTTAACCC TAATTGTTAG CAATTTTGCC TTTAGTGCCC CCATTACAGC CATCCTAGGT
TTACTTGCCT ATATTAACTT AATTCTGGGC TTATTTAACC TAATTCCTGG GCTACCTTTG
GACGGGGGTA ACATTCTAAA AGCCCTTGTT TGGAAGATTA CAGGTAATCC GAATAAAGGC
ATTATTTTTG CCAGTCGAGT CGGACAACTG TTTGGTTGGA TAGCCGTTAC TATCGGTGGA
TTAGCGATTT TAGGGATTAG TCCTATCGGC AGTTTCTGGA CTTTATTAAT TGGCTTTTTC
TTGTTACAAA ATGCAGGATT TTCGGCTCAA TCGGCTCAAT TCCAAGAAAC CCTAAGCGGT
TATACGGCTG AAGATGCGGT TATTCCTGAT AGTCCAGTCG TTTCTGATAG CTTAAATGTC
AGAGAATTTG TTAACGACTA TGTAATCGGT AAGAGTGTCT GGAAAAAGTT TTTAGTGACT
AATGAAGAAG GGAAACTATC AGGTATTCTT GAAATAGATA GTTTGAAAAA AGTGTCTACT
TCCCAATGGA CTGAATTAAA ACTTGCTGAG ATAATGGAAC CCATTAGTCC TAATATCACT
CTAATTCAAG CGGATCAATC TTTGTTAGAG GTGGTTAAAC TATTAGAGAA TGATCCTCGT
CAACAATTAA CCGTCGTCAA AGATAATGGT GTCGTTCTCG GATTATTAGA GAAAGCTTCT
GTTATCAAGT TTCTCCAACA AAAAGCACAA GCTAAAGCTA TTTAA
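
The gene sequence above is printed in space-separated blocks of 10 bases, so it has to be cleaned before any calculation such as GC content. A minimal sketch, assuming the sequence is pasted in as a plain string; the helper names are hypothetical.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First row of the gene sequence, copied from the record above.
first_row = "ATGAACAACA ATATAAGGGT TGGCAATTTA TTCGGGATTC CTTTTTATGT AAATCCTTCT"
print(len(clean_sequence(first_row)))
print(round(gc_percent(first_row), 1))
```

Passing the full 1125 bp sequence to `gc_percent` gives a value that can be compared against the GC content field in the record.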
 
Protein sequence
MNNNIRVGNL FGIPFYVNPS WFLVLGLVTL SYGGQLALFP QLGGITPWIL GFVAALLLFS 
SVVAHELGHS FVAMSQGIEV KSISLFLFGG LANLERESET PFEAFLVAIA GPAVSLILFL
FLTLIVSNFA FSAPITAILG LLAYINLILG LFNLIPGLPL DGGNILKALV WKITGNPNKG
IIFASRVGQL FGWIAVTIGG LAILGISPIG SFWTLLIGFF LLQNAGFSAQ SAQFQETLSG
YTAEDAVIPD SPVVSDSLNV REFVNDYVIG KSVWKKFLVT NEEGKLSGIL EIDSLKKVST
SQWTELKLAE IMEPISPNIT LIQADQSLLE VVKLLENDPR QQLTVVKDNG VVLGLLEKAS
VIKFLQQKAQ AKAI
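
The protein sequence can be reproduced from the gene sequence by translating codon by codon. The sketch below is an illustration under stated assumptions, not the pipeline used by the database: it relies on translation table 11 assigning the same amino acids to sense codons as the standard code (the two differ in which start codons they permit), and the names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon table built in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record above.
print(translate("ATGAACAACA ATATAAGGGT TGGCAATTTA"))  # MNNNIRVGNL
```

The output matches the first ten residues of the protein sequence listed above.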