Gene PCC8801_0859 details

Gene Information

Locus tag: PCC8801_0859
Symbol:
ID: 7103193
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 8801
Kingdom: Bacteria
Replicon accession: NC_011726
Strand:
Start bp: 900141
End bp: 902225
Gene Length: 2085 bp
Protein Length: 694 aa
Translation table: 11
GC content: 41%
IMG OID: 643473952
Product: Oligopeptidase A
Protein accession: YP_002371093
Protein GI: 218245722
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0339] Zn-dependent oligopeptidases
TIGRFAM ID:
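
The start, end, and length fields above can be cross-checked with simple arithmetic. The sketch below assumes two things the record does not state explicitly: that the coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(900141, 902225)
    print(length)                  # 2085
    print(protein_length(length))  # 694
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.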


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCAACG CTACCGTAAC GAAGAATCCC TTACTCATTG GCAGAGGACT TCCTCCTTTC 
AATGAAATTC AACCCGATCA AGTAGTTTCT GCCATCACTG AATTATTAGA AAATTTAGAT
TCCGAATTAA CTAAGTTAGA AGGAACTGTT ACCCCAACTT GGGAAGGGTT GGTTGACCCT
CTAACGGAAA TTGAAGAACG CTTAACTTGG ACTTGGGGAA TTGTGGGACA TTTAATGTCT
GTTAAAAATA GTCCTGAGTT ACGGGAAACC TACGAAACAG TACAGCCTAA TGTGGTTCAA
TTTATCAATA AATTAAGCCA AAGTGAACCG CTATATAAAG CTTTTAAAGC CCTCCAAAAT
AGCGAGGTTT GGAACACCCT AGAATCTGCT CAAAAACGTA TTGTAGAAAC GGCTATCCGC
GAAGCCGAAC TCGCTGGTGT CGGGTTAGCA GGAGAACAAC GGGAACGGTT CAATCAAATT
CAATTGGAAC TAGCGGAACT TTCAACTAAA TTTTCTAATA ATGTTTTGGA TGCAACCAAA
GCCTTTCAAC TGAAATTAAC TACTCCAGAA GAAGTCGATG GTTTACCCGC TAGTTTACTC
AGTTTAGCAG CACAAACGGC GCGATCGCAA GGAGAAGAAA ATGCGACTCC AGAAGCGGGA
CCTTGGGTCA TTACTTTGGA CTATCCTAGT TATGTACCCT TTATGAAATA CAGCACGAGA
AGCGATTTAC GCGAACAAGT TTACAAAGCC TTTTTAACCC GTTCGTCTCA AGGAGATTTA
GATAATAATC CTTTGATTGA ACGGATTTTA GAACTGCGTC AAGAACAAGC TCAATTATTA
GGATATAAGA CCTATGCTGA GGTCAGTTTA GCGCGTAAAA TGGCTCCCGA CGTGGAAACC
GTTGAAAAAC TCCTAGAAGA ATTACGTCAA GTTAGTTATG AAGCTGCTGT CAAGGACTTA
GAAACCTTAA AAACCTTTGC TAAAACTGAT GATTTGCAAC ACTGGGATAT TAGTTTTTGG
GCAGAAAAAC AACGGGAAGC TAAGTTTAAT TTTACGGCTG AAGAATTGCG GCCTTATTTT
CCTTTGCCTC AAGTATTAGA AGGCTTATTT ACGTTGGCTA AGCGGATTTT TGGGGTAACA
ATAACGTCTG CGGATGGTCA AGCCCCAGTA TGGCATGAAG ATGTGCGTTA TTTCCAGGTT
AATAATGAAT TAGGAGAAGC GATCGCCTAC TTTTATTTAG ACCCCTACAG TCGTCCCGCA
GAAAAACGGG GAGGAGCCTG GATGAATGAT TGTATTGGTC GAGCAAAAAT TCGCCTAGAT
GGAACATTTT CTACCCGTTT ACCCGTTGCC TATCTTATCT GTAATCAAAC CCCTCCTGTC
GATGGAAAAC CCAGTTTAAT GACCTTTGAT GAGGTGACAA CGTTATTTCA TGAATTTGGC
CACGGACTAC AACATATGTT GACTAAAGTG GACTATCCTG GGGCATCAGG AATTAATAAT
GTAGAATGGG ATGCGGTGGA ATTGCCCAGT CAATTTATGG AGAATTGGTG TTACGATCGC
CAGACTTTGT TCAACTTAGC TAAACATTAT GAAACAGGGG AAACCTTACC CGAACACTAT
TATCAAAAGT TAGTAGATGC TCGTAATTAT ATGAGTGGTT CGGCCATGTT GCGTCAATTA
CATTTTAGTT TCTTGGACTT AGAATTACAC CACCGTTATC AACCCAATGG AAACGAAACC
CCTAGTCAAG TCCGCGATCG CATTGCCCAG AATACCACGG TTATGAAACC CTTACCAGAG
GATGCTTTTT TGTGTTCTTT TGGTCATATT TTTGCGGGAG GGTATGCCGC AGGATATTAT
AGTTATAAAT GGGCAGAAGT TTTAAGTGCA GATGCCTTTG CTGCTTTTGA AGAAGTCGGG
TTAGAAGATG AGCACGCAGT GGCAAAAACG GGTCAACGTT TTCGGGATAC AGTATTAGCT
TTAGGGGGAA GTATTCACCC TATGGAGGTT TTTAAAACCT TCCGAGGACG AGAACCTAAG
ACTGAACCAT TATTAAGACA TAGTGGTTTA TTACAGGTGG CTTAA
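
A GC content figure like the one in the Gene Information block could be recomputed from a gene sequence printed in this layout. This is a minimal sketch, assuming GC content is defined as the fraction of G and C bases over all bases, and that the spaces and line breaks are formatting only. The function name `gc_content` is hypothetical.

```python
def gc_content(dna: str) -> float:
    """Fraction of G and C bases in a sequence.

    Whitespace (the 10-base block separators and line breaks used in
    this record) is removed before counting.
    """
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


if __name__ == "__main__":
    # First two 10-base blocks of the gene sequence above.
    print(gc_content("ATGACCAACG CTACCGTAAC"))  # 0.5
```

Passing the complete gene sequence as one string would give the value to compare against the reported percentage.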
 
Protein sequence
MTNATVTKNP LLIGRGLPPF NEIQPDQVVS AITELLENLD SELTKLEGTV TPTWEGLVDP 
LTEIEERLTW TWGIVGHLMS VKNSPELRET YETVQPNVVQ FINKLSQSEP LYKAFKALQN
SEVWNTLESA QKRIVETAIR EAELAGVGLA GEQRERFNQI QLELAELSTK FSNNVLDATK
AFQLKLTTPE EVDGLPASLL SLAAQTARSQ GEENATPEAG PWVITLDYPS YVPFMKYSTR
SDLREQVYKA FLTRSSQGDL DNNPLIERIL ELRQEQAQLL GYKTYAEVSL ARKMAPDVET
VEKLLEELRQ VSYEAAVKDL ETLKTFAKTD DLQHWDISFW AEKQREAKFN FTAEELRPYF
PLPQVLEGLF TLAKRIFGVT ITSADGQAPV WHEDVRYFQV NNELGEAIAY FYLDPYSRPA
EKRGGAWMND CIGRAKIRLD GTFSTRLPVA YLICNQTPPV DGKPSLMTFD EVTTLFHEFG
HGLQHMLTKV DYPGASGINN VEWDAVELPS QFMENWCYDR QTLFNLAKHY ETGETLPEHY
YQKLVDARNY MSGSAMLRQL HFSFLDLELH HRYQPNGNET PSQVRDRIAQ NTTVMKPLPE
DAFLCSFGHI FAGGYAAGYY SYKWAEVLSA DAFAAFEEVG LEDEHAVAKT GQRFRDTVLA
LGGSIHPMEV FKTFRGREPK TEPLLRHSGL LQVA
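
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below uses the standard codon assignments, which translation table 11 shares. Table 11 differs mainly in which codons may act as start codons, and this sketch does not handle alternative starts (the gene here begins with ATG). The names `CODON_TABLE` and `translate` are illustrative, not part of any particular library.

```python
from itertools import product

# Codon assignments in T, C, A, G order; '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate a coding sequence up to the first stop codon.

    Whitespace is ignored, so the 10-base blocks of this record can be
    pasted in directly. Incomplete trailing codons are dropped.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    first_line = ("ATGACCAACG CTACCGTAAC GAAGAATCCC "
                  "TTACTCATTG GCAGAGGACT TCCTCCTTTC")
    print(translate(first_line))         # MTNATVTKNPLLIGRGLPPF
    print(translate("TTACAGGTGGCTTAA"))  # LQVA
```

The two calls translate the first 60 bases and the last 15 bases of the gene sequence. Their output matches the first 20 and last 4 residues of the protein sequence above.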