Gene PCC8801_4245 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_4245 
Symbol 
ID7105876 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp4455272 
End bp4457338 
Gene Length2067 bp 
Protein Length688 aa 
Translation table11 
GC content36% 
IMG OID643477226 
ProductProlyl oligopeptidase 
Protein accessionYP_002374325 
Protein GI218248954 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1505] Serine proteases of the peptidase family S9A 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCATCTT ATTTTTCTAA ATTTTCCTAT CCTTTGAGTC AACAACAAGA TATTATTGAT 
ATTTATCATG GAATAATTGT TAAAGATCCC TATCGTTGGT TAGAAAATCC TGACTCCGAA
GAAACTCAAA CTTGGATCAA GGCGCAAAAT CAACTAACCT TTGATTATTT AGCCAATATT
TCTGTTAGAG AACCTCTTAA AAAACGGCTA ACTGAACTGT GGAATTATGA AAAGTATGGT
ATTCCTTTTA AAGAAGGCGA TCGCTACTTT TATTTTAAAA ATGATGGACT GCAAAATCAA
AGCGTTTTCT ACACTTTAAA AACCCTTAAA GACGAACCCC AAGTTCTTTT AGATCCTAAT
ACCTTATCCT CAGATGGAAC CGTTGCATTA TCAGGTTTAG CTATCTCAAA AAATGCCCAA
TATTTAGCCT ATGGGTTATC AACATCTGGA TCAGATTGGG TAGAATGGAA AGTCAAAAAT
ATTGAAACTG GAGAAGACTT ATCAGACCAT TTAAAATGGA TTAAATTTTC TGGTGCATCT
TGGACAAATG ATCACCAAGG ATTTTTCTAT AGTCGCTACA ATGAACCCAA TGAAAAAAGC
AAACTCGAAG ACATTAATTA TTACCAAAAA CTCTATTATC ATCGCCTAGG AACAACCCAA
GATCAAGATA TTTTAGTTTA TGATCGTCCC GATCAAAAAG AATGGGGTTT TAATGGCAAC
GTCACCGAAG ATGGACGCTA TTTAATTATT AGTGTTTGGC AAGGAACTGA TCCCAAGAAT
TTGCTTTTTT ATAAAGATTT ACACGATCCT AACGCTGCTG TTATCGAACT GATTAATCAA
TTTGAAGCGA GTTATGGATT TATTGATAAT GAGGGGTCAA CTTTTTGGCT AAGAACAGAC
TTAAACGCGC CTAAAAAACG GATTATTGCC ATTGATATCA ACAACCCAAG TCAAGATAAT
TGGCAAGAAA TTATTCCCGA AACAGAAGAT ACATTAGACG GAGTAGGCAT TTTAAATAAT
CAATTTGTTT GTGATTATCT AAAAAACGCA AAATCTGCTA TTAAAATCTT CGATCTCCAA
GGAACTTTGA TCCGAGAAGT TGACTTACCT GACTTGGGAA TTGTCGGAGG ATTTGAAGGC
AAACGATACG AGACAGAAAC CTTCTACAGT TTTGCTAATT TTACCACACC ATCAACCATT
TATCATTATG ATATGATCAC GGGTAAAAGT ACTCTATTCC GTCAACCCAA TGTTCATTTT
AATCCTCAAG ACTTTGAAAG CAAACAAGTT TTTTATATCA GCAAGGATGG AACTAAAATT
CCTATGTTTA TTACCCATAA AAAAGGGTTA AAATTAGAGG GAAAAAATCC TACTTATTTG
TATGGATATG GTGGGTTTAA TGTTTCTCTA ACTCCAAGCT TTTCCATTAG CAATATTGTC
TGGATGGAAC AGGGAGGAAT TTATGCTGTC CCTAACCTAA GAGGAGGAGG AGAATACGGA
GAAGAATGGC ATCAAGCAGG GATGAAATTA AACAAACAAA CTGTTTTTGA TGACTTTATT
GCTGCGGCAG AATGGTTAAT AAAAAATAAC TATACATCAC CCCAAAAATT AGCTATTGGA
GGGGGAAGTA ATGGGGGTTT ATTAGTGGGA GCTTGCATGA CCCAAAGACC CGATTTATTT
AAGGCTGTCT TGCTATCCGT TGGGGTATTA GATATGCTAA GATTTAATCA ATTTACCATT
GGTTGGGCTT GGTGTCCAGA GTATGGTAGT CCCGAAAATG AAGCAGAGTT TAAAGTACTT
TATGCCTATT CTCCCTTACA TAATGTTAAG CCACAAACCG TCTATCCAGC TACCTTGATC
ATAACAGCAG ACCACGATGA TCGCGTCGTT CCTGCCCATA GTTTTAAATT TGCTGCAGCC
TTACAAACCG CTCATCAAGG CAATAATCCT ATTCTAATTC GAATTGAAAC AAAAGCAGGA
CATGGTGCAG GAAAACCCAC CACAAAAATG ATTGAAGAAA TTGCAGATAA GTGGGCATTT
TTAATCAATA ATTTAAAAGA GGGTTAG
 
Protein sequence
MSSYFSKFSY PLSQQQDIID IYHGIIVKDP YRWLENPDSE ETQTWIKAQN QLTFDYLANI 
SVREPLKKRL TELWNYEKYG IPFKEGDRYF YFKNDGLQNQ SVFYTLKTLK DEPQVLLDPN
TLSSDGTVAL SGLAISKNAQ YLAYGLSTSG SDWVEWKVKN IETGEDLSDH LKWIKFSGAS
WTNDHQGFFY SRYNEPNEKS KLEDINYYQK LYYHRLGTTQ DQDILVYDRP DQKEWGFNGN
VTEDGRYLII SVWQGTDPKN LLFYKDLHDP NAAVIELINQ FEASYGFIDN EGSTFWLRTD
LNAPKKRIIA IDINNPSQDN WQEIIPETED TLDGVGILNN QFVCDYLKNA KSAIKIFDLQ
GTLIREVDLP DLGIVGGFEG KRYETETFYS FANFTTPSTI YHYDMITGKS TLFRQPNVHF
NPQDFESKQV FYISKDGTKI PMFITHKKGL KLEGKNPTYL YGYGGFNVSL TPSFSISNIV
WMEQGGIYAV PNLRGGGEYG EEWHQAGMKL NKQTVFDDFI AAAEWLIKNN YTSPQKLAIG
GGSNGGLLVG ACMTQRPDLF KAVLLSVGVL DMLRFNQFTI GWAWCPEYGS PENEAEFKVL
YAYSPLHNVK PQTVYPATLI ITADHDDRVV PAHSFKFAAA LQTAHQGNNP ILIRIETKAG
HGAGKPTTKM IEEIADKWAF LINNLKEG