Gene Cyan8802_4307 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCyan8802_4307 
Symbol 
ID8393659 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8802 
KingdomBacteria 
Replicon accessionNC_013161 
Strand
Start bp4446734 
End bp4448800 
Gene Length2067 bp 
Protein Length688 aa 
Translation table11 
GC content36% 
IMG OID644982217 
ProductProlyl oligopeptidase 
Protein accessionYP_003139928 
Protein GI257062040 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1505] Serine proteases of the peptidase family S9A 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.17426 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.0613524 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCATCTT ATTTTTCTAA ATTTTCCTAT CCTTTGAGTC AACAACAAGA TATTATTGAT 
ATTTATCATG GAATAACTGT TAAAGATCCC TATCGTTGGT TAGAAAATCC TGACTCCGAA
GAAACTCAAA CTTGGATCAA GGCACAAAAT CAACTAACTT TTGATTATTT AGCCAATATT
TCTGTTAGAG AACCTCTTAA AAAACGGCTA ACTGAACTGT GGAATTATGA AAAGTATGGT
ATTCCTTTTA AAGAAGGCGA TCGCTACTTT TATTTTAAAA ATGATGGACT GCAAAATCAA
AGCGTTTTCT ACACTTTAAA AACCCTTAAA GACGAACCCC AAGTTCTTTT AGATCCTAAT
ACCTTATCCT CAGATGGAAC CGTTGCATTA TCAGGTTTAG CTATCTCAAA AAATGCCCAA
TATTTAGCCT ATGGGTTATC AACATCTGGA TCAGATTGGG TAGAATGGAA AGTCAAAAAT
ATTGAAACTG GAGAAGACTT ATCAGACCAT TTAAAATGGA TTAAATTTTC TGGTGCATCT
TGGACAAATG ATCACCAAGG ATTTTTCTAT AGTCGCTACA ATGAACCCAA TGAAAAAAGC
AAACTCGAAG ACATTAATTA TTACCAAAAA CTCTATTATC ATCGCCTAGG AACAACCCAA
GATCAAGATA TTTTAGTTTA TGATCGTCCC GATCAAAAAG AATGGGGTTT TAATGGCAAC
GTCACCGAAG ATGGACGCTA TTTAATTATT AGTGTTTGGC AAGGAACTGA TCCCAAGAAT
TTGCTTTTTT ATAAAGATTT ACACGATCCT AACGCTGCTG TTATCGAACT GATTAATCAA
TTTGAAGCGA GTTATGGATT TATTGATAAT GAGGGGTCAA CTTTTTGGCT AAGAACAGAC
TTAAACGCGC CTAAAAAGCG GATTATTGCC ATTGATATCA ACAACCCAAG TCAAGATAAT
TGGCAAGAAA TTATTCCCGA AACAGAAGAT ACATTAGACG GAGTAGGCAT TTTAAATAAT
CAATTTGTTT GTGATTATCT AAAAAACGCA AAATCTGCTA TTAAAATCTT CGATCTCCAA
GGAACTTTGA TCCGAGAAGT TGACTTACCT GACTTGGGAA TTGTCGGAGG ATTTGAAGGC
AAACGATACG AGACAGAAAC CTTCTACAGT TTTGCTAATT TTACCACACC ATCAACCATT
TATCATTATG ATATGATCAC GGGTAAAAGT ACTCTATTCC GTCAACCCAA TGTTCATTTT
AATCCTCAAG ACTTTGAAAG CAAACAAGTT TTTTATATCA GCAAGGATGG AACTAAAATT
CCTATGTTTA TTACCCATAA AAAAGGGTTA AAATTAGAGG GAAAAAATCC TACTTATTTG
TATGGATATG GTGGGTTTAA TGTTTCTCTA ACTCCTAGCT TTTCCATTAG CAATATTGTC
TGGATGGAAC AGGGAGGAAT TTATGCTGTC CCTAACCTAA GAGGAGGAGG AGAATACGGA
GAAGAATGGC ATCAAGCAGG GATGAAATTA AACAAACAAA CTGTTTTTGA TGACTTTATT
GCTGCGGCAG AATGGTTAAT AAAAAATAAC TATACATCAC CCCAAAAATT AGCTATTGGA
GGGGGAAGTA ATGGGGGTTT ATTAGTGGGA GCTTGCATGA CCCAAAGACC GGATTTATTT
AAGGCTGTCT TGCTATCCGT TGGGGTATTA GATATGCTAA GATTTAATCA ATTTACCATT
GGTTGGGCTT GGTGTCCAGA GTATGGTAGT CCCGAAAATG AAGCAGAGTT TAAAGTACTT
TATGCCTATT CTCCCTTACA TAATGTTAAG CCACAAACCG TCTATCCAGC TACCTTGATC
ATAACAGCAG ACCACGATGA TCGCGTCGTT CCTGCCCATA GTTTTAAATT TGCTGCAGCC
TTACAAACCG CTCATCAAGG CAATAATCCT ATTCTAATTC GAATTGAAAC AAAAGCAGGA
CATGGTGCAG GAAAACCCAC CACAAAAATG ATTGAAGAAA TTGCAGATAA GTGGGCATTT
TTAATCAATA ATTTAAAAGA GGGTTAG
 
Protein sequence
MSSYFSKFSY PLSQQQDIID IYHGITVKDP YRWLENPDSE ETQTWIKAQN QLTFDYLANI 
SVREPLKKRL TELWNYEKYG IPFKEGDRYF YFKNDGLQNQ SVFYTLKTLK DEPQVLLDPN
TLSSDGTVAL SGLAISKNAQ YLAYGLSTSG SDWVEWKVKN IETGEDLSDH LKWIKFSGAS
WTNDHQGFFY SRYNEPNEKS KLEDINYYQK LYYHRLGTTQ DQDILVYDRP DQKEWGFNGN
VTEDGRYLII SVWQGTDPKN LLFYKDLHDP NAAVIELINQ FEASYGFIDN EGSTFWLRTD
LNAPKKRIIA IDINNPSQDN WQEIIPETED TLDGVGILNN QFVCDYLKNA KSAIKIFDLQ
GTLIREVDLP DLGIVGGFEG KRYETETFYS FANFTTPSTI YHYDMITGKS TLFRQPNVHF
NPQDFESKQV FYISKDGTKI PMFITHKKGL KLEGKNPTYL YGYGGFNVSL TPSFSISNIV
WMEQGGIYAV PNLRGGGEYG EEWHQAGMKL NKQTVFDDFI AAAEWLIKNN YTSPQKLAIG
GGSNGGLLVG ACMTQRPDLF KAVLLSVGVL DMLRFNQFTI GWAWCPEYGS PENEAEFKVL
YAYSPLHNVK PQTVYPATLI ITADHDDRVV PAHSFKFAAA LQTAHQGNNP ILIRIETKAG
HGAGKPTTKM IEEIADKWAF LINNLKEG