Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cyan8802_0748 |
Symbol | |
ID | 8390055 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8802 |
Kingdom | Bacteria |
Replicon accession | NC_013161 |
Strand | - |
Start bp | 746733 |
End bp | 748667 |
Gene Length | 1935 bp |
Protein Length | 644 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 644978767 |
Product | peptidase S9 prolyl oligopeptidase active site domain protein |
Protein accession | YP_003136522 |
Protein GI | 257058634 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.049006 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAATCCC TAAAAGTTGC CCCCTTCGGT TCTTGGAAAT CTCCCATCAC AGCAGACCTG ATCGTGGCTG AAAGTCTTGG ACTCGGTGCA GTGATCTATG AGGGTGAAGA TATCTATTGG TTAGAAGCAA GACCAACGGA AGGAGGACGC AACGTTCTAA TGAAACGGAC GTTAGATGGC CAAGTCACCG AAATGACCCC CCAACCCTTC AATGTGCGAA CCCGTGTCCA TGAATACGGA GGGGGAGCCT TTTTAATCGT TCAAGGGACT CTTTATTTTA TTAATTTTAG TGACCAACGT CTTTATCAAA AATTACCCAA TCAAGACCCC ACACCCCTGA CCCCAGAAGG AACTTATCGC TACGCTGATC TGATTTTAGA CCCCTTGCGT CATCGTTTAA TTTGCGTCGG TGAAGACCAT AGTCAAGGGG AAAAAGAACC CGAAAATACC CTCGTTAGTA TTGATCTTAA TAGCGGAAAG ATTAATACTC TAGTATCAGG CTGTGACTTT TATTCTAGTC CCCGTCTGAG TCCCGATGGA ACCCAACTCA CTTGGATCAG TTGGAACCAC CCGAATTTAC CCTGGGATGG TAGCCAATTA TGGTTAGCTA CCGTTCAAGC TGATGGTAGC TTAGATCAGG TACGTTTAAT TGCCGGAGGG ACTAACGAGT CCATTTGTGA ACCCAAATGG TCCCCCGATG GTCAATTATA TTTCTCCAGC GATCGCCGAG GATGGTGGAA TCTCTACCGC TACACCCAAA AAGGAGCCAT TGATCCCCTA TTCCCTCTTA ATGCGGAATT TTCTTACCCA CACTGGGTCT TTGGTTTATC TAACTATAGC TTTATCTCAG AATCTCACGT GATCTGTAGT TTTAACCAAG ACGGACAATG GTATCTCGCC AGTTTAGATA CCCAACAAAA ACAGCTAAGC GTCATTGAAA CCCACTATAC CAATATTTCC TCCCTCGATG CCAATGGGCA TAAAATTGTT TTAATTGGGG GTTGTCCCAC AGAACCCACC GCAATTGTTC AATTAAACCT AAAAACAGGG GAAACAACAG TCCTCAAGCA ATCCCATACC TTAAACATTG ATAGCGGTTA TTTGTCAACC CCCGAAATGG TGTCTTTTCC GACCGAAAAT GGCTTAACCG CCTATGCTTG GTATTATCCT CCTAAAAATA AAGACTATAA ACCCCCTAAA GGCGAATTAC CTCCCCTATT AGTCAAAAGT CATGGAGGAC CTACCGCTTG CGCATCCCCT AGCTTTAACC TACGTCTGCA ATATTGGACC AGTCGCGGTT TTGGTTATCT TGATGTCAAT TATGGCGGAA GTACGGGGTT TGGTCGGGAA TATCGCCAAC GCTTAGACGG AAAATGGGGT CTAGTGGATG TAGATGACTG CATCAATGGG GCAAAATATC TGGTAGAACA TGGGTTAGTT GATGGTGATC GCCTAGCCAT TTCTGGAGGA AGTGCAGGAG GGTACACCAC CTTGGCGGCT TTAACCTTTC GGGATACCTT CAAGGCCGGA GCGAGTTACT ATGGAATCAG TGATTTAGAA GCCTTAGCTA AAGATACCCA TAAGTTTGAA TCTCGCTATC TAGAGCGGTT AATTGGCAAA TATCCCGAAG AAAAGGAAAT TTATCAACAG CGATCGCCGA TTCATTTTAC CGAACAACTC ACTTGTCCCG TAATCTTTTT CCAAGGGTTA GAGGATAAAG TGGTTCCTCC GTCCCAAGCG TGTCAGATGG TGGAAATTCT CAAGAAAAAA GGACTCCCTG TCGCTTATGT TCCCTTTGCA GGAGAACAAC ACGGTTTTAG ACGGTCTGAA ACGATTAAAC GCGCATTAGA AGCCGAATTT TATTTTTATT CTCGGATTTT TGGCTTTGAA CCCGCAGATA ACATTGAACC TGTTGAGATT ATTAATTGGG GATAA
|
Protein sequence | MKSLKVAPFG SWKSPITADL IVAESLGLGA VIYEGEDIYW LEARPTEGGR NVLMKRTLDG QVTEMTPQPF NVRTRVHEYG GGAFLIVQGT LYFINFSDQR LYQKLPNQDP TPLTPEGTYR YADLILDPLR HRLICVGEDH SQGEKEPENT LVSIDLNSGK INTLVSGCDF YSSPRLSPDG TQLTWISWNH PNLPWDGSQL WLATVQADGS LDQVRLIAGG TNESICEPKW SPDGQLYFSS DRRGWWNLYR YTQKGAIDPL FPLNAEFSYP HWVFGLSNYS FISESHVICS FNQDGQWYLA SLDTQQKQLS VIETHYTNIS SLDANGHKIV LIGGCPTEPT AIVQLNLKTG ETTVLKQSHT LNIDSGYLST PEMVSFPTEN GLTAYAWYYP PKNKDYKPPK GELPPLLVKS HGGPTACASP SFNLRLQYWT SRGFGYLDVN YGGSTGFGRE YRQRLDGKWG LVDVDDCING AKYLVEHGLV DGDRLAISGG SAGGYTTLAA LTFRDTFKAG ASYYGISDLE ALAKDTHKFE SRYLERLIGK YPEEKEIYQQ RSPIHFTEQL TCPVIFFQGL EDKVVPPSQA CQMVEILKKK GLPVAYVPFA GEQHGFRRSE TIKRALEAEF YFYSRIFGFE PADNIEPVEI INWG
|
| |