Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_20251 |
Symbol | |
ID | 4777760 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 1783257 |
End bp | 1784978 |
Gene Length | 1722 bp |
Protein Length | 573 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 640087539 |
Product | hypothetical protein |
Protein accession | YP_001018032 |
Protein GI | 124023725 |
COG category | [R] General function prediction only |
COG ID | [COG4783] Putative Zn-dependent protease, contains TPR repeats |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.625545 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCATGC CCTCGTTGCC TGAACAGAGA ACTGCTTCTG ATTTATACGC GCTAGCGGTA GAAAAGTACA AAAGTGAAGA ATATCAAGAA GCAATAGATG CATTCCGTAA ATCACTAGCA CTGCAAGAAC ACTGGAATTC ATACCAAGGT CTTGGATGGG GACTATTTTA TACAAATCAA TGTCAAGAAG CAATAGATGC ATTCCGTAAA TCACTAGCAC TACAAGAAGA CTGGAATTCA TACCAAGGTC TTGGATGTGC ACTCTTGAGA GAAACAGTAT ACGCAGAAGC AATAGATGCA TTCCGTAAAT CACTAGCACT ACAAGAAGAC TGGAATTCAT ACCAAGGTCT TGGATGGGCA TTCTTTAGAG CAAACGTATA CACACAAGCA ATAGATGCAT TCCGTAAATC ACTTGCACTG CATGAACATT GGAATGTATA CTTAGGTTTA GGACGATCAC TATTCAAGAC AAACCAATAT CAAGAGGCAA TCGAGGCATT CAGAAAAGCG CTTGCACTTA ATAATTTAAA CTCCAATGAA CTAACTGCTG AACTCCACAG AGAACTTGCC GATGCATATG AAGGTGCAGG CAAATCTGAT GCTTCAATTG CTTCTTGGGA GGTCTACTTA TCTTATCTAG AACCCATCTC ATCCCTTGAT CCATTCCTTG GAAATAGAGT TATTTACGAG CAAGTGGATC ATGAGCAGAT AGAAAGAATC AAAAGTACAT GTGCTTCTAT TGGACTCGAC TTTAATCCCT CCCTAAAAGG GGATAATGAT GCTTCAATCG AATCATGGAA ATATCTTATG TACTTGCATA TACCTAAATG CGGGGGGACT TCATTTGAGA CACCCTTGTA TTTACTTAAA GAGCACTTAA AAGATAAGTC ATGTGATTTG CCTAAAGTTA ATAGAACTAA CGATTATCTT GCAATAAGCA GATTGGCTTC AAATCATTCG ATTGCAGCAT TCACAAATTT GATGTCATCC AATTCTTGTA ATGGTTTAAA GAACGCGTTT CTTGGCCTTC ACGGTGCCAA ATGGAGTGCT TTGCATGATT ACATAGGCGA ATTAACCAAT GCTTGTCCTA GAATTATTAC TACGGTACGT GACCCTCGTC AAAGGTTGTT ATCACACATC AAGCATCAAG CGTTTCAATA TTGCACCTCA ATCGACGACC TTCTTACACT TGTAGATAAT CAAAACAGTA TTTTCAATAA TTTAATGCAT AGACAAATCT TTGATTATGG ACTAGACGGC GACAATCCCT GCGGAAACTC TGAACTTGGT AGCGAAAGAT TAGACTTGCT CCAAGACATG GATTTTATTG ATATATCAGA CTCCACTACA AACTCAAAAG TCAAGTCTTC TTTTTTGAGC GCATCTTTAT TCCCTAATAT TGTTCAAACT TCAAGATTTA ATGATTCCAA GGAACGTGAA GAGATGTATG GTTTCAAGAT AAGTGGCAAC GACATTCAAT ATATTTTCAA GCATTGTGTG GACAAAGGTT TTCTGGAGAA GGATCAGTCT ATTGACTATG ATTTTTTAAA AAATAGAACC CTTGAAAGAT TGCATTTCCC TTCATTCATG GAGGCGCATA CCTGTTATAT TCACCCCTTG ACATTTGTCA TTTTTGGTAT GAACAGATAT TCTATTGTTA CCACTAAAAA GTTTCTAGAT AATCCTCACC ATCTGCTTCA GGAACTCAAT CAATCGCTTT AA
|
Protein sequence | MSMPSLPEQR TASDLYALAV EKYKSEEYQE AIDAFRKSLA LQEHWNSYQG LGWGLFYTNQ CQEAIDAFRK SLALQEDWNS YQGLGCALLR ETVYAEAIDA FRKSLALQED WNSYQGLGWA FFRANVYTQA IDAFRKSLAL HEHWNVYLGL GRSLFKTNQY QEAIEAFRKA LALNNLNSNE LTAELHRELA DAYEGAGKSD ASIASWEVYL SYLEPISSLD PFLGNRVIYE QVDHEQIERI KSTCASIGLD FNPSLKGDND ASIESWKYLM YLHIPKCGGT SFETPLYLLK EHLKDKSCDL PKVNRTNDYL AISRLASNHS IAAFTNLMSS NSCNGLKNAF LGLHGAKWSA LHDYIGELTN ACPRIITTVR DPRQRLLSHI KHQAFQYCTS IDDLLTLVDN QNSIFNNLMH RQIFDYGLDG DNPCGNSELG SERLDLLQDM DFIDISDSTT NSKVKSSFLS ASLFPNIVQT SRFNDSKERE EMYGFKISGN DIQYIFKHCV DKGFLEKDQS IDYDFLKNRT LERLHFPSFM EAHTCYIHPL TFVIFGMNRY SIVTTKKFLD NPHHLLQELN QSL
|
| |