Gene Information |
Locus tag | P9303_28641 |
Symbol | |
ID | 4778321 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 2533073 |
End bp | 2534818 |
Gene Length | 1746 bp |
Protein Length | 581 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 640088387 |
Product | hypothetical protein |
Protein accession | YP_001018859 |
Protein GI | 124024552 |
COG category | [N] Cell motility [O] Posttranslational modification, protein turnover, chaperones [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain [COG3063] Tfp pilus assembly protein PilF |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAGCTA AGGGTGCTCT TACGGCTGCT GCCCTGTCTT TGCTGCCACT AGGGCAACCA CTGCTACTAG GCACTGCTGG CATCACCACA GCAACCACCG CAGTCCTTCT TCAAGCGCTA GCAGCAGTTG CTCAAGATGC TTCTGCTGTT GCCAAGGTCG CCAAGGCAAT CACTGTTCGT ATAGAAGGAG CAACGCAAGG ATCTGGCGTT CTCGTCAAAA AAGACGGCAA TCGCTACACA GTTCTCACAG CATGGCATGT GGTCAGCAGC AATAGACCTG GAGAAGAGGT TGGGATCTAT ACCTCCGATG GCCAGGATCA TCAACTGAAG CAAGGCAGTA TCCAACGTTT AGGTGAGATT GATATGGCAG TCCTTACCTT CTCCAGTTCT GGAAATTATG AGGTGGCCTC AATTGGAGAT GCAAAAACAG TTCAATACGA TGATCCGATC TACGTCGCTG GATTCCCTCT AGCTAATTCA CAAAACCTTC GTTATGAGAC TGGAGATGTT GTTGCCAACG CAGAAGTAGG CATTGATCAG GGCTATCAAC TGCTGTATGA CAACAAGACA GCCGCTGGAA TGAGTGGTGG TGTCCTCCTC AATGCTGATG GAGAGTTGAT TGGTCTTCAC GGGAGGGGTG AAAAAAATGA ATATGCTTCC AATGGGAATG AAGTCTCAAT GAAGACTGGT GTCAACCAAG GTGTACCGAT TAGTTATTAC AAGCTTTTCC TTAGTGGATC GCCAGTTGTT GTTGCAAACA ACACTGCTGC AAATGCTGAT GACTACTATG CACAAGTGCT TGCCTCGGCC AATAAGAAAG GAAGAGAGCA GACTATGGTC CGCCTAGCAG ATCAGGCATT GAAATTACGC AAAACGGGCT TTGCATACAT CATGCGTGCG TATGCGAAGA ATGATTTGGG TGATTACCAA GGAGCAATTG ATGATCAAAA TAATGCCCTC GAGATTAATC CTGATAATGC AGTCGCTTAC GTCAATCGTG GATTAGCTAG GAGTAATATG GGTGATCCTA AAAGTGCCCT TTCTGATTTT AGCAAGGCAA TAAAGATAGA CCCTGCCAAT GCGATGGCAT TCAGTAATCG GGGTGTTTCT AAGCAGGCGC TAGGAGATCC TCAAGGGGCG CTAGATGATT ACAATAAGGC GATAAAGATT GATCCTCGCA ATGCAAATGC CTATGCTAAT CGCGGTGTTA ACAAGGGCGA TTTAGGAGAT TATCAAGGAG CAATTGCTGA TTACAGCAAG GCAATTGGAA TCAATCCGCA GCATTCTGAT GCATACTACA ACCGTGGTAT TGCAAAGCTT GAATCCAAGG ATTATCAAGG AGCAATTGCT GATTACAATA AGGCAATAAG GATTGGCACG CAGAATGCGA GGATCTATCT TAATCGTGGT CTTGTCTACG ATAATTTAGG CGATTACCAG CGTGCAATTG CTGATTACAA TAAGGCAATA GAGCTTGATC CGCAGTATGC TCTTGCCTAC GTGAACCGTG GTCTTGCCAA GATTAAATCA GGAGATATTC AAGGAGCAAT TGCTGATTCC AATAAGGCAA TAGAACTTGA TCCGCGTATG GCAAAAGCCT ATGCCAATCG TGGCGCAGCA AAAGGCATGC TAGATGATGC TAAAGGAGGT TGTGCAGATT TCAAAAAAGC AGCATCACTT GGTTCTCAAC TAGCGGCTCA ATGGTTAAAC CGCGCAGATG CTGCCTGGTG TCGTAATATG CGATGA
|
Protein sequence | MKAKGALTAA ALSLLPLGQP LLLGTAGITT ATTAVLLQAL AAVAQDASAV AKVAKAITVR IEGATQGSGV LVKKDGNRYT VLTAWHVVSS NRPGEEVGIY TSDGQDHQLK QGSIQRLGEI DMAVLTFSSS GNYEVASIGD AKTVQYDDPI YVAGFPLANS QNLRYETGDV VANAEVGIDQ GYQLLYDNKT AAGMSGGVLL NADGELIGLH GRGEKNEYAS NGNEVSMKTG VNQGVPISYY KLFLSGSPVV VANNTAANAD DYYAQVLASA NKKGREQTMV RLADQALKLR KTGFAYIMRA YAKNDLGDYQ GAIDDQNNAL EINPDNAVAY VNRGLARSNM GDPKSALSDF SKAIKIDPAN AMAFSNRGVS KQALGDPQGA LDDYNKAIKI DPRNANAYAN RGVNKGDLGD YQGAIADYSK AIGINPQHSD AYYNRGIAKL ESKDYQGAIA DYNKAIRIGT QNARIYLNRG LVYDNLGDYQ RAIADYNKAI ELDPQYALAY VNRGLAKIKS GDIQGAIADS NKAIELDPRM AKAYANRGAA KGMLDDAKGG CADFKKAASL GSQLAAQWLN RADAAWCRNM R
|
| |
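The length fields in this record can be cross-checked against the coordinates. The following is a minimal sketch, not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the gene sequence includes the stop codon (the listed sequence ends in TGA). All function names are hypothetical helpers introduced for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def clean_sequence(grouped: str) -> str:
    """Drop the whitespace that splits the sequence into 10-letter blocks."""
    return "".join(grouped.split()).upper()


def gc_percent(grouped: str) -> float:
    """Percentage of G and C bases in a (possibly space-grouped) sequence."""
    seq = clean_sequence(grouped)
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(2533073, 2534818)
print(length)                           # 1746, matches "Gene Length"
print(expected_protein_length(length))  # 581, matches "Protein Length"
```

Passing the full "Gene sequence" field to `gc_percent` gives a value to compare with the reported GC content.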