Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_11721 |
Symbol | |
ID | 4778985 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 1030516 |
End bp | 1032003 |
Gene Length | 1488 bp |
Protein Length | 495 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 640086681 |
Product | hypothetical protein |
Protein accession | YP_001017186 |
Protein GI | 124022879 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1404] Subtilisin-like serine proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.144634 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGCCGT TGACTAAAGA TCTAGCATCT GTGCTTACAG GAGCAGTAAC AAGTACAACA TCAAGCTCAT GGCAAAAAAG AGAACCTTAT TACAACTATC AGTGGAGCTT ATTTAACAAC GGACAATACA ATGGCTCCAG GGTTGATGCT GATATTGATG CCGATGCTGT TTTTTCTAAA ACTCCCTATC CTTCATTCTT GATGGATCCA GCGGCAGTTG GTCAAAATAC AATCGCTATT CTTGATACAG GCGTCAACTG GAATCATGTT GATTTAAACG AACGGCTCAA TATGAATGCA AACTTTATTT CTAGAGAGCC TATTGACGGG ATTGATAATG ATGGCAATGG CTACGTTGAT GACCCATTGG GATGGAACTT TATGAACAAT GGTAATTCAC CATTTGATGA TCAAGGCCAT GGTTCCCACG TGTCTGGATT AGCAGCTGCA TCTGCTAATG CCCAAGGAAT CGTGGGCACA AACCCTAATG CATCCATTAT CCCGGTTAAA GTCTTAGGGC CTAATGGGGG GACAACCCCA GGGGTTGTTG CTGGAATTAA TTATGCAGTT AGCAGAGGAG CAAAAATTAT CAATATGAGC TTAGGCGGCC CTGGATATTC TCAAGCAATG TATGCGGCAA TCGCTAATGC TAATAATGCA GGAGCTTTGG TGGTCGCAGC TGCAGGCAAT GATTTTGTTA ATACCGATTA TAATCCCTCT TATCCAGCAG CATACAACCT ACCTAATATA ATATCTGTAG CTGCCTCTGA TTACACTGAC TGGTTTGCCA ACTTTTCAAA CTATGGTAAA GCAACTGTTG ACTTATTGGC TCCAGGTAAA TTAATGCTAA GTGATTCTCA TATTGGCAGT ACAGGTCTTG TTGAAAAAAG TGGAACTTCG ATGGCTGCTC CAATCGTAGC TGGCGCAGTC AGTTATTTTT GGTCGCGAAA TCCAACCTGG ACGGCATTAC AAGTAAAGGA TCGATTACTT AACATCGGCG TTGACAAGAT CCCAGGTGCA GAGAATTATA CTGTTTCAGG AGGAAGATTG AACATGGCCC ATTTGATGGG CTGGCCTGCA ACAGATGTAT ACACAAGCTC AGGACAAGAT GCTGATACTC TTGCCGGGAT GGATCTAAGC ACTAATGCAG ACCCTGTTAT TAACTATGTT CCATTCATTG ACAGTAAGAA TCTAAGCACA TTCAACGTTG ATGAAGTTAC AGATGCTGTG ATTGGTATTG TTGATGGAGA TGCATTATCA GATCGTATAG CACGTATGAA AGAATTTCTA ACTAGCACAA ATATTGGTGA GGTCTATGCT GATCAATTTG ATGATTTTGA AATCCTAGAT AGCTTAGGCA CCTCGCTAAC AACCATTGAT TTCAAAGACC ACATCTCAAA TGAGGGTAAG CGTAAACTAA TGGGTGAGTT CATCGCGCGC GGGTGGTTTG AAGGCTTTGA AATGAACAGC GAAGTGTCAC TGTTCTAG
|
Protein sequence | MQPLTKDLAS VLTGAVTSTT SSSWQKREPY YNYQWSLFNN GQYNGSRVDA DIDADAVFSK TPYPSFLMDP AAVGQNTIAI LDTGVNWNHV DLNERLNMNA NFISREPIDG IDNDGNGYVD DPLGWNFMNN GNSPFDDQGH GSHVSGLAAA SANAQGIVGT NPNASIIPVK VLGPNGGTTP GVVAGINYAV SRGAKIINMS LGGPGYSQAM YAAIANANNA GALVVAAAGN DFVNTDYNPS YPAAYNLPNI ISVAASDYTD WFANFSNYGK ATVDLLAPGK LMLSDSHIGS TGLVEKSGTS MAAPIVAGAV SYFWSRNPTW TALQVKDRLL NIGVDKIPGA ENYTVSGGRL NMAHLMGWPA TDVYTSSGQD ADTLAGMDLS TNADPVINYV PFIDSKNLST FNVDEVTDAV IGIVDGDALS DRIARMKEFL TSTNIGEVYA DQFDDFEILD SLGTSLTTID FKDHISNEGK RKLMGEFIAR GWFEGFEMNS EVSLF
|
| |