Gene Information |
Locus tag | PMN2A_1021 |
Symbol | |
ID | 3606407 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL2A |
Kingdom | Bacteria |
Replicon accession | NC_007335 |
Strand | - |
Start bp | 1516665 |
End bp | 1517810 |
Gene Length | 1146 bp |
Protein Length | 381 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 637687890 |
Product | trypsin-like serine protease |
Protein accession | YP_292214 |
Protein GI | 72382859 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
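
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive (which is consistent with the listed Gene Length), and that the CDS ends with a single stop codon that is not counted in Protein Length. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS that ends with one stop codon (an assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values copied from the record for PMN2A_1021.
length_bp = gene_length(1516665, 1517810)
print(length_bp, protein_length(length_bp))  # prints "1146 381"
```

Both results agree with the Gene Length (1146 bp) and Protein Length (381 aa) fields listed above.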
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.75122 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGAGTC TCAGAAAGTT AGAAATTATG CGGCGATTCA TTAGTATATT TTCCCTTTTC TCAATCCTTT TTTGTTTTTT ATTTGAGCCA ATATCTGTAT TCGCGTTAGC AGATTTTCAA GAAAGCGAAT CACATAGTTT TGTTGCCAAT GTAGCTAGCA AAGTTTCACC TTCGGTCGTA AGGATTGATA TTGAAAGAGA GTTTCAAACA GATGAATTTG AATCTGATTT GTTAGATCCC TTACTAAAAG ATCTTTTAGG GGATTTGGGA ACTTTTCCCA AAAAAGAGAG AGGGCAAGGC TCTGGGGTCA TAATCGATAG CTCTGGATTA GTTCTCACCA ATGCTCATGT AGTGGAAAGA GTTGATCGTG TGATAGTTAC TCTTCAAAAT GGAAATCAAG TGGATGGAAC AGTTGTTGGA ACCGATCAAG TTACTGACTT AGCTTTGGTG AAAATTAAAG AATTTCCTGA TTTAGAAAGT GCAAAATTGG GTGATTCAGA AGATATCCAA GTTGGGGACT GGGCAATTGC TCTTGGAACA CCTTATGGCC TTGAAAGCAC TGTCACGCTC GGTATAGTCA GCAGTCTTCA TAGAGACATT AATTCTCTTG GCTTTTCTGA TAAGAGATTG GATTTAATTC AAACTGACGC AGCAATAAAT CCTGGGAATT CCGGAGGACC ATTGATAAAT GCAAATGGAG AAGTTATTGG AATTAATACT TTGGTCAGAT CAGGTCCAGG GGCTGGACTT GGATTCGCAA TTCCAATCAA TCTTGCTTCA AAAGTTACCA ATCAACTGCT CACTAACGGT GAGGTTATTC ATCCTTACTT GGGTGCTCAG TTGGTTTTAT TGAATGAAAG AATAGCTAAA GAACATAATC AAGATCCAAA TGCATTGATT TTTTTACCTG AACGATCAGG AGCACTTGTG CAGTCTGTTA TCCCTCAAAG TCCAGCAGAG GAAGGAGGTC TAAGACGGGG TGATCTTGTA ATTAATGCAG GGGGTAATGC AATTAATGAT CCTAGGTCTT TACTCATGCA AGTTGAAAAT GCTCAAATAG GAAAGCCATT TGAATTGGAA GTGGTTCGAA ATAATAAAGA GATTAATCTT TCTATTAAGC CAGCTGCTTT ACCAGGAATT AGCTAG
|
Protein sequence | MQSLRKLEIM RRFISIFSLF SILFCFLFEP ISVFALADFQ ESESHSFVAN VASKVSPSVV RIDIEREFQT DEFESDLLDP LLKDLLGDLG TFPKKERGQG SGVIIDSSGL VLTNAHVVER VDRVIVTLQN GNQVDGTVVG TDQVTDLALV KIKEFPDLES AKLGDSEDIQ VGDWAIALGT PYGLESTVTL GIVSSLHRDI NSLGFSDKRL DLIQTDAAIN PGNSGGPLIN ANGEVIGINT LVRSGPGAGL GFAIPINLAS KVTNQLLTNG EVIHPYLGAQ LVLLNERIAK EHNQDPNALI FLPERSGALV QSVIPQSPAE EGGLRRGDLV INAGGNAIND PRSLLMQVEN AQIGKPFELE VVRNNKEINL SIKPAALPGI S
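
The sequences above are displayed in space-separated blocks of ten characters. The following is a minimal sketch, under stated assumptions, of how such a display string might be joined, how a GC fraction could be computed, and how the start of the gene sequence translates to the start of the protein sequence. The helper names are hypothetical. The codon table is the standard genetic code; translation table 11 uses the same codon-to-amino-acid assignments and differs in which start codons are permitted, so alternative start codons are not handled here. Only the first three blocks of the gene sequence are used as input.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order ('*' marks a stop codon).
# Table 11 shares these assignments with the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def join_blocks(grouped: str) -> str:
    """Remove the whitespace between display blocks and upper-case the result."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(cds: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three display blocks of the gene sequence in this record.
prefix = join_blocks("ATGCAGAGTC TCAGAAAGTT AGAAATTATG")
print(len(prefix))         # 30
print(translate(prefix))   # MQSLRKLEIM
```

The translated prefix matches the first block of the protein sequence shown above. Running `gc_fraction` on the full joined gene sequence would be one way to compare against the GC content field.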