Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NATL1_01581 |
Symbol | |
ID | 4781164 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | - |
Start bp | 154034 |
End bp | 155212 |
Gene Length | 1179 bp |
Protein Length | 392 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 640083422 |
Product | serine protease |
Protein accession | YP_001013987 |
Protein GI | 124024871 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCATGGAT TAGATGACAA CAGACTATTA ATAGGATCCC TAAAGTTACG TCAAAAAAAA TATTTTAAAA TTTTTGTTTT AGTTGTTCTT ATTTTTTTTA ACTTCCGATA TGAAACTCCT CTTCATTCAA GTGAAGCTTC ACTGTTATCC CAAGAAAATC ACAATAAGCA ATCTTTCGTA TCAAAAGCAT TAAATATTAG TGGGGATGCA GTGGTCACAA TTGAAACACA ACGTCAGGTT TTATCTTCAA GTGAAGGTGT ATTTCCTCCT GGGATCTTAA ATGATCGATA TTTTGAACGA TTCTTTGGTC TAAGAGGCCT GCAAGTTCCA CGATCTCGAA TTGAAAAAGG CCAAGGAAGT GGAGTGATTT TTTCTAAAGA AGGTCTGGTC TTGACTAATG CTCATGTAAT AGAAAAAACT GATCAATTAA TAGTGGGTTT ATCAGATGGA AGAAGAGTGC TTGGAAATGT TGTTGGAGAA GATTCTTTAA CAGATCTTGC AGTTATAAAA CTCCAAGCAA AAGGTCCTTG GCCAACTGCC CAATTAGGAA ACTCCGATAA TTTAAAAGTT GGTGATTGGG CAATTGCAGT TGGAAATCCT TTTGGACTTG AAAATACGGT TACTCTTGGA ATCATTAGTA ATCTCAATAG AGATGTTGCT CAATTAGGTA TATCCGACAA AAGAATAGAT CTCATTCAAA CTGATGCAGC TATTAATCCA GGTAATTCTG GAGGACCATT ATTAAATTCT GTTGGAGAAG TGATTGGTAT TAATACTCTT GTTCGCTCAG GACCAGGAGC AGGATTAGGT TTCGCAATTC CAATAAATAG AGCTAGAAAA ATCGCCAAAG ATTTAATCAC CAGCGGGAGA GCCAAGCATC CCATGATCGG AGTAACACTT TCAAGCAATA TCAAACAAAA AAGTAATTTT CTTTCCCAAA CAGAAGATGG AGCGATAATT AAATATTTAA TGCCAAATGG TCCGGCCGAA AAAGGTGGAT TAAAAGTAAA TGATCTAATA ATTTCAATCA ACAATGAAAA AATTTCAACT CCAGCAGATG TGGTAAAAAA AATTAATAAA AATAATTTAC AATCAGCATT AAAAATTAAA ATACTTAGAG GGAATATAGA GTCTATAAAA ATCATCAAAC CAGTTGATGT TTATGATCTT CAAGTATAA
|
Protein sequence | MHGLDDNRLL IGSLKLRQKK YFKIFVLVVL IFFNFRYETP LHSSEASLLS QENHNKQSFV SKALNISGDA VVTIETQRQV LSSSEGVFPP GILNDRYFER FFGLRGLQVP RSRIEKGQGS GVIFSKEGLV LTNAHVIEKT DQLIVGLSDG RRVLGNVVGE DSLTDLAVIK LQAKGPWPTA QLGNSDNLKV GDWAIAVGNP FGLENTVTLG IISNLNRDVA QLGISDKRID LIQTDAAINP GNSGGPLLNS VGEVIGINTL VRSGPGAGLG FAIPINRARK IAKDLITSGR AKHPMIGVTL SSNIKQKSNF LSQTEDGAII KYLMPNGPAE KGGLKVNDLI ISINNEKIST PADVVKKINK NNLQSALKIK ILRGNIESIK IIKPVDVYDL QV
|
| |