Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PMN2A_1456 |
Symbol | |
ID | 3606853 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL2A |
Kingdom | Bacteria |
Replicon accession | NC_007335 |
Strand | - |
Start bp | 136212 |
End bp | 137390 |
Gene Length | 1179 bp |
Protein Length | 392 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 637688332 |
Product | PDZ/DHR/GLGF |
Protein accession | YP_292647 |
Protein GI | 72383292 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCATGGAT TAGATGACAA CAGACTAATA ATAGGATCCC TAAAGTTACG TCAAAAAAAA TATTTTAAAA TTTTTGTTTT AGTTGTTCTT ATTTTTTTGA ACTTCCGATA TGAAACTCCT CTTCATTCAA GTGAAGCTTC ACTGTTATCC CAAGAAAATC ACAATAAGCA ATCTTTCGTA TCAAAAGCGT TAAATATTAG TGGGGATGCA GTGGTCACAA TTGAAACACA ACGTCAGGTT TTATCTTCAA GTGAAGGTGT ATTTCCTCCT GGGATCTTAA ATGATCGATA TTTTGAACGA TTCTTTGGTC TAAGAGGCCT GCAAGTTCCA CGATCTCGAA TTGAAAAAGG GCAAGGAAGT GGAGTGATTT TTTCTAAAGA AGGTCTGGTC TTAACTAATG CTCATGTAAT AGAAAAAACT GATCAATTAA TAGTGGGTTT ATCAGATGGA AGAAGAGTGC TTGGAAATGT TGTTGGAGAA GATTCTTTAA CAGATCTTGC AGTTATTAAA CTCAAAGCAA AAGGTCCTTG GCCAACTGCC CAATTAGGAA ACTCCGATAA TTTAAAAGTT GGTGATTGGG CAATTGCAGT TGGAAATCCT TTTGGACTTG AAAATACGGT TACTCTTGGA ATCATTAGTA ATCTCAATAG AGATGTTGCT CAATTAGGTA TATCCGACAA AAGAATAGAT CTCATTCAAA CTGATGCAGC TATTAATCCA GGTAATTCTG GAGGACCATT ATTAAATTCT GTTGGAGAAG TGATTGGTAT TAATACTCTT GTTCGCTCAG GACCAGGAGC AGGATTAGGT TTCGCAATTC CAATAAATAG AGCTAGAAAA ATCGCCAAAG ATTTAATCAC CAGCGGGAGA GCCAAGCATC CCATGATCGG AGTAACACTT TCAAGCAATA TCAAACAAAA AAGTAATTTT CTTTCCCAAA CAGAAGATGG AGCGATAATT AAATATTTGA TGCCAAATGG TCCGGCCGAA AAAGGTGGAT TAAAAGTAAA TGATCTAATA ATTTCAATCA ACAATGAAAA AATTTCAACT CCAGCAGATG TGGTAAAAAA AATTAATAAA AATAATTTAC AATCAGCATT AAGAATTAAA ATACTTAGAG AGAATATAGA GTCTATAAAA ATTATCAAAC CAGTTGATAT TTATGATCTT CAAGTATAA
|
Protein sequence | MHGLDDNRLI IGSLKLRQKK YFKIFVLVVL IFLNFRYETP LHSSEASLLS QENHNKQSFV SKALNISGDA VVTIETQRQV LSSSEGVFPP GILNDRYFER FFGLRGLQVP RSRIEKGQGS GVIFSKEGLV LTNAHVIEKT DQLIVGLSDG RRVLGNVVGE DSLTDLAVIK LKAKGPWPTA QLGNSDNLKV GDWAIAVGNP FGLENTVTLG IISNLNRDVA QLGISDKRID LIQTDAAINP GNSGGPLLNS VGEVIGINTL VRSGPGAGLG FAIPINRARK IAKDLITSGR AKHPMIGVTL SSNIKQKSNF LSQTEDGAII KYLMPNGPAE KGGLKVNDLI ISINNEKIST PADVVKKINK NNLQSALRIK ILRENIESIK IIKPVDIYDL QV
|
| |