Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NATL1_18911 |
Symbol | |
ID | 4779194 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | - |
Start bp | 1553275 |
End bp | 1554420 |
Gene Length | 1146 bp |
Protein Length | 381 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 640085180 |
Product | trypsin-like serine protease |
Protein accession | YP_001015711 |
Protein GI | 124026596 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.58243 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGAGCC TTAGAAAGTT AGAAATTATG CGGCGATTCA TTGGTATATT TTCCCTTTTC TCAATCCTTT TTTGTTTTTT GTTTGAGCCA ATATCTGTAT TCGCGCTAGC AGATTTTCAA GAAAGCGAAT CACACAGTTT TGTTGCCAAT GTAGCTAGCA AAGTTTCACC TTCGGTCGTA AGGATTGATA TTGAAAGAGA GTTTCAAACA GATGAATTTG AATCTGATTT GTTAGATCCC TTACTAAAAG ATCTTTTAGG GGATTTGGGA ACTTTTCCCA AAAAAGAGAG AGGGCAAGGC TCTGGGGTCA TAATCGATAG CTCTGGATTA GTTCTCACCA ATGCTCATGT AGTGGAAAGA GTTGATCGTG TGATAGTTAC TCTTCAAAAT GGAAATCAAG TGGATGGAAC AGTTGTTGGA ACCGATCAAG TTACTGACTT AGCTTTGGTG AAAATTAAAG AATTTCCTGA TTTAGAAAGT GCAAAATTGG GTGATTCAGA AGATATCCAA GTTGGGGACT GGGCAATTGC TCTTGGAACA CCTTATGGCC TTGAAAGCAC TGTCACGCTG GGTATAGTCA GCAGTCTTCA TAGAGACATT AATTCTCTTG GCTTTTCTGA TAAGAGATTG GATTTAATTC AAACTGACGC AGCAATAAAT CCTGGGAATT CCGGCGGACC ATTGATAAAT GCAAATGGAG AAGTTATTGG AATTAATACT TTAGTCAGAT CAGGTCCAGG GGCTGGACTT GGATTCGCAA TCCCAATAAA TCTTGCTTCA AAAGTTACCA ATCAACTGCT CACTAACGGT GAGGTTATTC ATCCTTACTT GGGTGCTCAG TTGGTTTTAT TGAATGAAAG AATAGCTAAA GAACATAATC AAGATCCAAA TGCATTGATT TTTTTGCCTG AAAGGTCAGG AGCCCTAGTT CAATCAGTTA TCCCTCAAAG TCCAGCAGAG GAAGGAGGTT TAAGACGGGG TGATCTTGTA ATTAATGCAG GGGGTAATGC AATTAATGAT CCTAGGTCTT TACTCATGCA AGTTGAAAAT GCTCAAATAG GAAAGCCATT TGAATTGGAA GTGGTTCGAA ATAATAAAGA GATTAATCTT TCTATTAAGC CAGCTGCTTT ACCAGGAATT AGCTAG
|
Protein sequence | MQSLRKLEIM RRFIGIFSLF SILFCFLFEP ISVFALADFQ ESESHSFVAN VASKVSPSVV RIDIEREFQT DEFESDLLDP LLKDLLGDLG TFPKKERGQG SGVIIDSSGL VLTNAHVVER VDRVIVTLQN GNQVDGTVVG TDQVTDLALV KIKEFPDLES AKLGDSEDIQ VGDWAIALGT PYGLESTVTL GIVSSLHRDI NSLGFSDKRL DLIQTDAAIN PGNSGGPLIN ANGEVIGINT LVRSGPGAGL GFAIPINLAS KVTNQLLTNG EVIHPYLGAQ LVLLNERIAK EHNQDPNALI FLPERSGALV QSVIPQSPAE EGGLRRGDLV INAGGNAIND PRSLLMQVEN AQIGKPFELE VVRNNKEINL SIKPAALPGI S
|
| |