Gene Information |
Locus tag | NATL1_04161 |
Symbol | |
ID | 4779322 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | - |
Start bp | 381145 |
End bp | 382518 |
Gene Length | 1374 bp |
Protein Length | 457 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 640083686 |
Product | carboxyl-terminal protease |
Protein accession | YP_001014245 |
Protein GI | 124025129 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0793] Periplasmic protease |
TIGRFAM ID | [TIGR00225] C-terminal peptidase (prc) |
| |
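The gene and protein lengths above follow from the start and end coordinates. The sketch below shows that arithmetic. It assumes 1-based inclusive coordinates, an unspliced CDS, and a gene length that includes the stop codon; these assumptions are consistent with the values in this record but are not stated by it. The function names `gene_length` and `protein_length` are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the stop codon is
    counted in the gene length and is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Coordinates taken from the record above (NC_008819, minus strand).
length_bp = gene_length(381145, 382518)
print(length_bp)                  # 1374
print(protein_length(length_bp))  # 457
```

The strand does not affect either length; it only determines which end of the interval holds the start codon.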
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.63143 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCACTGTA CTTTCAAATA TCTTACTCGA TGTGCCCCTT GGTCGCCGCA AAAGATCAAT CCAATGCTCT CAACTGTTAA ATCTTTAACT AAGCCACTGC AAAGACTTTT TGCAGTATTA ATTAGCTTTG GGATTTTATT TCAAGTCTTT ACTCAGCCCG CTTTTGCCCT AAATGATGGA CAATTGCTTG TTATTGAGGC ATGGAATCAA GTAAATGCAG GTTATTTAGA TCCTAAGAAA TTTGATGAAA TTCAATGGAA AAAACTTCGT CAAAAAGCTC TTGAAAAGCC AATAAACAAT TCTCAACAAG CCTATTCAGC TATTGAAGCA ATGCTTCTTC CTCTTGGAGA TCCATATACG CGCTTATTAA GACCAGTTGA TTACGAGGCG ATGAAGAAAA GCAATATAGG TAGTGAAATT AATGGAGTTG GACTTCAATT AGGAGCAAGA AAAGAAGATG GAGACATAGT CGTAATATCT CCTCTTGAAG GCTCTCCAGC ATCGGATGCA GGAATTACAA GTGGAACGAT TATAAAAAAA GTCAACGGTC AATCCCCCAA GCAATTAGGC CTAGAAGCAA CAGCAGCAAA ATTGAGAGGC CAAACTGGAA CACAAGTAAT CGTTGAATTA GAGCAGCCTG ACAATGAAAT AAAAGAAATT TCTTTAGAAA GAAGAAGTGT TGATTTAAGA CCAGTTAGGA CTAAGAGAAT AAGGAACGAA TCACATACAT TTGGTTATTT AAGAATTACA CAATTTAGTG AGGGAGTCCC TGAGCAAGTC AAGGAAGCAT TAGAAGAACT TTCTGGGAAA GAAATAGATG GACTAATTTT GGACCTAAGA AATAACTCTG GAGGTCTCGT AAGTTCTGGA CTCGCAGTTG CTGACGATTT CCTCAGCAAC ATGCCGATTG TTGAAACAAA AAAAAGAGAT TCCATAAATG ATCCCATCAG TTCTGGACTC GAAACTATTT ATGATGGGCC AATGGTTACT CTTGTAAATG AGGGGACAGC AAGTGCTAGT GAAATTCTTG CTGGTGCTCT TCAGGATAAT AAAAGATCTG AACTCATAGG TAATAAAACT TTTGGGAAAG GCCTCATACA ATCTTTGACC AATCTTAGTG ATGGCAGTGG TTTAGCTGTA ACAGTTGCTA GTTATCTAAC TCCTAGTGGA AGAGACATAC AAAATCTTGG AATTGAGCCT GATCGCCTCT TGGAAATGCC AGAGCCACTT AATCCAGGTT CTGATGACGA TAGATGGCTT TTAGATGCAG AGCTGATCAT GCAAGCCACC TTGGACAAAG AAGAAGTCTC AGAAAAATTA TCAAAAGAGA CTTTGGTAAA AGAAGAACAA TTAATAAGCC AAGAAATTGA ATAA
|
Protein sequence | MHCTFKYLTR CAPWSPQKIN PMLSTVKSLT KPLQRLFAVL ISFGILFQVF TQPAFALNDG QLLVIEAWNQ VNAGYLDPKK FDEIQWKKLR QKALEKPINN SQQAYSAIEA MLLPLGDPYT RLLRPVDYEA MKKSNIGSEI NGVGLQLGAR KEDGDIVVIS PLEGSPASDA GITSGTIIKK VNGQSPKQLG LEATAAKLRG QTGTQVIVEL EQPDNEIKEI SLERRSVDLR PVRTKRIRNE SHTFGYLRIT QFSEGVPEQV KEALEELSGK EIDGLILDLR NNSGGLVSSG LAVADDFLSN MPIVETKKRD SINDPISSGL ETIYDGPMVT LVNEGTASAS EILAGALQDN KRSELIGNKT FGKGLIQSLT NLSDGSGLAV TVASYLTPSG RDIQNLGIEP DRLLEMPEPL NPGSDDDRWL LDAELIMQAT LDKEEVSEKL SKETLVKEEQ LISQEIE
|
| |
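The protein sequence can be cross-checked against the gene sequence by translating it, and the GC content can be recomputed from the same string. The following is a minimal sketch under these assumptions: the gene sequence is listed in coding orientation (it begins with ATG even though the gene lies on the minus strand), translation table 11 assigns the same amino acids to internal codons as the standard code, and the spaces between 10-base blocks are formatting only. The helper names `clean`, `translate`, and `gc_content` are hypothetical, not part of any database tool.

```python
from itertools import product

# Codon table in the conventional TCAG ordering. For internal codons,
# translation table 11 matches the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove whitespace between sequence blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three 10-base blocks of the gene sequence in this record.
prefix = "ATGCACTGTA CTTTCAAATA TCTTACTCGA"
print(translate(prefix))  # MHCTFKYLTR
```

Passing the full gene sequence to `translate` and `gc_content` in the same way allows a comparison with the 457 aa protein and the 38% GC content reported in the record.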