Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NATL1_02051 |
Symbol | sms |
ID | 4780728 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | - |
Start bp | 190824 |
End bp | 192203 |
Gene Length | 1380 bp |
Protein Length | 459 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 640083470 |
Product | DNA repair protein RadA |
Protein accession | YP_001014034 |
Protein GI | 124024918 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1066] Predicted ATP-dependent serine protease |
TIGRFAM ID | [TIGR00416] DNA repair protein RadA |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.169271 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.619218 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCTCGTT CTGTCTCTAT TTATGTCTGT CAAAGTTGCG GTGCTCAAAC TAGGCAATTC TTTGGTCGCT GCAATAATTG TGGAGAATGG AACTCAATAA TAGAAGAAAA AATTAATCAA AAATCAGATA AATCTATTTA CACAAAGATT AATTCTCCTA AGGAAAAATC TCCCTATCGT TCAGAACTAA TAAGCCAAAC AAAAAACCAA CTTATTGAAC GCATTTCGAG TGGATATAAG GAATTAGACA GGGTACTTGG AGGTGGCTTA GTGCCTGGAT CACTTGTATT AATTGGGGGA GACCCAGGTA TTGGGAAAAG CACTCTTATT TTGCAAAGTG CGACTGAAAT GGCTCATCAC AGATCAGTCC TTTATGTGGC TGCTGAAGAA TCTGCTCAAC AAGTAAAACT TAGATGGAAT CGAATTGAGG ATGCTGAGTC CAATCTTCAT TTACTCGCAG AAACAGATCT AGAGATGGTT ATAAAAGAGC TTGATCATTT GAAACCTGAG GTTGCAGTGA TTGATAGTAT TCAAGCTTTA CATGATCAAA ATTTATCAAG TTCACCCGGC TCAGTAGCTC AAGTTAGAGA ATGTTCAGCA GCTTTACAGC AAATTGCTAA ACGACAAAAC ATATCCCTTT TGATCATCGG GCACGTTACA AAGGATGGAA TGTTAGCCGG ACCCAAAGTT CTTGAGCATC TTGTGGATGC AGTACTTACT TTTGAGGGCG ATCGATTCGC TTCTCACAGA CTACTAAGAG GTGTGAAAAA TCGCTTTGGT GCCACTTCTG AGCTTGGAGT CTTTGAAATG CAGGCAGATG GATTATCGGA GGTTCCTAAT CCAAGTGAAT TATTTTTAAG CAAAACCTCT GCACCTGGGA TTTCAACAAT TGTTACTTGT GAGGGAACAA GACCATTAGC CATCGATATA CAAGCACTTT TAAATCCCAC GAGTTATGCA AGTCCAAGAA GAACTACAAC TGGCATTGAG ATAAACAGGC TTCATCAAAT CTTGGCAGTT CTAGAAAAGA ATATGAATCT TTCGCTCTCA AGATATGACT GTTATCTGGC TGTAGCTGGG GGATTAGAAG TAGAAGAACC TGGGGCTGAT TTAGGAATAG CTGCTGCAAT AGTTTCGAGC TTTAAAGATA TTGAGCTTGA AGAAGGTGTA GTTTTTATAG GAGAGATAGG TTTGGCTGGT CAATTAAGAT TAGTCAGACA AATGCAACAA CGAATTAACG AAGTGATAAG ACTTGGATAT AAGACATTAA TTATCCCAGA TGGAATAGAC ACAAGTGAGT TTGAAACGAA TCAAAAATTA AAAATATTAA AAGCTTCTAA TATTAATCAA GCTTTAATAT ATGCTTTAGA TAATAATTAA
|
Protein sequence | MSRSVSIYVC QSCGAQTRQF FGRCNNCGEW NSIIEEKINQ KSDKSIYTKI NSPKEKSPYR SELISQTKNQ LIERISSGYK ELDRVLGGGL VPGSLVLIGG DPGIGKSTLI LQSATEMAHH RSVLYVAAEE SAQQVKLRWN RIEDAESNLH LLAETDLEMV IKELDHLKPE VAVIDSIQAL HDQNLSSSPG SVAQVRECSA ALQQIAKRQN ISLLIIGHVT KDGMLAGPKV LEHLVDAVLT FEGDRFASHR LLRGVKNRFG ATSELGVFEM QADGLSEVPN PSELFLSKTS APGISTIVTC EGTRPLAIDI QALLNPTSYA SPRRTTTGIE INRLHQILAV LEKNMNLSLS RYDCYLAVAG GLEVEEPGAD LGIAAAIVSS FKDIELEEGV VFIGEIGLAG QLRLVRQMQQ RINEVIRLGY KTLIIPDGID TSEFETNQKL KILKASNINQ ALIYALDNN
|
| |