Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NATL1_01211 |
Symbol | smc |
ID | 4779534 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | + |
Start bp | 115731 |
End bp | 119336 |
Gene Length | 3606 bp |
Protein Length | 1201 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 640083384 |
Product | SMC ATPase superfamily chromosome segregation protein |
Protein accession | YP_001013950 |
Protein GI | 124024834 |
COG category | [D] Cell cycle control, cell division, chromosome partitioning |
COG ID | [COG1196] Chromosome segregation ATPases |
TIGRFAM ID | [TIGR02169] chromosome segregation protein SMC, primarily archaeal type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGGTCCATA TCAACCACGT AGATTTGTCT CATTTTAAGT CCTTCGGTGG ATCGATGTCG ATTCCACTTG AAGAGGGCTT TACTGTTGTA ACAGGTCCAA ATGGTTCAGG TAAGAGCAAT ATTCTTGATG GTGTTTTATT TTGTTTAGGG CTTGCAAATA GTCGAGGCAT GAGAGCAGAC AGATTGCCTG ACTTAGTAAA TAGTGGCGTA TTAAAAGCTG GAAAATCTTC GGAAACTAAA GTAACTGTCA AATTTGATTT AACTGATTGG AAGCCCGATG AAGCCGAAGA AGGTATAGAA CCTACTGAGG AAGGGCCTTG GATTAAACCT GATCAAAAAG AATGGACAGT ATCAAGAAGA TTAAAAGTTA TGCCAGGAGG GTCATATGCT TCCACTTACA GCGCAGATGG TGAGACTTGT AATTTACAGC AATTACAAAC CCAATTAAGG CGACTAAGAA TCGATCCTGA GGGGAGCAAT GTTGTTATGC AGGGAGATGT TACTCGAATT GTTTCTATGA GTAATAAAGA TCGTAGAGGT CTAATAGACG AACTTGCTGG TGTTGCACTA TTTGATACAC GTATTGATCA AACTCGAGCA AAGTTGGATG ATGTTTATGA AAGGCAGGAA CGTTGTCGGA TTGTTGAACA AGAACTGATT CTTTCCAAAC AGCGTTTGCA AAAAGATTGT GAGAAAGCAA GTTTATATAA AGATCTAAAA AATCAATTGT TAATTGGCAG ACAGCAAGAA TTAGTTTTAT CTTATGAAAA GGCAAAAAAG GGACTGGAAA AACTAGATAT AGATCACCAA GAATTGCTTA AAAAAGAAAA AATAGATTCA GAAAAGTTAC TTAATCATGA AAATGATTTA AGTACATGCA TAGAAAAATT AAATATCTTA CAAAAAAATG TTAAAGAACT CGGTGAAGAT CAATTGCTTG AGGTTCAAGG TAAACTAGCA GGGATTGAAT CTCAACATAG AGAGCTAGAA AGACAAGGAC TTAATCATAA AAATGAAGGT GAGAAATTAC AGGAATCTAG AAATGATCTT CTACAGAAAA AGAAAGATTA TCAAACTGAC TTGCAAAGTA AATTGAATGA GATTAATCCT GAAGAATTAG AAGAAGCTGA CTTAAGATGT AAAGAGGCTG AGGCTTGGGT TGAATCTTCT AGACGGAAGC TTTCCGATGT AGCAGGTCGT TCTGGTGCAT GGATAGAAAA ACATCAGAAA GCCAGAGATG AGCTTAATAA AATCCGATTG GAATTAGATC CTAAAAGACT AGAGAAACAA AATATTGAAG AAAGTTTATT GCAATTAAAT GTTATTTTAA AAGAATTAGA GACTGATCAA AAAGCTGATC AATCTTCCAA TGAAAAAGTA CATATAGAAA TTAGTAATTT GAATGAAAAA TGGGATAGTA TTTTAGATTT ACTGTCTGTT AAGAAAGAAG AATTTCAAGT ATTAGTATCT GAGAAGGGTA TACAAGAACG CACTAAATAT AGATTAGAGA AAGAACAAGT CAAGCTTCAA AATGATATTG CTCGTTTAGA AAGTAGGAAA GAAATGATAT CCGAAAGTCG AGGCACTAAT GCAATAACTT TGTTACTAGA GTCTGGTTTA GAAGGTATAC ATGGTCCTGT AGCTAATTTA GGTGAAGTGG AAGATCGTTA CAGGATTGCT CTTGAAGTAG CGGCTGGAGC CAGGCTTGGG CAGGTTGTTG TAGACAGCGA TCAAATTGCT GCAAAATCAA TTGATCTTTT AAAACGAAAA AGAGCTGGAA GATTGACTTT TTTACCTCTT AATAAGATTT TAAAAAACTC TCAAAGTAGG TCTGATGTAT TCCAGAGATC TGTTCACACT AATCTCAATA AAAGTACTGG ATTAATTGGC AAAGCTATTG ATTTGATTCA ATATGATTCA ATATATAAAA ATGTTTTTTT GTATGTATTT GGTGAAACAA TTGTTTTTAG TGACTTGTCA TCAGCTCGTG ATCAAATTGG TATAAAAAGA GCTGTTACTT TAGAAGGTGA ATTACTGGAG AAGAGTGGAG CGATGACTGG TGGAAGTTTA AATAACAGAG CCTTGGGTTT AAGTTTTGGA AGAGTAAAAG ATAATGATGA TTGTGATCTT TTGAAGAATC GATTATTAGA AATTGGTGAG ACTTTAACTA ATTGTCAAAA GAACGAAAAT CAACTCATAA ATAAGCTAGA CAATCTTAGA TCTCAGCTAA GTAAATTAGA ACAAGAGAAA GCTGCACTTG ATGCTGAAAG AGTAACTTCT AAAAGATCAA ATTCACCGTT ATTAGAACGC CAAAGCCATC GCTCAAAAAG AATTAGCGAT CTTCAAAAAT CTAAAAAGGA AAAACTATGT CAATTAGAAT CTATTAATTT GATTATTAAA CCTTTAGAGG TCATTTTACT AGAAATTGAG AAGGAAGAGA AAAAAATAGA TAAATCAAGT GACTCATCTG TTTGGTCTAA ATTACAAAAT GATTTAGAAG ATGCTGATAA AAATCTTCTT TCTTTTAGGA ATAAAAGAGA TGAGATTTCA AATAAGCAAT CTCAAACTAA GCTTGCAATT GATCGATTAA ATGATCAAGA AACTACTTTA ATAAGCGAAG AAAAACGATT AAAAGATTCC ATAGATACGC TTGCTTCTGC TCATATCGCT TGGCGTGAGC AAAGCAAACT ACTTACTACC AATCGGCAAG ATTTGATAAA TCAACAAAAA GATTTAGAAA CTCGCTTTGG GGAACAACGA AGAGAAAGAG ATTGTGTGGA AGCAGATGTA GCTAAAAAAA GATTCAATTT ACAAGAAATG CAATGGAGTC TCCAACGTTT AAGGGAAGAT CAAAAAAACA TGAAGGAAGA GATACGAATG GAAACTATTC GATGTACTGA ATTAGAGAGG AAGCTTCCTA ATCCTATGCC TTTGATTTCA GATGAAATAA GAGATAATGG TTTGGATGAT TTACTTTCTA AATTGGAAGA TTTACAGAAA AGAATGGAGG AATTGGAACC TGTCAATATG CTTGCATTGG AAGAGTTGGC TAAATTAGAA GAGAGACTGA ATGAACTGGA AAATAGACTT CAGGTTCTAA CTGATGAAAG ATCGGAGTTA TTACTTCGAA TAGAGACAGT TGCAACTTTA AGAGAGGAGG CCTTTATGGA GGCTTTTAAG GCGGTTGATA TACATTTTCG TGAAATTTTT GCAAGCCTTT CTGAAGGAGA TGGACATTTG CAGCTTGAAA ATCCTGAGGA ACCTTTGGAG GGTGGCTTGA CTTTGGTTGC TCATCCAAAA GGTAAGCCTG TAAGACGGCT TGCGGCTATG TCAGGTGGAG AAAAATCTTT AACAGCTTTG AGTTTTCTTT TTGCCTTGCA ACGTTTTAGA CCCTCTCCTT TTTATGCCCT TGATGAAGTA GATAGTTTCT TGGATGGAGT AAATGTTGAA AGGTTAGCGG CCTTAATTGC TAAACAAGCT GAACAGGCAC AATTTCTTGT CGTAAGTCAT AGAAGACCTA TGATTGGAGC ATCAATGAGG ACTATTGGGG TTACACAAGC AAGGGGAAAT CATACTCAAG TTGTTGGGTT GCCAATCGCA GCTTGA
|
Protein sequence | MVHINHVDLS HFKSFGGSMS IPLEEGFTVV TGPNGSGKSN ILDGVLFCLG LANSRGMRAD RLPDLVNSGV LKAGKSSETK VTVKFDLTDW KPDEAEEGIE PTEEGPWIKP DQKEWTVSRR LKVMPGGSYA STYSADGETC NLQQLQTQLR RLRIDPEGSN VVMQGDVTRI VSMSNKDRRG LIDELAGVAL FDTRIDQTRA KLDDVYERQE RCRIVEQELI LSKQRLQKDC EKASLYKDLK NQLLIGRQQE LVLSYEKAKK GLEKLDIDHQ ELLKKEKIDS EKLLNHENDL STCIEKLNIL QKNVKELGED QLLEVQGKLA GIESQHRELE RQGLNHKNEG EKLQESRNDL LQKKKDYQTD LQSKLNEINP EELEEADLRC KEAEAWVESS RRKLSDVAGR SGAWIEKHQK ARDELNKIRL ELDPKRLEKQ NIEESLLQLN VILKELETDQ KADQSSNEKV HIEISNLNEK WDSILDLLSV KKEEFQVLVS EKGIQERTKY RLEKEQVKLQ NDIARLESRK EMISESRGTN AITLLLESGL EGIHGPVANL GEVEDRYRIA LEVAAGARLG QVVVDSDQIA AKSIDLLKRK RAGRLTFLPL NKILKNSQSR SDVFQRSVHT NLNKSTGLIG KAIDLIQYDS IYKNVFLYVF GETIVFSDLS SARDQIGIKR AVTLEGELLE KSGAMTGGSL NNRALGLSFG RVKDNDDCDL LKNRLLEIGE TLTNCQKNEN QLINKLDNLR SQLSKLEQEK AALDAERVTS KRSNSPLLER QSHRSKRISD LQKSKKEKLC QLESINLIIK PLEVILLEIE KEEKKIDKSS DSSVWSKLQN DLEDADKNLL SFRNKRDEIS NKQSQTKLAI DRLNDQETTL ISEEKRLKDS IDTLASAHIA WREQSKLLTT NRQDLINQQK DLETRFGEQR RERDCVEADV AKKRFNLQEM QWSLQRLRED QKNMKEEIRM ETIRCTELER KLPNPMPLIS DEIRDNGLDD LLSKLEDLQK RMEELEPVNM LALEELAKLE ERLNELENRL QVLTDERSEL LLRIETVATL REEAFMEAFK AVDIHFREIF ASLSEGDGHL QLENPEEPLE GGLTLVAHPK GKPVRRLAAM SGGEKSLTAL SFLFALQRFR PSPFYALDEV DSFLDGVNVE RLAALIAKQA EQAQFLVVSH RRPMIGASMR TIGVTQARGN HTQVVGLPIA A
|
| |