Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9301_00681 |
Symbol | smc |
ID | 4911543 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9301 |
Kingdom | Bacteria |
Replicon accession | NC_009091 |
Strand | + |
Start bp | 69528 |
End bp | 73118 |
Gene Length | 3591 bp |
Protein Length | 1196 aa |
Translation table | 11 |
GC content | 30% |
IMG OID | 640159632 |
Product | SMC ATPase superfamily chromosome segregation protein |
Protein accession | YP_001090292 |
Protein GI | 126695406 |
COG category | [D] Cell cycle control, cell division, chromosome partitioning |
COG ID | [COG1196] Chromosome segregation ATPases |
TIGRFAM ID | [TIGR02169] chromosome segregation protein SMC, primarily archaeal type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGAGATTGG TACATATCAA TCAGGTCGAG TTTGAAAATT TTAAATCTTT TGGGGGAAAT GTAAAAATTC CTCTTGAAGA AGGTTTTACG GTGGTTACGG GTCCTAACGG TTCTGGAAAA AGTAATATTT TAGATGGAAT TTTATTCTGT TTAGGTTTAT CCAATAGTAG AGGGATGAGG GCTGAAAGAT TACCAGATCT AATAAATAAC TCCAAAGTTA AAGAGGGTAA GTCATCAGAA ACATCTGTAT CGGTAAAATT TAATATTCAA GATTGGTTTC CCAGAGAAGA TCTTCCGCCT TTGGAACTAG AAGAAGAAGA AATTGGCCTT AATAAAGGTC AAAAAGAATG GGTAGTTTCT AGAAAATTAA GGCTTATGCC AGGGGGTTCT TATGCTTCTA CTTATACGTC TGATGGAAAA CAATGTACCT TGCAACAAAT ACAGAGAATA TTAAGAGATA TTAGTGTTGA TCCTGAGGGC AGCAATGTTG TTATGCAGGG TGATGTAACA AGAATAGTAT CAATGAATAA TAAGGAGCGG AGAAATCTTA TTGATGAATT AGCAGGAGTC GCACTTTTTG ATACAAGAAT AGAACAAACT AATGCAAAAT TAAATGACGT TTTTGAAAGA CAAGAAAGAT GTGAAATTTT AGAAAATGAA TTGCAATCTA GTAAGAATAA GCTTGAAAAA GAATGTGAAA AAGCAAAGCG ATATAAAGAG TTAAAGGCAA AACTACAACA AATAATGGAA TTAGAGAAAG TTCTTATTTT TGAAAAACAA GTTAAGCATG TTGAATCTAT AGAAAAAAAA GAAAGTGAAA TTGAAAAAAA TAAAATCTTA TTTAATAAAC AAAAAGCATC TATTAGTAAT GAAATATCAG TTTTAGAAGA TGCTTTGAAA ATACTAGTTG ATGAGCTGAA GGAGAAAGGA GAGGATACTT TAATAAAAGT TAATTCTGAT ATTGGAAGTA TTAACTCTAA CTTGAGAGAA CTTGATAGGA TATCAATTCT GAATAAAGAA GAAGGTATTA AATTACAAAA ACAGAGAGAT GAAATTTCAA TTTCTAAGAG GAATATTGAG TCAGAAAAGA TTAGACAAGA AAATTTCGAT GATAATTTTT TAAATCAATT GAACTTGCAA ATTGATGATC TCACTTTAAA ACACAAATTA TCCAGAAAAA AACTTTCTGA TGCGGCTGGA GAATCTGGAG AATTCTCAAA ACAAAGTATC AAATTAAATG CTGAGCTTGA AAGTATAAAA AATCAAATTA ATCCTTTGGA AATAAAAAAA AGGAAAATTG AAGAAGAGAC TATTCAAAAT AATATTCAAA AGGATGAGAT ATTGTCACAG ATCGAATCCT TAGACTTAGA AGAGCAGAAA ATTTTTAAGG GAAATCAAAG AAAAAAAGAG ACATCCGATA CAAAGAATAA AAATTTAGCA AGTAATAGCG CAGAAATTAA TTCTTTAAAA AATGAAATCG ATTTATTAAT TAAAACTAAA TCAAGGCTAA ATAACGAGCA ATTAAGGCTT GAAAAAGATT TATCTAGATT CGAAAGCAGG AAAGAAGCTT TAAACGAATC TAGAGGTTCA TATGCTCTCA GAATTCTTTT AGAGGCAGGG TTAGAAGGTA TACATGGTTA TGTAGCTCAA CTTGGAGAGG TCAGTGAGAA AAATAGATAT GCATTAGAAA TTGCTGCTGG AAATAGGTTA GGACAAATTG TTGTTGATAA TGATCATATT GCTGCAAAAG CAATTGAAAT TCTTAAAAAG AAGAAAGCGG GAAGATTAAC TTTTTTACCT TTAAATAGAA TTAAAAGTCA AAAAAAGAAT TATGTAATTT CAAGATTTGA AAATCATAGG GAGAATGGAT TTATTGATAA AGCTATTAAT CTAATTACTT TTGATGAAGT TTATTCAGAT GTTTTTCGAT ATGTTTTTGG AGATACTTTG GTTTTTTCAG ACTTATCCTC AGCTAGGTTA TCTACACAAA AAAATAGGTT GGTTACCTTA AGTGGTGAAT TATTAGAAGC AAGTGGTGCT ATTACAGGAG GCAGTAAGTT AAATAAAGAT TTGGCTTATA GGTTTGGAAC TAATAATGAA ATTGATGATT CCAGTCCTAT AAAAGAAAGA TTATTAGTTA TCGAAGAAGC TTTAAAAGAG TCAAATAATG ATTTGATACT AAAAAATAAT AGACTTAATA CATTAAATTC TAACCGCAGT CAAATAATTG AGGATTGTGC CTCATTTAAT AAAGAAATTG AAGTAAATCA AGATTCACTT AAAGCTGTCT CGCAAAGAAT TGAGGATTGT AAATCGAGAT TAAAAAAACT TGATATTGCT AATAATTTAT TAGTTAACGA GTTAGGGCAT TTAAAAAATC AATTGAAGCC TTATTACGAT AAGTTTGATC AACTACAAAC CATTCAAAAG GCAAATTATG AAAAAAATCA AAAATCATCA TTAATAGCTT TTAATGACGA TTTTAATAAT CTTGATAAAA AACTTGAATT ACTTATTAAA GAGAGAAATA CATTACTAGA TAAAAAGAAT CAATTTGCTT TAAATAAAGA GCGTATCAAT AATTCATTAA AAATTACTCT ACTACAAGAA AAAAACTTGC AGGAATCTAT TAAACAACTC GCAACTGCTC ATAGTGAATG GCTAGAAAAA AGAGATCAAT TTAAAAAAGA ACTTTCAGAT CTTGATAATC AAAAAAATTC TCTAGAGAAG AATTTAGGTT TATTGAGAAG GAAAAGAGAT GAATTAAACT CTTCAATTTC AAATAAAAGG CAAGAATATA ATAACTATCT GTTAAAGCTT GAATATCTTG AAAGGGATAT GCATACCCTT AAAGAAGAGA TGAGGAGCGA GAAAATAAAA TTAGAAAATT ATAAAAGAGA TCTACCTAAT CCTTCCCCGG AGTTTGGAGA ATATGAAGGG AAGAGTCTTG AATCTTTGCA ATCAGAAATT TCGATTATAA ATGCAAAATT AGAAAGCTTA GAACCTGTCA ATATGTTGGC TCTTGATGAA TTAGAAGAAT TAATTGAGAG ATTAAATGGT TTGCGAGAAA AATTAGAAAT TCTATCTAAT GAAAGATCTG AATTATTGTT GAGAATAGAA ACTGTATCTA CGATGCGTCA AGAAGCTTTT ATGCAAGCAT TTACAGAAGT TGATAGACAT TTTAGAGAAA TTTTTGCAAA TTTATCTGAT GGAGATGGAT TTCTTCAACT TGAAAATCCT AATTCTCCTT TAGAAGGAGG ATTAACTTTA GTGGCTCATC CCAAGGGAAA AAATGTCAGA AGATTAGCGT CAATGTCTGG TGGTGAAAAA TCGTTAACTG CTTTAAGTTT TTTATTTGCT TTGCAAAAGT ATAAGCCTTC ACCTTTTTAT GCATTAGACG AGGTTGATAG TTTTTTAGAT GGTATTAATG TTGAAAGGTT GTCAAAATTA ATATCAAATC AGTCCTCAAA TGCTCAATTT ATAGTCGTAA GTCATAGAAG GCCTATGATT AGTGCATCTG AACGAACAAT TGGGGTTGCG CAAGCAAGAG GTGCTAATAC TCAAGTTCTT GGGTTACCAA ATGCTGCATA A
|
Protein sequence | MRLVHINQVE FENFKSFGGN VKIPLEEGFT VVTGPNGSGK SNILDGILFC LGLSNSRGMR AERLPDLINN SKVKEGKSSE TSVSVKFNIQ DWFPREDLPP LELEEEEIGL NKGQKEWVVS RKLRLMPGGS YASTYTSDGK QCTLQQIQRI LRDISVDPEG SNVVMQGDVT RIVSMNNKER RNLIDELAGV ALFDTRIEQT NAKLNDVFER QERCEILENE LQSSKNKLEK ECEKAKRYKE LKAKLQQIME LEKVLIFEKQ VKHVESIEKK ESEIEKNKIL FNKQKASISN EISVLEDALK ILVDELKEKG EDTLIKVNSD IGSINSNLRE LDRISILNKE EGIKLQKQRD EISISKRNIE SEKIRQENFD DNFLNQLNLQ IDDLTLKHKL SRKKLSDAAG ESGEFSKQSI KLNAELESIK NQINPLEIKK RKIEEETIQN NIQKDEILSQ IESLDLEEQK IFKGNQRKKE TSDTKNKNLA SNSAEINSLK NEIDLLIKTK SRLNNEQLRL EKDLSRFESR KEALNESRGS YALRILLEAG LEGIHGYVAQ LGEVSEKNRY ALEIAAGNRL GQIVVDNDHI AAKAIEILKK KKAGRLTFLP LNRIKSQKKN YVISRFENHR ENGFIDKAIN LITFDEVYSD VFRYVFGDTL VFSDLSSARL STQKNRLVTL SGELLEASGA ITGGSKLNKD LAYRFGTNNE IDDSSPIKER LLVIEEALKE SNNDLILKNN RLNTLNSNRS QIIEDCASFN KEIEVNQDSL KAVSQRIEDC KSRLKKLDIA NNLLVNELGH LKNQLKPYYD KFDQLQTIQK ANYEKNQKSS LIAFNDDFNN LDKKLELLIK ERNTLLDKKN QFALNKERIN NSLKITLLQE KNLQESIKQL ATAHSEWLEK RDQFKKELSD LDNQKNSLEK NLGLLRRKRD ELNSSISNKR QEYNNYLLKL EYLERDMHTL KEEMRSEKIK LENYKRDLPN PSPEFGEYEG KSLESLQSEI SIINAKLESL EPVNMLALDE LEELIERLNG LREKLEILSN ERSELLLRIE TVSTMRQEAF MQAFTEVDRH FREIFANLSD GDGFLQLENP NSPLEGGLTL VAHPKGKNVR RLASMSGGEK SLTALSFLFA LQKYKPSPFY ALDEVDSFLD GINVERLSKL ISNQSSNAQF IVVSHRRPMI SASERTIGVA QARGANTQVL GLPNAA
|
| |