Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_00881 |
Symbol | |
ID | 4778202 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 88774 |
End bp | 91557 |
Gene Length | 2784 bp |
Protein Length | 927 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 640085588 |
Product | DNA mismatch repair protein MutS |
Protein accession | YP_001016110 |
Protein GI | 124021803 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0249] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | [TIGR01070] DNA mismatch repair protein MutS |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0543692 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTGCCA ACGCTGAGTT GTTGCAAGGC AGTCTGTTTG GTGATCTGGA ACCGCAGGCA AACGCTGAAA GCTGCTCAGA AACCATTACA CGGGCTCTGC GTAACGATCT CTCTGATCAA GAACTCGTCG ACGAATCCCT AAAAAGGCCC CGCAATCGAC ATAAGCCAAC ATCCGTTCCA AGCATCCCTC TTGATTCTGA AAGCCAAGAG CAACTCGAAA CAGCAGATAA CGACAACGAC CTGCCCGCCT GGGCACACCA CACTCTCGTT GACCCTGAGC AGCTCACGCC AATGCTGCGT CACTACGTGG AACTCAAGGC CAAACATCCC GAAAGGATCT TGCTCTACAG ACTCGGCGAT TTTTTTGAAT GCTTTTTTGA AGATGCCATT CAGCTCTCAA GGCTGCTGGA GCTGACTCTC ACTGGCAAGG AAGGAGGTAA AGGGATCGGC CGTGTGCCGA TGGCAGGCGT CCCCCACCAC GCTGCAGAGC GCTATTGCAC CGAATTGATC CGCCATGGGC TAAGCGTGGC CATTTGTGAC CAACTGGAAA CAACTCCAAG CAAAGGCGCA CTGCTCAAAC GCGACATCAC CAGGGTGCTC ACCCCAGGCA CGGTGCTTGC CGAAGGCATG CTGACAGCTC GCCGCAACAA TTGGCTAGCA GCAGTTGTGG TTGAGCCAGC ACAAGGCAAC CAACCGTTTT GCTGGGGTCT AGCCAACGCC GATGTCAGCA CAGGCGAATT CCTGGTCACG CAACGGGAGG GGAGCGCTGA GCTGCACCAA CACCTTGCTC AGCTCGAGGC TTCGGAACTG ATCATGGCTC AAAAGATCGG AGAAAGCAGC AGGCCTGCGT GGTGTCCGGA GCAGCTCTTC CTGACAACGA TGGCCACTAC CCCCTTCAGT CAGCCAGAGG CCGAGCGCAC GCTGCTCAAT CATTACCGGC TCAGCACCCT TGACGGTCTA GGGCTGCAAG AGGTCCCTTT AGCACTAAGG GCCGCCGGCG GACTGCTGAC TTATCTAAGG GACACCCAAC CCCTCGCTGA AAACGTCGAT GAAGGCATCG CCCCGATGCC CCTGGAACAT CCGCTCACAG TGTTCGCCGG CGATGCCTTA GTACTCGACG CCCAAACCCG ACGCAACTTA GAACTCACTA GCACCCAACG GGATGGCCAA TTCCAAGGAT CCCTTCTTTG GGCCGTAGAC CGTACCCTCA CTGCCATGGG AGCCCGCTGC CTGCGCCGCT GGATCGAGGC CCCCCTCCTA GACAGCAAAG CCATCCGCGC CCGTCAAGCG GTGGTGAACC ATCTCGTGGA AACACGCTCC CTTCGGCATT CACTGCGACG CTTGCTGAGA CCGATGGGAG ATCTCGAACG GCTGGCCGGC AGGGCCGGCG CCGGTCATGC TGGAGCCCGC GAACTCGTGG CTATCGCCGA TGGCATTGAA CGATTACCAC GGCTAGCAGA GCAACTACAC AATGCATTGC GCTCAGCCCC AAACTGGCTC GACAACTTGC TGACTCTGGA TAAGAGCCTG CCAAAGCTGG CAGCGAGTAT CCGCGAGCAA TTGATCAATA ACCCACCGCT CAGCCTCAGC GAGGGAGGCC TCATGCACGA CAACGTTGAT CCGCTACTGG ATGGCCTGCG CAATCAACTG GATGACCAGG ACACCTGGCT TGCAGGTCAG GAAGTCCAAG AACGCAAACT CAGTGGCAAC CCTAATCTGC GCCTGCAATA CCACCGAACC TTCGGCTACT TCCTAGCCGT AAGCAAAGCC AAGGCTTCCA TGGTGCCGGA CCACTGGATC CGCCGGCAGA CCCTGGCCAA CGAGGAACGC TTCATCACTC CAGATCTCAA AACCCGGGAG GGGCAGATCT TCCAGCTCAG AGCAAGGGCG TGCCAACGGG AATACGAACT GTTCTGCCAA TTGCGCGAGC AAGTGGGAAA ACAGGCAACT TCAATCCGTA AAGCAGCCCG GGCAGTGGCA GGACTCGATG CGCTAGTCGG CCTAGCAGAA GTAGCCGCCA CTGGAGACTA TTGCTGCCCA GAAATCGATA ACAGCAGAGA ACTACAGCTC AAAACCTGCA GACACCCAGT GGTGGAACAA CTACTGGTAG AGAGATCCTT CATCCCTAAC GATGTAGAAC TTGGGAAGGA CATCGACCTC GTCGTACTCA CAGGCCCAAA CGCCAGTGGC AAAAGCTGCT ACCTGCGTCA AATCGGCCTC ATACAGTTGC TAGCCCAGGT AGGTAGTTGG GTGCCTGCAA AGCAGGCCCG TGTTGGCATC GCAGACCGCA TCTTCACCCG CGTCGGCGCC GTCGATGACC TAGCGGCAGG ACAATCCACC TTCATGGTGG AAATGGCAGA AACGGCCAAC ATTCTCCATC ACGCCAGCGA TCGTTCCTTG GTGCTTCTCG ATGAGATTGG ACGTGGCACC GCCACCTTTG ATGGCCTTTC GATTGCCTGG GCGGTGAGCG AACACCTGGC AAGAGATCTA AGAAGCCGCA CTGTGTTCGC GACCCATTAT CACGAACTCA ATGGGCTGAG CCAAGAACTG ACCAATGTCG CCAATTCTCA AGTTTTGGTG GAGGAAACAG GCGACGACCT GGTGTTCCTA CATCAGGTGG CGGCAGGCGG TGCAAATCGC AGTTACGGCA TTGAAGCCGC CCGTCTAGCT GGTGTGCCTG ACGACGTGGT ACAACGAGCA AGGCAGGTGT TAGCTCAGCT CCATGACGAT GACTCATCTC TGCCAGCCCT ACTAAGCGCG AAAACAATCA AAAGACAAAG CTGA
|
Protein sequence | MTANAELLQG SLFGDLEPQA NAESCSETIT RALRNDLSDQ ELVDESLKRP RNRHKPTSVP SIPLDSESQE QLETADNDND LPAWAHHTLV DPEQLTPMLR HYVELKAKHP ERILLYRLGD FFECFFEDAI QLSRLLELTL TGKEGGKGIG RVPMAGVPHH AAERYCTELI RHGLSVAICD QLETTPSKGA LLKRDITRVL TPGTVLAEGM LTARRNNWLA AVVVEPAQGN QPFCWGLANA DVSTGEFLVT QREGSAELHQ HLAQLEASEL IMAQKIGESS RPAWCPEQLF LTTMATTPFS QPEAERTLLN HYRLSTLDGL GLQEVPLALR AAGGLLTYLR DTQPLAENVD EGIAPMPLEH PLTVFAGDAL VLDAQTRRNL ELTSTQRDGQ FQGSLLWAVD RTLTAMGARC LRRWIEAPLL DSKAIRARQA VVNHLVETRS LRHSLRRLLR PMGDLERLAG RAGAGHAGAR ELVAIADGIE RLPRLAEQLH NALRSAPNWL DNLLTLDKSL PKLAASIREQ LINNPPLSLS EGGLMHDNVD PLLDGLRNQL DDQDTWLAGQ EVQERKLSGN PNLRLQYHRT FGYFLAVSKA KASMVPDHWI RRQTLANEER FITPDLKTRE GQIFQLRARA CQREYELFCQ LREQVGKQAT SIRKAARAVA GLDALVGLAE VAATGDYCCP EIDNSRELQL KTCRHPVVEQ LLVERSFIPN DVELGKDIDL VVLTGPNASG KSCYLRQIGL IQLLAQVGSW VPAKQARVGI ADRIFTRVGA VDDLAAGQST FMVEMAETAN ILHHASDRSL VLLDEIGRGT ATFDGLSIAW AVSEHLARDL RSRTVFATHY HELNGLSQEL TNVANSQVLV EETGDDLVFL HQVAAGGANR SYGIEAARLA GVPDDVVQRA RQVLAQLHDD DSSLPALLSA KTIKRQS
|
| |