Gene P9303_00881 details


Gene Information

Locus tag: P9303_00881
Symbol:
ID: 4778202
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9303
Kingdom: Bacteria
Replicon accession: NC_008820
Strand:
Start bp: 88774
End bp: 91557
Gene Length: 2784 bp
Protein Length: 927 aa
Translation table: 11
GC content: 57%
IMG OID: 640085588
Product: DNA mismatch repair protein MutS
Protein accession: YP_001016110
Protein GI: 124021803
COG category: [L] Replication, recombination and repair
COG ID: [COG0249] Mismatch repair ATPase (MutS family)
TIGRFAM ID: [TIGR01070] DNA mismatch repair protein MutS
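
The start and end coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 88774
end_bp = 91557
gene_length_bp = 2784
protein_length_aa = 927

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon that is not part of the protein.
codons, remainder = divmod(span_bp, 3)
coding_codons = codons - 1

print(span_bp, codons, coding_codons)
```

Under these assumptions the span equals the listed gene length, the length is a whole number of codons, and the codon count minus the stop codon equals the listed protein length.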


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0543692
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTGCCA ACGCTGAGTT GTTGCAAGGC AGTCTGTTTG GTGATCTGGA ACCGCAGGCA 
AACGCTGAAA GCTGCTCAGA AACCATTACA CGGGCTCTGC GTAACGATCT CTCTGATCAA
GAACTCGTCG ACGAATCCCT AAAAAGGCCC CGCAATCGAC ATAAGCCAAC ATCCGTTCCA
AGCATCCCTC TTGATTCTGA AAGCCAAGAG CAACTCGAAA CAGCAGATAA CGACAACGAC
CTGCCCGCCT GGGCACACCA CACTCTCGTT GACCCTGAGC AGCTCACGCC AATGCTGCGT
CACTACGTGG AACTCAAGGC CAAACATCCC GAAAGGATCT TGCTCTACAG ACTCGGCGAT
TTTTTTGAAT GCTTTTTTGA AGATGCCATT CAGCTCTCAA GGCTGCTGGA GCTGACTCTC
ACTGGCAAGG AAGGAGGTAA AGGGATCGGC CGTGTGCCGA TGGCAGGCGT CCCCCACCAC
GCTGCAGAGC GCTATTGCAC CGAATTGATC CGCCATGGGC TAAGCGTGGC CATTTGTGAC
CAACTGGAAA CAACTCCAAG CAAAGGCGCA CTGCTCAAAC GCGACATCAC CAGGGTGCTC
ACCCCAGGCA CGGTGCTTGC CGAAGGCATG CTGACAGCTC GCCGCAACAA TTGGCTAGCA
GCAGTTGTGG TTGAGCCAGC ACAAGGCAAC CAACCGTTTT GCTGGGGTCT AGCCAACGCC
GATGTCAGCA CAGGCGAATT CCTGGTCACG CAACGGGAGG GGAGCGCTGA GCTGCACCAA
CACCTTGCTC AGCTCGAGGC TTCGGAACTG ATCATGGCTC AAAAGATCGG AGAAAGCAGC
AGGCCTGCGT GGTGTCCGGA GCAGCTCTTC CTGACAACGA TGGCCACTAC CCCCTTCAGT
CAGCCAGAGG CCGAGCGCAC GCTGCTCAAT CATTACCGGC TCAGCACCCT TGACGGTCTA
GGGCTGCAAG AGGTCCCTTT AGCACTAAGG GCCGCCGGCG GACTGCTGAC TTATCTAAGG
GACACCCAAC CCCTCGCTGA AAACGTCGAT GAAGGCATCG CCCCGATGCC CCTGGAACAT
CCGCTCACAG TGTTCGCCGG CGATGCCTTA GTACTCGACG CCCAAACCCG ACGCAACTTA
GAACTCACTA GCACCCAACG GGATGGCCAA TTCCAAGGAT CCCTTCTTTG GGCCGTAGAC
CGTACCCTCA CTGCCATGGG AGCCCGCTGC CTGCGCCGCT GGATCGAGGC CCCCCTCCTA
GACAGCAAAG CCATCCGCGC CCGTCAAGCG GTGGTGAACC ATCTCGTGGA AACACGCTCC
CTTCGGCATT CACTGCGACG CTTGCTGAGA CCGATGGGAG ATCTCGAACG GCTGGCCGGC
AGGGCCGGCG CCGGTCATGC TGGAGCCCGC GAACTCGTGG CTATCGCCGA TGGCATTGAA
CGATTACCAC GGCTAGCAGA GCAACTACAC AATGCATTGC GCTCAGCCCC AAACTGGCTC
GACAACTTGC TGACTCTGGA TAAGAGCCTG CCAAAGCTGG CAGCGAGTAT CCGCGAGCAA
TTGATCAATA ACCCACCGCT CAGCCTCAGC GAGGGAGGCC TCATGCACGA CAACGTTGAT
CCGCTACTGG ATGGCCTGCG CAATCAACTG GATGACCAGG ACACCTGGCT TGCAGGTCAG
GAAGTCCAAG AACGCAAACT CAGTGGCAAC CCTAATCTGC GCCTGCAATA CCACCGAACC
TTCGGCTACT TCCTAGCCGT AAGCAAAGCC AAGGCTTCCA TGGTGCCGGA CCACTGGATC
CGCCGGCAGA CCCTGGCCAA CGAGGAACGC TTCATCACTC CAGATCTCAA AACCCGGGAG
GGGCAGATCT TCCAGCTCAG AGCAAGGGCG TGCCAACGGG AATACGAACT GTTCTGCCAA
TTGCGCGAGC AAGTGGGAAA ACAGGCAACT TCAATCCGTA AAGCAGCCCG GGCAGTGGCA
GGACTCGATG CGCTAGTCGG CCTAGCAGAA GTAGCCGCCA CTGGAGACTA TTGCTGCCCA
GAAATCGATA ACAGCAGAGA ACTACAGCTC AAAACCTGCA GACACCCAGT GGTGGAACAA
CTACTGGTAG AGAGATCCTT CATCCCTAAC GATGTAGAAC TTGGGAAGGA CATCGACCTC
GTCGTACTCA CAGGCCCAAA CGCCAGTGGC AAAAGCTGCT ACCTGCGTCA AATCGGCCTC
ATACAGTTGC TAGCCCAGGT AGGTAGTTGG GTGCCTGCAA AGCAGGCCCG TGTTGGCATC
GCAGACCGCA TCTTCACCCG CGTCGGCGCC GTCGATGACC TAGCGGCAGG ACAATCCACC
TTCATGGTGG AAATGGCAGA AACGGCCAAC ATTCTCCATC ACGCCAGCGA TCGTTCCTTG
GTGCTTCTCG ATGAGATTGG ACGTGGCACC GCCACCTTTG ATGGCCTTTC GATTGCCTGG
GCGGTGAGCG AACACCTGGC AAGAGATCTA AGAAGCCGCA CTGTGTTCGC GACCCATTAT
CACGAACTCA ATGGGCTGAG CCAAGAACTG ACCAATGTCG CCAATTCTCA AGTTTTGGTG
GAGGAAACAG GCGACGACCT GGTGTTCCTA CATCAGGTGG CGGCAGGCGG TGCAAATCGC
AGTTACGGCA TTGAAGCCGC CCGTCTAGCT GGTGTGCCTG ACGACGTGGT ACAACGAGCA
AGGCAGGTGT TAGCTCAGCT CCATGACGAT GACTCATCTC TGCCAGCCCT ACTAAGCGCG
AAAACAATCA AAAGACAAAG CTGA
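
The gene sequence above is printed in blocks of 10 bases separated by spaces, so it has to be cleaned before any length or GC calculation. The following is a minimal Python sketch of that cleanup and of a GC percentage calculation. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first and last lines of the sequence are used as a short demonstration.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in seq (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First and last lines of the gene sequence above, used as a short demo.
first_line = clean_sequence(
    "ATGACTGCCA ACGCTGAGTT GTTGCAAGGC AGTCTGTTTG GTGATCTGGA ACCGCAGGCA"
)
last_line = clean_sequence("AAAACAATCA AAAGACAAAG CTGA")

# The gene opens with an ATG start codon and closes with a TGA stop codon.
print(len(first_line), first_line[:3], last_line[-3:])
```

Passing the full cleaned sequence to `gc_percent` gives a value that can be compared with the GC content field in the record, which is presumably rounded.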
 
Protein sequence
MTANAELLQG SLFGDLEPQA NAESCSETIT RALRNDLSDQ ELVDESLKRP RNRHKPTSVP 
SIPLDSESQE QLETADNDND LPAWAHHTLV DPEQLTPMLR HYVELKAKHP ERILLYRLGD
FFECFFEDAI QLSRLLELTL TGKEGGKGIG RVPMAGVPHH AAERYCTELI RHGLSVAICD
QLETTPSKGA LLKRDITRVL TPGTVLAEGM LTARRNNWLA AVVVEPAQGN QPFCWGLANA
DVSTGEFLVT QREGSAELHQ HLAQLEASEL IMAQKIGESS RPAWCPEQLF LTTMATTPFS
QPEAERTLLN HYRLSTLDGL GLQEVPLALR AAGGLLTYLR DTQPLAENVD EGIAPMPLEH
PLTVFAGDAL VLDAQTRRNL ELTSTQRDGQ FQGSLLWAVD RTLTAMGARC LRRWIEAPLL
DSKAIRARQA VVNHLVETRS LRHSLRRLLR PMGDLERLAG RAGAGHAGAR ELVAIADGIE
RLPRLAEQLH NALRSAPNWL DNLLTLDKSL PKLAASIREQ LINNPPLSLS EGGLMHDNVD
PLLDGLRNQL DDQDTWLAGQ EVQERKLSGN PNLRLQYHRT FGYFLAVSKA KASMVPDHWI
RRQTLANEER FITPDLKTRE GQIFQLRARA CQREYELFCQ LREQVGKQAT SIRKAARAVA
GLDALVGLAE VAATGDYCCP EIDNSRELQL KTCRHPVVEQ LLVERSFIPN DVELGKDIDL
VVLTGPNASG KSCYLRQIGL IQLLAQVGSW VPAKQARVGI ADRIFTRVGA VDDLAAGQST
FMVEMAETAN ILHHASDRSL VLLDEIGRGT ATFDGLSIAW AVSEHLARDL RSRTVFATHY
HELNGLSQEL TNVANSQVLV EETGDDLVFL HQVAAGGANR SYGIEAARLA GVPDDVVQRA
RQVLAQLHDD DSSLPALLSA KTIKRQS
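
The protein sequence above corresponds to a codon-by-codon translation of the gene sequence. The following is a minimal Python sketch of that translation, assuming the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in which codons may act as start codons, and that is not handled here). The `translate` helper and the table constants are illustrative names, not part of the record.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
# Assumption: amino-acid assignments of the standard code, shared by table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string (spaces allowed) until the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence, in frame with the start codon.
print(translate("ATGACTGCCA ACGCTGAGTT GTTGCAAGGC AGTCTGTTTG GTGATCTGGA ACCGCAGGCA"))
print(translate("AAAACAATCA AAAGACAAAG CTGA"))
```

The two demo translations match the first 20 and last 7 residues of the protein sequence listed above.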