Gene Apre_1336 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApre_1336 
Symbol 
ID8398143 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerococcus prevotii DSM 20548 
KingdomBacteria 
Replicon accessionNC_013171 
Strand
Start bp1436374 
End bp1438719 
Gene Length2346 bp 
Protein Length781 aa 
Translation table11 
GC content37% 
IMG OID644995698 
ProductMutS2 family protein 
Protein accessionYP_003153080 
Protein GI257066824 
COG category[L] Replication, recombination and repair 
COG ID[COG1193] Mismatch repair ATPase (MutS family) 
TIGRFAM ID[TIGR01069] MutS2 family protein 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAAGAAA AAAGCTTAAA GGTTTTAGAA TATGACAAGA TTCTAGAAAG ACTTGCTGGA 
TGTGCGAGGT CAAATCTTGT AAAGGATCAG ATCCTAAAGC TAAGACCTTA TGATGATATA
AATTATATAA GAGAAGAGCT TTACGAGACT TCTGCTATGG TCGATGTGAT TAGGAAAAAT
GGAAACATAG ATCTATTTGG TCTCTATGAT CTTACAGAAA TTGTTGCCTA TATTAGAAAA
AATGGCATAC TCGATCCAGG AGAGCTCTTA AAAGTCCTTG ACCTACTAAG GGTGAGTGAA
TATTTAAAAG ATTATGGGAA AAATATCGAA GATAGGAAGA TAGGAGATAT TTTTTCTAGA
ATCTCAATCA ATGATTTTCT CAAAAACGAA ATCGATAGGT CTATAATAAA CGAGGAGGAA
ATAGCAGATT CTGCCTCATC TACCCTTAGA AATATCAGAC GTCAAAAGCA AAGGAAGGAA
GCTGATATAA GGATTAAGCT AAATTCATAT ATCACAAATT CCAAATACGA TGACGCCCTA
CAAGATAAGG TAGTATCAGT AAGAGATGGA AGATATGTAG TTCCTGTAAA GACTAATAAG
CGTGCCTTGA TCGGGGGAAT CGTCCATGAC AAGTCATCTT CTGGGAACAC TCTTTTTATA
GAACCAGGCG CCATAGTAGA GCTTAACAAC CAGCTTAGAG ATCTAGAGAT TAAGGAAGAA
GATGAAATCA GGAGAATTCT CGATAGACTT TCTAGACTTG CCCAAGGATT CGATGTAGAG
CTACTAGAAA ATCAGAAGTT GATAGCAAGG ATTGATTTCC TCCAAGCAAA ATCTAGATTT
GCCATAGAAA ATGAGTATAG CCTTCCTATA ATTACAGATG AAAAGAAAGT CGACTTAAAA
TCAGCAAGAC ATCCCCTCCT TCCTGGTAAG GTTGTGCCAA TCGATGTAAG AATTGGAGGA
GACTATACTA CTTTAATAAT CACAGGGCCA AATACTGGAG GTAAGACTGT AAGTCTAAAG
ACTGTAGGAC TAATAAGTGC AATGGCTCAG ACAGCCCTTT TCATACCAGC CTACGAGGGA
AGTAAGCTTT GTGTATTCGA TGATATTTTC CTAGACATAG GAGATACCCA GTCTATAGAG
ATGAGTCTAT CGACTTTTTC AGCATCTTTG ACTAATATCG TCGATATATT GAAAAACTCT
ACGGAAAATT CCTTGGTTCT CTTAGATGAG ATTGGTTCAG GAACAGATCC GGTAGAAGGA
GCAGCCCTTG CTATTTCTAT CCTTAATTCC TTGACCCAAA AGAAAGTCAT GACCTTTTCT
ACAACCCATT ACAGCGAGTT AAAATACTAT GCTGTAGAGA CTTCTGGTGT TATGAATGCA
TCTGTAGAAT TTGATGTCGA TACACTTTCT CCAACCTACA AGCTTGAAAT AGGGACTCCA
GGTAAGTCCA ACGCCTTCGA GATTTCCAAA AGACTTGGTC TTCCTTACGA GATTTTAAAC
AATGCCAAAA ATCTTATAGG AGATGATACC AAAAATATCA ACAAAATCCT TGCAGAAATC
GAAGAAGATA AGAAGGAAAT TGAAGATAAG AATAAGGAAA TAGAAAGCTA TAAGAGAGAA
ATAGCTAAAA TAAGAAATGA GCTTAAGGAA AAATCCAAAA GACTTGATCA GAAGGAAGAA
GATATCTTAA GAGAGGCCGA AGATAAGGCT AACAGCATCC TAGATAAGGC AAATAAGAGA
AGTCAAGACA TGCTAAAAGA AGCCAAGAAG ATGAGAAATG CCAATACTTC CGACATAGAT
AGGTCTCTAA ACAAAATTAG GCATGAATAC AAAGAAGGAA GAATAGAAAG AAAAGGGGAA
GGCCTTTACA CCAAAGAGTC TAAGAATGCA CCAGATAGTC TAAAGGTGGG GGACACTGTT
CTAATAGCAG GACTTAACGA AAAGGCAGAA GTAATCGAAG CTCCTGATAA GAAGGGTAAT
ATCAAGGTAC AAATGGGAAT TCTTAAGATG GATTCTAATA TCAAAAACGT AAGTAAGATT
AAAGGCGACA ATCAAACAGA AAAAAACATC AGAAAAGTGT ATAATACTAA AAAAGCCATG
AATATCTCGC CAACCCTAGA CCTTAGGGGA CAAAGATATG ACGAGGCTAT GAGAAATCTA
GATAAATACC TTGATGATGC AATGCTTGCA GGTCTTTCTA AGGCCAAGAT CATTCATGGA
AAAGGAACTG GCGCCCTAAT CAAGGGAGTC GGAGAAATTC TAGAAGGTGA TAAGAGAATT
GAAGATTACC GTTTCGGCGA TGACAAAGAA GGCGGATACG GTGTTACTAT AGTGAAATTT
GGATAG
 
Protein sequence
MQEKSLKVLE YDKILERLAG CARSNLVKDQ ILKLRPYDDI NYIREELYET SAMVDVIRKN 
GNIDLFGLYD LTEIVAYIRK NGILDPGELL KVLDLLRVSE YLKDYGKNIE DRKIGDIFSR
ISINDFLKNE IDRSIINEEE IADSASSTLR NIRRQKQRKE ADIRIKLNSY ITNSKYDDAL
QDKVVSVRDG RYVVPVKTNK RALIGGIVHD KSSSGNTLFI EPGAIVELNN QLRDLEIKEE
DEIRRILDRL SRLAQGFDVE LLENQKLIAR IDFLQAKSRF AIENEYSLPI ITDEKKVDLK
SARHPLLPGK VVPIDVRIGG DYTTLIITGP NTGGKTVSLK TVGLISAMAQ TALFIPAYEG
SKLCVFDDIF LDIGDTQSIE MSLSTFSASL TNIVDILKNS TENSLVLLDE IGSGTDPVEG
AALAISILNS LTQKKVMTFS TTHYSELKYY AVETSGVMNA SVEFDVDTLS PTYKLEIGTP
GKSNAFEISK RLGLPYEILN NAKNLIGDDT KNINKILAEI EEDKKEIEDK NKEIESYKRE
IAKIRNELKE KSKRLDQKEE DILREAEDKA NSILDKANKR SQDMLKEAKK MRNANTSDID
RSLNKIRHEY KEGRIERKGE GLYTKESKNA PDSLKVGDTV LIAGLNEKAE VIEAPDKKGN
IKVQMGILKM DSNIKNVSKI KGDNQTEKNI RKVYNTKKAM NISPTLDLRG QRYDEAMRNL
DKYLDDAMLA GLSKAKIIHG KGTGALIKGV GEILEGDKRI EDYRFGDDKE GGYGVTIVKF
G