Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmag_2063 |
Symbol | |
ID | 8824906 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natrialba magadii ATCC 43099 |
Kingdom | Archaea |
Replicon accession | NC_013922 |
Strand | + |
Start bp | 2105767 |
End bp | 2107986 |
Gene Length | 2220 bp |
Protein Length | 739 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | |
Product | DNA mismatch repair protein MutS domain protein |
Protein accession | YP_003480195 |
Protein GI | 289581729 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTAGCAC AGGATGCAGC GGGCGGGAGC TTTTTCTCGC TCCGGTTTGA TACGACCGTC GAGAACATGG ACCTCGAGTC GATTCCGGGT GTCGGCGAGA AGACCGCCCG GGCGCTGTCG GACCTCGACG ATCCCGAGCG GGCGCTGCGC GCGGGCGACG TCGCGACGAT CGCGTCCGCC CCCGGCATCA CGCAGGGACG GGCCGCCCGG ATCGCACGCG GTGCGATCCG CCTCGAACAC GACGACGCTG GCGGCTTCCT CGCCACCGAC CGCGCCAGAG AGATCTATCG CGAGATCCTC TCGTTGCTCA AGCAACGAAC AGTCACGGAC TATGCCGCCC AGCGCGTCGA AACGTTCTAC CCGAGTTCGC GCCGCTCGCG CATCGAGGAG GTCCAGACGT TCGCCCGCGA GGCAATCTCG CGTGAGGCCA GCCCCGCCGT CCTCGAGGCA CTCGAGGGCG TCGAACCGCT CCGAACGCCC GGCGACGTTC GCGTCCGCGA GCGCTGTCTC GCGACGACCG ACGCCGAGCG CTACTCCGCG GCCCGCGAGG CGATTCCCGA ACTCTCCGTC GAAGTCGTCG AGGACGCCCA GGGACTCGCC GAACTCGCTC GCGGCTACTC GACCGTTATC GCGCTCGACG AGTCCTTCGC CGGCGTCACC GTCGAGGGCG ATGTCCAGGT CAGACCGGAC GCACTCGAGT CCCCCGCCGA AATCGTCCCC GAGCGCCCGC TCGCGTTTTT CGCACGAAAT CGGGATCGGA TCACGGCCGC GATCGACGTT CACCGGACCG CTGGCCTCGC AGACGAGGTC GACTGCGACC TCGACGCACT CGAGGACGGC CTCTCGCGAC TCGAACCGGA CGGGACCGTT GCCGGCGACG ACGAACTCGA CCGGCTGACG GTCGCCGCTG ACGATCTGGA CGCCGCGGTC GGCACCGCCG AAACTGTCGC GAACGACCAC CTCCGGGACG CGATCCGCGA GGAGGACGTG ACCATCGAGG GGTCGGACCT CCTCTCGCTT GTCGAACGCG GCGCTGGCGT CGACTCGCTG CTCTCGCGGG AACTCGCCGA CGAGTTCGCC GCGGCCGTCG ACGCCGCGCG CGAGCACCTG ATCGACTCGC TCGACCTCGA CGCGGGCGAG GCGGAACTCG CGCGACGCAT CTTCGGCGAC GAACCGACCT TCCCTGTCGA GCGCGACGAG GACGCCACCG CTCGACTGCG TGAGGAACTC GTCGCAGCCA AGGAGCGCCG TGCTGGCAAA CTCAAGCGCG AACTGGCGGC CGACCTCGCC GATCAGCGCG ACGGCGCGCG GGAACTCGTT CGAGGCGCAC TCGAGTTGGA CGTCGAACTC GCGATTGCCC GGTTCTCCCG GGACTTCGAG TGTACGATGC CGGAGTTCGT CTGGGACGCT GACGGCGGCG GTAGCGGTGG CGGTGGGGGT AGCGGTAGTG GTGACGGTGG CGGTAGCGGT GGCGGTAGCG GTAGCGGTGA CGGTGGCGGT GATGACGCCA CGAACGCCAC CACCGGCTTC ACCATCGAGG CCGGTCGCTC ACCGCTGCTC GACGAACCGC TCGAGAAAAT CGATCCCGTC GACTACGAAG TGTCGGGTGT CGCGCTCCTC TCGGGGGTCA ACAGCGGCGG GAAGACCTCG CTGCTCGACC TCGTCGCGAG CGTCGTCGTC CTCGCGCACA TGGGGCTGCC AGTCCCCACC GAGCGCGCGG AACTGCGCCG GTTCGACGAC TTACACTACC ATGCGAAGAC CCAGGGCACC CTCGACGCTG GCGCGTTCGA GTCCACCGTC CGCGAGTTCG CCGACCTCGC CCAGGGCGGC GAAGGCTCGC TCGTGCTCGT CGACGAACTC GAGAGCATCA CGGAACCGGG CGCGTCGGCG AAGATCATCG CCGGGATTCT GGAAGCCCTG CACGAAAACG GCGCGACGGC AGTCTTCGTC TCCCACCTGG CCGACGAGAT CCGTGAGATG GCCGGCTTCG CGGTGACCGT CGACGGCATC GAGGCCGTCG GACTGGTCGA TGGGGAACTC GAGGTGAACC GCTCGCCGGT GAAGAACCAC CTCGCGCGTT CGACGCCGGA ACTGATCGTC GAGAAGTTGG CGACGGAGGC GACGGAAGCG AGCGCGAGCG ACACGGTGCG TGCGAACGGC GGCACCGAGA CCGTGTCGGA GCCGCAGTTC TACGACCGGC TACTCGAGAA GTTCGACTGA
|
Protein sequence | MVAQDAAGGS FFSLRFDTTV ENMDLESIPG VGEKTARALS DLDDPERALR AGDVATIASA PGITQGRAAR IARGAIRLEH DDAGGFLATD RAREIYREIL SLLKQRTVTD YAAQRVETFY PSSRRSRIEE VQTFAREAIS REASPAVLEA LEGVEPLRTP GDVRVRERCL ATTDAERYSA AREAIPELSV EVVEDAQGLA ELARGYSTVI ALDESFAGVT VEGDVQVRPD ALESPAEIVP ERPLAFFARN RDRITAAIDV HRTAGLADEV DCDLDALEDG LSRLEPDGTV AGDDELDRLT VAADDLDAAV GTAETVANDH LRDAIREEDV TIEGSDLLSL VERGAGVDSL LSRELADEFA AAVDAAREHL IDSLDLDAGE AELARRIFGD EPTFPVERDE DATARLREEL VAAKERRAGK LKRELAADLA DQRDGARELV RGALELDVEL AIARFSRDFE CTMPEFVWDA DGGGSGGGGG SGSGDGGGSG GGSGSGDGGG DDATNATTGF TIEAGRSPLL DEPLEKIDPV DYEVSGVALL SGVNSGGKTS LLDLVASVVV LAHMGLPVPT ERAELRRFDD LHYHAKTQGT LDAGAFESTV REFADLAQGG EGSLVLVDEL ESITEPGASA KIIAGILEAL HENGATAVFV SHLADEIREM AGFAVTVDGI EAVGLVDGEL EVNRSPVKNH LARSTPELIV EKLATEATEA SASDTVRANG GTETVSEPQF YDRLLEKFD
|
| |