Gene Information |
Locus tag | Nmag_1162 |
Symbol | |
ID | 8823994 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natrialba magadii ATCC 43099 |
Kingdom | Archaea |
Replicon accession | NC_013922 |
Strand | + |
Start bp | 1186376 |
End bp | 1188301 |
Gene Length | 1926 bp |
Protein Length | 641 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | |
Product | DNA mismatch repair protein MutS domain protein |
Protein accession | YP_003479308 |
Protein GI | 289580842 |
COG category | |
COG ID | |
TIGRFAM ID | |
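The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is a minimal illustration and not part of the database record. It assumes that Start bp and End bp are 1-based inclusive positions and that the unspliced CDS ends in a single stop codon. The function names `gene_length` and `protein_length` are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp 1186376, End bp 1188301.
length_bp = gene_length(1186376, 1188301)
print(length_bp)                  # 1926
print(protein_length(length_bp))  # 641
```

Under these assumptions the results agree with the Gene Length (1926 bp) and Protein Length (641 aa) fields.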
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.906748 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGCGACTCG AGGAGTACTG GGGCGTCGGC CCGAAGACGA GGGAGACACT TGTGTCGGAG CTGGGACGGG AACGCGCGAT CCAGGCGATC GAGAGCGGCG ACGTTCGCGA ACTCGCGACT GCCGGCCTCG CTCGCGGGCG CGCAACACGT ATCTTGCGAC GGGCGACCGG CGGCGACGGA ATCGACATGC TGGCGACGAG CGACGCCCGC GCGGCGTACA AGGACCTGCT CGATCTGGCG GTCGAACACG CCGTCACGCA GCGCGCGGCC GACCGAATCC GCGTCCTCAC GCCGCTCACC AGCCGCGAGG AGATGGAATC TCGCCTCGAC GACGTGCTCG CGGCCCGCGA CGCCTGGGCG ACACTCGAGA AGGCAGACCG CGAGGCCGTC CTCGCAGCCT ACGAGCGCTA CGACGAGCGC GACGAGAGCG AACGCGCTGC CGTCGAAACC GCGCTCGCCC TGCTTGAGGC CGGGGTCGAC TCCGGTCCGT TCGAAACTGT CGCCGAACTC GAGCGCGACA CGCTCACAAC CGCCGCCGAT GCACTCTCCG CGTTCGCAGA CGACGGCGGG CAGGGTCGAC TCGTCCGCGG TGCCGACGAC GAACTCGACC GCCTGCGCGA CGCACTCGGC ACCGTCGAAG ACATGGACGC CAACGCACTC GAGTTGATCG AGGAACTGCG AGACGACGGC GTCCGCGACG TGAGCCAGTT CCGCGAGGCG TTCGAGGACC ACCTGCTCTC GGAGACGGCG GTGACGGTCG ACCAGGTTCG CGAGGCGATG CCGACGGACG CGACCGACGC GACGGATTTC GTGGGGAGTA CGCTTCGGAC CTTGCGCGGC GATCTCACGG CGGCGATCGA CGAACGCGAA GAGCAGGTCG CTGGGGAGTT GCAGGCAGAA CTCGAGGACG CTCGCGACGC CATCGACCAG GCGGTCGCGG CGGTCGACGA CATCGCGTTG CACCTCTCGC TCGCGCGCTT CGCGCTCGCG TACGACTGTA CTCGTCCGAC GTTCGTCGAG GGCGAGTCGG CCGCAGTGTC GGTCGTCAAC GCGCGAAATC TGACGCTCGC CTCGCCGGCG ACCGACTCGA ATGTCGACCA GCGCGATGGT GGCCGGGGCG AGGGTGGCGA TCAGGTCCAG CCGATCACCT ACGCGCTGGG TGAGCATGGG CTCACTGAGG CGGATGCGAT TTCCGGTCGA AGCGGGATTG GTACTGGCGT TGGCACTGGT ACCAGCATCG CCACCGAGAC CGGCGTCGGC ACCGATACCG ACGGCGACGA CGGCTCCAGC GCAAACGCAC TTCCCGGACG GGAACGCGTC TCCGTCCTCA CCGGCGCGAA CAGCGGCGGG AAAACCACGC TGCTCGAAAC GTGTTGCCAG GTCGTCCTGC TCGCTTCGAT GGGACTGCCC GTCCCCGCCG AGCGCGCCGA GGTGACGCCC GTCGACTCGC TCGTGTTCCA CCGCCGCCAC GCCAGTTTCA ACGCGGGAGT ACTCGAGTCC ACCCTGCGCT CGGTCGTCCC ACCGCTGTCC TCGGATGGTC GGACGCTAAT GCTGGTCGAC GAGTTCGAGG CGATAACGGA GCCGGGAAGT GCGGCTGACC TCCTGCACGG CCTTGTGACG CTGACGGTCG AGCGCGACGC GCTCGGCGTC TTCGTCACGC ACCTCGCAGA CGACCTGGAG CCGCTGCCGC CCGAGGCTCG CGTGGATGGT ATTTTCGCCG AGGGACTGAG CCCGGAACTC GAGTTACTCG TGGATTACCA GCCGCGGTTC GATACGGTGG GCCGGTCGAC GCCGGAGTTC 
ATCGTCTCGC GGTTGGTAGC GAACGCGGAT GACCGGGCCG AGCGTGCGGG GTTCGAGACG CTTGGCGAGG CGGTCGGCAA CGACGTGGTT CAGCGGACGC TGGCGGACGC TCGCTGGAGT GAGTGA
Protein sequence | MRLEEYWGVG PKTRETLVSE LGRERAIQAI ESGDVRELAT AGLARGRATR ILRRATGGDG IDMLATSDAR AAYKDLLDLA VEHAVTQRAA DRIRVLTPLT SREEMESRLD DVLAARDAWA TLEKADREAV LAAYERYDER DESERAAVET ALALLEAGVD SGPFETVAEL ERDTLTTAAD ALSAFADDGG QGRLVRGADD ELDRLRDALG TVEDMDANAL ELIEELRDDG VRDVSQFREA FEDHLLSETA VTVDQVREAM PTDATDATDF VGSTLRTLRG DLTAAIDERE EQVAGELQAE LEDARDAIDQ AVAAVDDIAL HLSLARFALA YDCTRPTFVE GESAAVSVVN ARNLTLASPA TDSNVDQRDG GRGEGGDQVQ PITYALGEHG LTEADAISGR SGIGTGVGTG TSIATETGVG TDTDGDDGSS ANALPGRERV SVLTGANSGG KTTLLETCCQ VVLLASMGLP VPAERAEVTP VDSLVFHRRH ASFNAGVLES TLRSVVPPLS SDGRTLMLVD EFEAITEPGS AADLLHGLVT LTVERDALGV FVTHLADDLE PLPPEARVDG IFAEGLSPEL ELLVDYQPRF DTVGRSTPEF IVSRLVANAD DRAERAGFET LGEAVGNDVV QRTLADARWS E
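The sequences in this record are displayed in space-separated blocks of 10 characters. A minimal sketch of how such a display string might be normalized and its GC percentage computed follows. The helper names `clean_sequence` and `gc_percent` are hypothetical and do not describe how the database derived its GC content field. The example runs only on the first two blocks of the gene sequence, so its result is not expected to match the 68% reported for the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two 10-base blocks of the gene sequence shown in the record.
first_blocks = "ATGCGACTCG AGGAGTACTG"
print(clean_sequence(first_blocks))  # ATGCGACTCGAGGAGTACTG
print(gc_percent(first_blocks))      # 55.0
```

Passing the complete gene sequence string to `gc_percent` would give the gene-wide value to compare against the GC content field.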