Gene Nmag_1162 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmag_1162 
Symbol 
ID8823994 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatrialba magadii ATCC 43099 
KingdomArchaea 
Replicon accessionNC_013922 
Strand
Start bp1186376 
End bp1188301 
Gene Length1926 bp 
Protein Length641 aa 
Translation table11 
GC content68% 
IMG OID 
ProductDNA mismatch repair protein MutS domain protein 
Protein accessionYP_003479308 
Protein GI289580842 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.906748 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGACTCG AGGAGTACTG GGGCGTCGGC CCGAAGACGA GGGAGACACT TGTGTCGGAG 
CTGGGACGGG AACGCGCGAT CCAGGCGATC GAGAGCGGCG ACGTTCGCGA ACTCGCGACT
GCCGGCCTCG CTCGCGGGCG CGCAACACGT ATCTTGCGAC GGGCGACCGG CGGCGACGGA
ATCGACATGC TGGCGACGAG CGACGCCCGC GCGGCGTACA AGGACCTGCT CGATCTGGCG
GTCGAACACG CCGTCACGCA GCGCGCGGCC GACCGAATCC GCGTCCTCAC GCCGCTCACC
AGCCGCGAGG AGATGGAATC TCGCCTCGAC GACGTGCTCG CGGCCCGCGA CGCCTGGGCG
ACACTCGAGA AGGCAGACCG CGAGGCCGTC CTCGCAGCCT ACGAGCGCTA CGACGAGCGC
GACGAGAGCG AACGCGCTGC CGTCGAAACC GCGCTCGCCC TGCTTGAGGC CGGGGTCGAC
TCCGGTCCGT TCGAAACTGT CGCCGAACTC GAGCGCGACA CGCTCACAAC CGCCGCCGAT
GCACTCTCCG CGTTCGCAGA CGACGGCGGG CAGGGTCGAC TCGTCCGCGG TGCCGACGAC
GAACTCGACC GCCTGCGCGA CGCACTCGGC ACCGTCGAAG ACATGGACGC CAACGCACTC
GAGTTGATCG AGGAACTGCG AGACGACGGC GTCCGCGACG TGAGCCAGTT CCGCGAGGCG
TTCGAGGACC ACCTGCTCTC GGAGACGGCG GTGACGGTCG ACCAGGTTCG CGAGGCGATG
CCGACGGACG CGACCGACGC GACGGATTTC GTGGGGAGTA CGCTTCGGAC CTTGCGCGGC
GATCTCACGG CGGCGATCGA CGAACGCGAA GAGCAGGTCG CTGGGGAGTT GCAGGCAGAA
CTCGAGGACG CTCGCGACGC CATCGACCAG GCGGTCGCGG CGGTCGACGA CATCGCGTTG
CACCTCTCGC TCGCGCGCTT CGCGCTCGCG TACGACTGTA CTCGTCCGAC GTTCGTCGAG
GGCGAGTCGG CCGCAGTGTC GGTCGTCAAC GCGCGAAATC TGACGCTCGC CTCGCCGGCG
ACCGACTCGA ATGTCGACCA GCGCGATGGT GGCCGGGGCG AGGGTGGCGA TCAGGTCCAG
CCGATCACCT ACGCGCTGGG TGAGCATGGG CTCACTGAGG CGGATGCGAT TTCCGGTCGA
AGCGGGATTG GTACTGGCGT TGGCACTGGT ACCAGCATCG CCACCGAGAC CGGCGTCGGC
ACCGATACCG ACGGCGACGA CGGCTCCAGC GCAAACGCAC TTCCCGGACG GGAACGCGTC
TCCGTCCTCA CCGGCGCGAA CAGCGGCGGG AAAACCACGC TGCTCGAAAC GTGTTGCCAG
GTCGTCCTGC TCGCTTCGAT GGGACTGCCC GTCCCCGCCG AGCGCGCCGA GGTGACGCCC
GTCGACTCGC TCGTGTTCCA CCGCCGCCAC GCCAGTTTCA ACGCGGGAGT ACTCGAGTCC
ACCCTGCGCT CGGTCGTCCC ACCGCTGTCC TCGGATGGTC GGACGCTAAT GCTGGTCGAC
GAGTTCGAGG CGATAACGGA GCCGGGAAGT GCGGCTGACC TCCTGCACGG CCTTGTGACG
CTGACGGTCG AGCGCGACGC GCTCGGCGTC TTCGTCACGC ACCTCGCAGA CGACCTGGAG
CCGCTGCCGC CCGAGGCTCG CGTGGATGGT ATTTTCGCCG AGGGACTGAG CCCGGAACTC
GAGTTACTCG TGGATTACCA GCCGCGGTTC GATACGGTGG GCCGGTCGAC GCCGGAGTTC
ATCGTCTCGC GGTTGGTAGC GAACGCGGAT GACCGGGCCG AGCGTGCGGG GTTCGAGACG
CTTGGCGAGG CGGTCGGCAA CGACGTGGTT CAGCGGACGC TGGCGGACGC TCGCTGGAGT
GAGTGA
 
Protein sequence
MRLEEYWGVG PKTRETLVSE LGRERAIQAI ESGDVRELAT AGLARGRATR ILRRATGGDG 
IDMLATSDAR AAYKDLLDLA VEHAVTQRAA DRIRVLTPLT SREEMESRLD DVLAARDAWA
TLEKADREAV LAAYERYDER DESERAAVET ALALLEAGVD SGPFETVAEL ERDTLTTAAD
ALSAFADDGG QGRLVRGADD ELDRLRDALG TVEDMDANAL ELIEELRDDG VRDVSQFREA
FEDHLLSETA VTVDQVREAM PTDATDATDF VGSTLRTLRG DLTAAIDERE EQVAGELQAE
LEDARDAIDQ AVAAVDDIAL HLSLARFALA YDCTRPTFVE GESAAVSVVN ARNLTLASPA
TDSNVDQRDG GRGEGGDQVQ PITYALGEHG LTEADAISGR SGIGTGVGTG TSIATETGVG
TDTDGDDGSS ANALPGRERV SVLTGANSGG KTTLLETCCQ VVLLASMGLP VPAERAEVTP
VDSLVFHRRH ASFNAGVLES TLRSVVPPLS SDGRTLMLVD EFEAITEPGS AADLLHGLVT
LTVERDALGV FVTHLADDLE PLPPEARVDG IFAEGLSPEL ELLVDYQPRF DTVGRSTPEF
IVSRLVANAD DRAERAGFET LGEAVGNDVV QRTLADARWS E