Gene Nmag_2031 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmag_2031 
Symbol 
ID8824874 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatrialba magadii ATCC 43099 
KingdomArchaea 
Replicon accessionNC_013922 
Strand
Start bp2067258 
End bp2069180 
Gene Length1923 bp 
Protein Length640 aa 
Translation table11 
GC content66% 
IMG OID 
Producthelicase c2 
Protein accessionYP_003480163 
Protein GI289581697 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGACTCAGA CGGTGAACCC CGAGCGGATC TTCGACGCCT TTCCCGCGCC CAGCTACCGC 
GGGAACCAGG AGCAGGCCCT CCGCGACATT CGTGACGCCT TCGCGGCCGG CAACGATGTC
GTCCTCGTGC GCGCACCCAC CGGGAGCGGC AAGTCCCTTC TCGCACGGTC CGTCGCCGGC
TGTGCCCGAA CGATCGACGA GGCGGAGCCG AGCGAGGCCG CCGGGGCCTA CTACACGACC
CCGCAGGTCT CACAGCTCGA CGACGTCGCC TCGGACGACC TGCTCGCCGA TTTGAACGTC
ATCCGCGGCA AGTCGAACTA CACCTGTATC CTCCCCCAGG AGCGCAATAC GCCGGTCAAC
CAGGCACCCT GCGTCCGCGA ACGGGGCTAT GACTGCTCGG TCCAGCATCG CTGTCCGTAC
TTTTCGGACC GTGCAATCGC CTCGAATCGC TCGATTGCGG CGATGACCCT CGCGTACTTC
ATGCAGACTG CGGGTAGCGA GGTCTTTCGC AAACGCGACG TCGTCGTCAT CGACGAGGCA
CACGGTCTCG CCGAGTGGGC GGAGATGTAC GCGACGATCC AGCTTGGGCC GCGAACCGTC
CCGTTCTGGG ACGACCTCCG TGTGCCGCAA ATCGACAGTA TCGAACGGGC CGTCCGCTAC
GCCGAGAACC TCGAGCAGAC CTGTACCCGT CGCAAGGACG ACCTGCTCGC ACAGGAGACG
CTCTCGCCTC GCGAGGTCCG CGAACGCGAC CGGCTGCAGG AGCTGATCGG CGAACTCGAC
TGGTTCGTCT CGGACTTTCG GGACCCACAG AGTCCGACGA CGTGGTTGGT CGACCAGTCC
GAGCGGAACG CAGCCAGTAC GGACGACGAG ACCGACGACG AGGAACTCGG CGGTCCCCTG
ACCATCAAGC CGATGAACCC CGAGAAGTAC CTCGCCCACA CCGTCTGGGA CCGAGGCAAC
AAGTTCGCGC TCCTCTCGGC GACCATCCTC AACAAGGCGG CCTTCTGCCG GCAGGTCGGG
CTCAATCCTG ACGACGTCGC GCTCGTCGAC GTCAGCCACA CCTTCCCCGT CGAAAACCGG
CCGCTGTACG ACGTCACCCA GGGGAAAATG ACCTACGAGC ACCGTGACGA GACGACGCCG
GACATCGCCC GTACCATCGT CCGGCTCATG CAGCGCCACC CCGACGAGAA GGGGCTGATT
CACGCCCACT CCTACAACAT TCAGGAGCGA CTCGCCGACC TCCTGCGCGA TTTCGGCGTC
GGCGAGCGTA TTCGCGTCCA CGACCGCGAC GGCCGCGACG CCGACTTAGA GGAGTGGAAA
GCCAGCGACG ACCCCGACGT GTTTATCTCC GTGAAGATGG AGGAAGCGCT CGACCTCAAG
GGCGACCTCT GTCGCTGGCA GGTGCTCTGT AAGGCCCCCT ACCTCAACAC CGGCGACTCG
CGCGTCGCCC ACCGACTCGA GGAAGGCCAG TGGGCGTGGT ACTACCGGAC CGCGCTGCGA
ACCATCATCC AGGCCTGCGG CCGCGTCGTC CGCGCCCCCG ACGACCACGG CGCGACGTAC
CTCGCGGACT CGAGTCTCCT CGATCTTTTC GAGCGCGCGC GAACGGACAT GCCCGACTGG
TTCGCAGCGC AGGTCGACCG CATGTCGACG CCCGAGTTGC CGGCGTTCGA TCCACAGGCG
GCGTGTGACT CGTCCGGACC GGGTGGCCGG CGCGGCTCTG GTCGTGGTGG TGGCTCGGGC
AGGGACTCGA GTACAAGTGG GTCACAATCC GAGTCACCGG GTCAGTCTGC AACTGGGTCG
GATTCGGGGA GTGCGTACAC GCGCTCTCGG TCTCGGTCTG GTTCTCGCTC GCGCTCACAG
TCGGGGTCGT CGAAAGACTC ATCATCGAGT CCGCTCGCAG ATGTCTGGGA TACGGACGGC
TAA
 
Protein sequence
MTQTVNPERI FDAFPAPSYR GNQEQALRDI RDAFAAGNDV VLVRAPTGSG KSLLARSVAG 
CARTIDEAEP SEAAGAYYTT PQVSQLDDVA SDDLLADLNV IRGKSNYTCI LPQERNTPVN
QAPCVRERGY DCSVQHRCPY FSDRAIASNR SIAAMTLAYF MQTAGSEVFR KRDVVVIDEA
HGLAEWAEMY ATIQLGPRTV PFWDDLRVPQ IDSIERAVRY AENLEQTCTR RKDDLLAQET
LSPREVRERD RLQELIGELD WFVSDFRDPQ SPTTWLVDQS ERNAASTDDE TDDEELGGPL
TIKPMNPEKY LAHTVWDRGN KFALLSATIL NKAAFCRQVG LNPDDVALVD VSHTFPVENR
PLYDVTQGKM TYEHRDETTP DIARTIVRLM QRHPDEKGLI HAHSYNIQER LADLLRDFGV
GERIRVHDRD GRDADLEEWK ASDDPDVFIS VKMEEALDLK GDLCRWQVLC KAPYLNTGDS
RVAHRLEEGQ WAWYYRTALR TIIQACGRVV RAPDDHGATY LADSSLLDLF ERARTDMPDW
FAAQVDRMST PELPAFDPQA ACDSSGPGGR RGSGRGGGSG RDSSTSGSQS ESPGQSATGS
DSGSAYTRSR SRSGSRSRSQ SGSSKDSSSS PLADVWDTDG