Gene Nmag_3041 details

Gene Information

Locus tag: Nmag_3041
Symbol:
ID: 8825901
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Natrialba magadii ATCC 43099
Kingdom: Archaea
Replicon accession: NC_013922
Strand:
Start bp: 3137695
End bp: 3139929
Gene Length: 2235 bp
Protein Length: 744 aa
Translation table: 11
GC content: 63%
IMG OID:
Product: protein of unknown function DUF87
Protein accession: YP_003481155
Protein GI: 289582689
COG category:
COG ID:
TIGRFAM ID:
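
The coordinate and length fields in this record are mutually consistent, which a little arithmetic confirms. The sketch below assumes that the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


length_bp = gene_length(3137695, 3139929)
print(length_bp, protein_length(length_bp))  # 2235 744
```

Both values match the Gene Length and Protein Length fields listed for Nmag_3041.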


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGGACGCAA GCACTCCCTT GCCGACAGCG ATGGCTGCCG GATGGCGAGT GCACGGTGGC 
GTGGTCGGGA TCATCGGAAC GATCGATACG TTGAGCGGCA GTGACATAAT TGTTTCTCAG
GTATGCTGTG GGTGGACTTA CCACGACCTG AGAGGCCAGC ACGCCGAGGA CCGGTCGTCC
AGACGGACTG TCACGCAGAG GCCAATATTC AAGGTCGCGT GTTCGGACTG TACTGATAAT
GACCGAGTCG CGCGCAGTCG GGGTGGGATG GCGGTGAGCG AGCAACGCGA GGTGCTCGTG
GGCGAAACGG CCGACGGCAC GGATCTACAC CTGCCGGTCG TCGAACTGCT GACCGGCCGG
GGATTCGTCA CCGGCAAGTC CGGATCCGGG AAATCAAACA CCGCTTCCGT CATCGCGGAG
GAGTTGCTCG AGGCCGGCTT CCCCCTCCTC ATTGTCGACA CCGACGGCGA GTACTACGGC
CTCAAAGAGG AGTACGAGAT GCTCCACGCC GGGGCCGACG AGGAGTGTGA CATCCAGATC
GGGCCCGAGC ACGCCGAACA GATGGCGACC CTCGCCTTAG AGGAGAACGT CCCCGTCATC
CTCGACGTTT CGGGCTATCT GGACGAGGAC GTTGCAAACG AGCTGTTGCG CAAGATCGCC
CGCCAGCTGT TCGTCAAGGA GAAGAAACTC AAGAAACCGT TCCTGCTCGT CGTCGAGGAA
GTTCACGAGT ACATCCCCGA AGGCGGCGGC GTCGGCGAGA CCGGCAACCT CCTGATCAAG
ATCAGCAAGC GCGGGCGCAA GCACGGCCTC GGCATCCTCG GCATCAGCCA GCGCCCCGCC
GACGTCAAGA AGGACTTCAT CACGCAGGCG AACTGGCTCG TCTGGCACCG CCTCACCTGG
GACAACGACA CCAAGGTCGT CGGCCGGATC ATCGACACGG AATACTCGGA GATGGTCTCC
GATCTCGACG ACGGCCAGGC GTTCGTCCAG ACCGACTGGA CCGATATCGA CGTCCGCAAG
GTCCAGTTCC GCCGCAAGCG CACCTTCGAC GCCGGCGCGA CGCCCGGTCT CGACGACTTC
GAGCGCCCCG AACTCAAGTC CGTCTCCGAC GCCCTCGTCG GCAACTTACA GGACATCTCA
GAACGCAAGG ACCGCGAACA GGACCGCATC CACGAACTCG AGAACGAACT CGAGAAGAAA
GAGAAACGAA TCGAGACCTT AGAGGACGAA CTCGAGTCGG CCCGCGACGT CTCGAGTGCG
GCGAAGCAGA TGGCCGACGC GCTGAGCGGC CGGAGTACGG TGCAGACGCA ACTTGGCGGC
GGAAGCGATG AGGAGTTGCG GCGATTGCAC GAAGAGGTCG TCGAACTGGA GGATGAGCGG
GACGAACTGG CGGACGAGCG AGATGCGTTG GAGACCGAGC GCGACGAACT ACGCGAGCAA
CTCGAGTCCC GCGAGAAACG CATCGAGTCG TTCGAGGGGA CACTCGAGTC GCGCGCGGAG
ACGGTAGCGA GTTTGCGGTC CCAGCGGGAT CGGTTGCGGT CGCGGGTGCG AGAGCTGGAA
CGGGAGCTTC GTAGCGGCGA AACTGCGTCT ACTGGTGTCG CTGCTGATTC AGACACAGCC
TCCGCCGCTG AAGCGGAGTC CTCGACTGAC ACAATCGTGC AAGCGGGCGG CACGCCGGTC
GAACTCGGCT ACGTGACCGT CGACGACGAG GACGAGGACG AGGCGGACGG CGATACCAGT
GACGACATCC ACCCCCGATT TATCGTCGCT AACGACCAGG ACGAGTACGA CCTGCGCGAT
GTCGTCACGC TCTTCGAGAC CACCTGGATC GGCGACCGGC TCCACCGTGC CAGCGAGGGC
TCTCGCTGTA CCGTCGAAAC CGCCGCCCAC GTGCTGGAGG TGCTCGCGCG CGAGGGTGCA
CTCGAGACGG AGCAGATCGC GAGCAGGGTC GATCGGTCGA CGGTGGCAGT CCAGAGCCTG
CTTTCGGAGC TCCGAACGGA GTCGATTCTC GATCGGACGG AGGTACGGTC GTACGAACTC
GCGGGTCCAG TGCGGGCGAA GTTACACGCA TTGGCGACCG ACGAGTACGA CGCGGAGGTT
CCTGATGAAC AGACAGGGAA CGAGGACGCC GACGCCGATG GAGACGGAGA CGGAGACGGG
GACGGAGACG GGGACGAGGG CGATCAGACA CAGAAAGACC ACAAACAGGA CCGGCGAGAA
ACTCACTCCC ACTGA
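
The GC content field of this record can be recomputed from the gene sequence above. The following is a minimal sketch, assuming the value is the plain percentage of G and C bases over the full sequence; `gc_content` is a hypothetical helper, and the blanks that separate the 10-base groups are ignored.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = bases.count("G") + bases.count("C")
    return 100.0 * gc / len(bases)


# First 20 bases of the gene sequence, used here only as a small example.
print(gc_content("ATGGACGCAA GCACTCCCTT"))  # 55.0
```

Passing the complete gene sequence to the same function gives the figure to compare against the GC content field.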
 
Protein sequence
MDASTPLPTA MAAGWRVHGG VVGIIGTIDT LSGSDIIVSQ VCCGWTYHDL RGQHAEDRSS 
RRTVTQRPIF KVACSDCTDN DRVARSRGGM AVSEQREVLV GETADGTDLH LPVVELLTGR
GFVTGKSGSG KSNTASVIAE ELLEAGFPLL IVDTDGEYYG LKEEYEMLHA GADEECDIQI
GPEHAEQMAT LALEENVPVI LDVSGYLDED VANELLRKIA RQLFVKEKKL KKPFLLVVEE
VHEYIPEGGG VGETGNLLIK ISKRGRKHGL GILGISQRPA DVKKDFITQA NWLVWHRLTW
DNDTKVVGRI IDTEYSEMVS DLDDGQAFVQ TDWTDIDVRK VQFRRKRTFD AGATPGLDDF
ERPELKSVSD ALVGNLQDIS ERKDREQDRI HELENELEKK EKRIETLEDE LESARDVSSA
AKQMADALSG RSTVQTQLGG GSDEELRRLH EEVVELEDER DELADERDAL ETERDELREQ
LESREKRIES FEGTLESRAE TVASLRSQRD RLRSRVRELE RELRSGETAS TGVAADSDTA
SAAEAESSTD TIVQAGGTPV ELGYVTVDDE DEDEADGDTS DDIHPRFIVA NDQDEYDLRD
VVTLFETTWI GDRLHRASEG SRCTVETAAH VLEVLAREGA LETEQIASRV DRSTVAVQSL
LSELRTESIL DRTEVRSYEL AGPVRAKLHA LATDEYDAEV PDEQTGNEDA DADGDGDGDG
DGDGDEGDQT QKDHKQDRRE THSH
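
The protein sequence above corresponds to a codon-by-codon translation of the gene sequence. The sketch below is one way to reproduce it using only the standard library. It relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code (the two differ in the permitted start codons), so a standard codon table is sufficient here. The names `CODON_TABLE` and `translate` are illustrative.

```python
BASES = "TCAG"
# Amino acids in NCBI order (first base, then second, then third, each over TCAG).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(sequence: str) -> str:
    """Translate DNA from its first base, stopping at the first stop codon."""
    dna = "".join(sequence.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


print(translate("ATGGACGCAA GCACTCCCTT G"))  # MDASTPL (start of the gene)
print(translate("ACTCACTCCC ACTGA"))         # THSH (end of the gene, TGA stop)
```

The two fragments are the first 21 and last 15 bases of the gene sequence, and their translations match the first seven and last four residues of the protein sequence.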