Gene Nmag_1079 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmag_1079 
Symbol 
ID8823910 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatrialba magadii ATCC 43099 
KingdomArchaea 
Replicon accessionNC_013922 
Strand
Start bp1102459 
End bp1103643 
Gene Length1185 bp 
Protein Length394 aa 
Translation table11 
GC content63% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003479225 
Protein GI289580759 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.223586 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACTACG GAACCGGACC AACCGGGGAC GAGGGCGACT CGTTATCCGC GCTCTTTGCC 
TCTCTCGCCC GCGAGCCTCG TCGTCACCTC CTCGGCGTGC TGTACGAGCA CGCCTCCGAT
TCACTCTCAC TGTCCGCGTG CGCAACGCGG GTCGTCTCCA GAACGACGGA CACACCGCGT
GAAAACGTCT CGGAAACGGC CATACAGCAA CTGCGCGTCT CGCTCCATCA CGTTCACCTG
CCAAAACTCG CCGACGCCGG ATTGATCGAC CGCGACACTG CCACTCAGAC TGTGACGCTC
GCAGATCACT CTGCGTATCG GGATTCAGCG ATCGTCAACA CGATTCGGTC AGCGGACAGG
GCACGTGCCG ACTCGCTCGA TGCGGTGTTC GATGCGCTCG CAGACTCCCG CCGGCGTAGT
ATCCTCGCCT GTCTCAATCA CTCCTTTCAG GAAATTCACC TCGAAACGCT TGCCCGAGAT
GTCGCGACGA GAGAGCAGGC CACCACCGAC ACGACAGCAC CCGAGTCCGG ACTGGTCACC
GACCAGCTCC TCGCCAGCCT CGAACACACG CACCTCCCGA CGCTTGCGGC AGCAGACCTG
ATCGACTTCG ATACCGATGC GCGGACGGTT TCCTACAGCG GCCATCCGGC TCTGTGTGTC
TCCTGGTTGC ACTCTGTTCT CAACCCCGAC CTCCGGATGC ACCTGACAGA GCCCTCGCCG
GACGACGGCG TCCGATCGAT CGATGGACGC GAGGCGATCG TCGCCTACAG CCAGTCGCTC
CTCGAGCGAG CCGACGAGGA ACTATTCTCG GTCTTCACGT CGCCGCGGCT ACTCGAGTCC
GGTTGCTTTG CGCGCGCTAT GGACGCGGCC CGACGTGGAG TCGACGTCTA CCTCGGCACG
ACCGATCCGG TCGTCCGTGA ACTCGTCCGA GCGAACGCTC CAACAATCTC TCTCTGGGAA
CCGACCGACG AGTGGCTGTC CCTCTCCGTC CAGGGAGAGA CTGTCGGCCG TCTCGTACTC
GCCGACCGCG AGTCGCTCCT GTTCGGGTCG CTCGGCGAAC GGCTCGAGAA CCACCGCTAT
GCGGAAACAG CACTAATTGG CGACGGAGAG GCCGCCCACC AGCTTCTGGG AGCCCATTTC
GACCGGATCG ATCAGAAAGT GCAGGAACTC GAGTCGGCTT CGTGA
 
Protein sequence
MDYGTGPTGD EGDSLSALFA SLAREPRRHL LGVLYEHASD SLSLSACATR VVSRTTDTPR 
ENVSETAIQQ LRVSLHHVHL PKLADAGLID RDTATQTVTL ADHSAYRDSA IVNTIRSADR
ARADSLDAVF DALADSRRRS ILACLNHSFQ EIHLETLARD VATREQATTD TTAPESGLVT
DQLLASLEHT HLPTLAAADL IDFDTDARTV SYSGHPALCV SWLHSVLNPD LRMHLTEPSP
DDGVRSIDGR EAIVAYSQSL LERADEELFS VFTSPRLLES GCFARAMDAA RRGVDVYLGT
TDPVVRELVR ANAPTISLWE PTDEWLSLSV QGETVGRLVL ADRESLLFGS LGERLENHRY
AETALIGDGE AAHQLLGAHF DRIDQKVQEL ESAS