Gene Nmag_3873 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmag_3873 
Symbol 
ID8826743 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatrialba magadii ATCC 43099 
KingdomArchaea 
Replicon accessionNC_013923 
Strand
Start bp270262 
End bp271653 
Gene Length1392 bp 
Protein Length463 aa 
Translation table11 
GC content60% 
IMG OID 
Productextracellular solute-binding protein family 1 
Protein accessionYP_003481976 
Protein GI289583566 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0964469 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGTAATG ATACGAAAGA TCATGGAAAT GGCGCAACGC AGCGTGCAAC GTCTCGGGAC 
ACGTACCGTC GACGAACTGT CCTGAAGGGT GCAACGACGG GGGCGACGAT CGGTGCACTG
TCGGTGGCTG GTTGTCTGGG CGGCGGGAAC GGCGGCCCAG TGTTGCGAGT CATCAACTCG
GCGTACCAGC AACAAGAGGA CGAGTACCGT GCAATCTTCG ACGAGTTCGA GGAGGAACAC
GATTGTGAGG TGGAGTACAC CCGATCCGAT TTCGCATCCG CGCCATCGGA AGCCGCACAG
GCACAGGCCG GCGGCAATCC GTACGACCTG CTGATGCTCG CCTCGCCGGG GAACAACGTG
TTCGGCGTGC AGGAAGGGCT GTACCAGCCG ATCAACGACG TCATCGAGGA CATGGGTGCA
GAGGACCACT GGCGAGAGGA GTTCCTCGTC CAGATGGACG GCGATTACTA CTTCGCGCCG
AACACAGGGA CAGTTTCGAC GCTCATCTAC CGCGAGGACC TGTTCAACGA GTACGATGCG
CCGATGCCGC CATTTGACTC GTGGGACGAA TACCACACCG CCGCCGAGAC GATGACCGAC
GAGGACGAAA ACCTGTACGG GAGCCCGGTC TTCCTCGGGA GCAACCACTT CCACGGAATC
CTGCCCCTGT CTCTGCTTCA CGGCCGAGGC GGTTCGGTCA TCAACACAGA CGACGAGGTC
GTCTACGACT CCGAGGAGAC GGTCGAGATG CTCGAGTTCA TGCGGGACCT GAACGAATTC
AGTCCACAGG CGGCCCACGG TGCTGACATT CCGGAAATGC GTCCGCCGCT GTACCAGGGG
ACGTACGCGA TGACGTGGTA CTCCACCAAC GTCATTCCGT ACGACATCGA AGAATACAAC
CCGGACCTGA CTGGAGACGT GCAGGTTGCG CCGATTCCAG CGTACGACTC GAGTTACGAG
CCGGTTGCGC GGCTGACCGG CCTGGGCCAC GGACTCGGTG CCGAGACCGA ACATCCGGAG
CTAGCGAAGG ACCTGTTGCG AGAGATCACC TCGTTCGAAG GCGTCATGCG ATTGATGACC
GCCCAGCCGG CGAGTCACGT TCCGGCGATT CACGGCATAC TCGAGGAGGA CGACCTGTGG
GAGACGGACG TTATGCAGGA CTACGAAGAG CACTACCGGG ACCTCGTCGA CATCGCAGAC
GAGTACGGTC GGGTCGTCGC CGTCGAGGAA AACGAGGGTC ACATCAACCC GGTGACGGGG
CAGGCCGTCG CCGAAACACA CGTTATCAGT TCCGTTCAGG ACGTGATCTT GGAGGACGAA
GATCCCCAGG ATGCAGCCGA ACATTGGGCG GACGAAATTC GCGATGACTA CGAAGATCAA
CTGAACGTCT AG
 
Protein sequence
MGNDTKDHGN GATQRATSRD TYRRRTVLKG ATTGATIGAL SVAGCLGGGN GGPVLRVINS 
AYQQQEDEYR AIFDEFEEEH DCEVEYTRSD FASAPSEAAQ AQAGGNPYDL LMLASPGNNV
FGVQEGLYQP INDVIEDMGA EDHWREEFLV QMDGDYYFAP NTGTVSTLIY REDLFNEYDA
PMPPFDSWDE YHTAAETMTD EDENLYGSPV FLGSNHFHGI LPLSLLHGRG GSVINTDDEV
VYDSEETVEM LEFMRDLNEF SPQAAHGADI PEMRPPLYQG TYAMTWYSTN VIPYDIEEYN
PDLTGDVQVA PIPAYDSSYE PVARLTGLGH GLGAETEHPE LAKDLLREIT SFEGVMRLMT
AQPASHVPAI HGILEEDDLW ETDVMQDYEE HYRDLVDIAD EYGRVVAVEE NEGHINPVTG
QAVAETHVIS SVQDVILEDE DPQDAAEHWA DEIRDDYEDQ LNV