Gene Nmag_3996 details


Gene Information

Locus tag: Nmag_3996
Symbol:
ID: 8828730
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Natrialba magadii ATCC 43099
Kingdom: Archaea
Replicon accession: NC_013924
Strand:
Start bp: 36475
End bp: 38166
Gene Length: 1692 bp
Protein Length: 563 aa
Translation table: 11
GC content: 55%
IMG OID:
Product: extracellular solute-binding protein family 5
Protein accession: YP_003482091
Protein GI: 289937489
COG category:
COG ID:
TIGRFAM ID:
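
The length fields above follow from the coordinates. As a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon, the arithmetic can be checked as follows. The function names are illustrative and not part of any database API.

```python
# Values copied from the Start bp and End bp fields of the record.
START_BP = 36475
END_BP = 38166


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    n_bp = gene_length(START_BP, END_BP)
    print(n_bp, protein_length(n_bp))  # prints "1692 563"
```

Both results agree with the Gene Length and Protein Length fields of the record.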


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGAAATG GAAACACGCA ACTTGATCGA AGGTCGTTTT TAAAAAGAAC CTCCGCTGCT 
GCCGGTGCTG CGTCGATAGC AGCAACTGCC GGCTGTCTCG GCGGCGAGAA TGGTGGCGAT
AGCGATATCG ACTACGACGA CGAAGAAGAC GCAGGTGAAC CCTTCCCAGA ATATACGTTC
TACAACAACC CACAGGATTA CAATCCGCAA CGTCACGACC TGATCAACCT TATTGCTGAA
CAGTGGCAGG AAGTTGGCTT CGATGTCGAA GTCGAGGTAC TCGAGTGGGC GACACTCCTC
TCTCGAGTCT CCGACGAGTA CGAGTTCGAT TTCGCTGCCT GGTCGCAATA CCAGTCGCCC
GATCCTGCGG AGAACGTAGC GGATCGGTGG TCACCAGAGC ATGCAGAGGA GCCTGGACGA
GGTAATTACA ACCAGTACCA GAACGATCGG GTTGGTGAAC TCATCGACGA ACAGTTGGCG
GCTGAAGAGA TGGAGGGACG GGTAGATGCC TTCCACGAGA TCCAGGATAT CCTCGCAGAA
GATGCTCCGT TCAGTCCGGT CTGCTACGAA ACACAGCTTA TTCCCTACCG AACCGACGAA
CTTGATGGGT GGGTCGAGCA TCCAGCTGGT CCGGACCGAA TTCATCAGTA TGCCAACGTC
GAACCGATGG ATCAGAACGA AGACGGATAT CTCCGTGGAT TCTGGACTGA GGCACTCGAG
AACCTGAACT CGTGGTCTCA CGAAGGCCTG AGCAAACATC TCCACATACA GGACGCAATA
CAACTTCGGG TAGCCCAAGT TGACGCGGAC CTCGAACTCG ATCCTGAACA CGGACTCGCG
CAGGATATAG AGCGACCGGA CGAGACGACG ATTCGATTCG AGATTCGTGA CGATGTCGAG
TGGAGCGACG GCGAACCGCT TACTCCCGAC GACGTCGCAT TCACCTACAA TACGATCGCT
GAACAGGAGC CATCGACATA CACCACTGTT TCCAACTACG TCGAAGGGGC GTCGGTTGAT
GGCGACTGGG TCGAAGTCGA CCTCTCACAG GAGATTGGCA GAGCCGCACT GTTGCTCATT
GGGTATGAAG TGTACGTTGC TGCCGAGCAC GTCTGGGAAG GCACTGATCC CGTTCAAGAC
GAACTCGTTG AAGAACCAGT CTGCAGTGGT CCGATGGAGG TGGACTACTG GGATGTCGGT
CAGGGGATCG AGTTAGAGAC GAGAGACGAT CACCCGATCG ATCTCGCAGT TGATGGGTTG
TTCTGGGAGA TCATTCCTGA GAGTTCGACG ATCTGGTCGT ACACCGAAGA TGGGACGATC
AACTACCATC CGTTTGCACA GCCCGGACTC GAGCTACAGG ACGGAGAGGA GGAAATAGAC
GACTTCGAAG TATTCGAGGC CCCTGGGGAT GGATGGACAC ACCTCAATAT CAATACGACG
AGCGAAGGGT TGGACGAAGT CGCTGTTCGA CAAGCACTCG TTCACGCTCT CCCCAAAGAG
GCGGCTAGCG AACAGTTGTT CTACGGCTAC ATGCCTGTCG CACACTCCTA TGTCGCGCCC
GCATTCGGAC CGCTTCACCG CGAACACGAA GACCTCCCGT TCACTGGCGA AGGGACCATT
GCAGCGGCGG CAAGCCATCT GCAAGAGAAC GGGTACGTGG TAACCGAGGA CGGCGTCTAC
TATTCAGAGT AG
 
Protein sequence
MGNGNTQLDR RSFLKRTSAA AGAASIAATA GCLGGENGGD SDIDYDDEED AGEPFPEYTF 
YNNPQDYNPQ RHDLINLIAE QWQEVGFDVE VEVLEWATLL SRVSDEYEFD FAAWSQYQSP
DPAENVADRW SPEHAEEPGR GNYNQYQNDR VGELIDEQLA AEEMEGRVDA FHEIQDILAE
DAPFSPVCYE TQLIPYRTDE LDGWVEHPAG PDRIHQYANV EPMDQNEDGY LRGFWTEALE
NLNSWSHEGL SKHLHIQDAI QLRVAQVDAD LELDPEHGLA QDIERPDETT IRFEIRDDVE
WSDGEPLTPD DVAFTYNTIA EQEPSTYTTV SNYVEGASVD GDWVEVDLSQ EIGRAALLLI
GYEVYVAAEH VWEGTDPVQD ELVEEPVCSG PMEVDYWDVG QGIELETRDD HPIDLAVDGL
FWEIIPESST IWSYTEDGTI NYHPFAQPGL ELQDGEEEID DFEVFEAPGD GWTHLNINTT
SEGLDEVAVR QALVHALPKE AASEQLFYGY MPVAHSYVAP AFGPLHREHE DLPFTGEGTI
AAAASHLQEN GYVVTEDGVY YSE
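
The gene and protein sequences above can be cross-checked by translating the DNA. The sketch below is a hedged illustration and not the tool that produced this record. It assumes the sequence is given 5' to 3' in the coding frame. It uses the standard codon-to-amino-acid assignments, which NCBI translation table 11 shares with table 1; the two tables differ only in which codons may act as start codons, so an alternative start codon such as GTG or TTG would need extra handling that is not shown here. `FIRST_LINE` is the first line of the gene sequence, and the helper names are illustrative.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; uppercase."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of bases that are G or C."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence from the record.
FIRST_LINE = "ATGGGAAATG GAAACACGCA ACTTGATCGA AGGTCGTTTT TAAAAAGAAC CTCCGCTGCT"

if __name__ == "__main__":
    print(translate(FIRST_LINE))  # prints "MGNGNTQLDRRSFLKRTSAA"
```

The 20 residues printed match the start of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` allows the same comparison against the complete protein sequence and the GC content field.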