Gene Nmag_4050 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmag_4050 
Symbol 
ID8828784 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatrialba magadii ATCC 43099 
KingdomArchaea 
Replicon accessionNC_013924 
Strand
Start bp91861 
End bp93507 
Gene Length1647 bp 
Protein Length548 aa 
Translation table11 
GC content57% 
IMG OID 
Productextracellular solute-binding protein family 5 
Protein accessionYP_003482140 
Protein GI289937538 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAACCCG ATGGCAATAT GCAGTTTAAC CGAAGAGAGC TAATTAGCGC GCTCAGTGCA 
GGGGGTGTAC TCGCGCTCGC AGGCTGTGCG GACCAGGCCG ATGGCGATGG GAACGACGAA
ATTTTCGTCG ACGCACTCGA CTCCGATCCG GGAACGCTCG ACCTACACGA AACAAACCGC
GTGCCGGAGA GTATGTGTCT GACGCCGGTC CACGAGCGAT TGTTTACGAT CGATCCGGAT
CTCGAACCGC AGCCGTGGCT GGCAACGGAG TACGAAACGA ACGACGACGA GACCGAGTAC
GTAATCCAAC TCGAGGAGGG CGTCGAGTTT CACGACGGAA CTGAGTTCAA TGCAGACGTC
GCCAAGTGGA ACCTCGAGCG GGCCGAAGAG AACTCGCCGG ACGCCTGGCA GTTCGGGACG
CTCGAAGAGA TCGATGCAAC CGGAGACTAC GAGTTGACGT TCCATTTCGA GGAGCCACAT
CCGCTGTTCC CACAATACCT GGCAAACGTC CAGATGGGAT TCGCCTCACG GGAGGCAGTT
GAAGCAGCAG GAGACGACTA CGGACAGGAA GAAGTTGTCG GGACAGGACC GCTCGTGTTC
GAAGAGTGGG TGCGTGACGA TGAAATCGTC TATTCACGCA ACGAAGACTA CGACCGAGGG
CCGGATTTCC TCAGCAACGA TGGCCCGATC AACTTCGAGG AGTACCACGT CAGAATCGTC
CCGGAACCAA CGACGCTGCT CAACGAGGTT ACTGTCGGCA ACGTCGACGG AAGCATGATG
ATCGCAGCGA GCGATGCCGA AGATGTAGAG GCGCACGATA ACACGCAACT CGAGCGTGTC
GACGACGCGC ACCCGACCTT CCTGTCAATC AACGTCGAAG CGGAGCCAAC CGACGAGGTA
GAAGTGAGAC AGGCGATGGC ATACGCTGTC GATCAAGAGG CAATAGTCAA CGCTGCATTC
CACGGCGAAG GCTATCCAAT CTACAGTCTG TGTCCCCCAA TGGCTGTCGG TGGACTGGAC
GAAGCAACCG CACGAGAGAC AGGGTACGAG CAAGACCTCG ACACTGCCCG TGAACTTCTC
GACGACGCAG GATGGGAGAA CGACGAAGAA GGAGAAGTCC GGACAAGAAA CGGCGACGAT
CTCTCGGTTT CGTTCTTCGC CTTCGAGATG GAGCCCTACT CGAGTATCGG AGAAGTCACA
CAGGATATGC TCAGCCAGGT CGGCTTCGAA GCCAATCTAG AGATTCTCGA GTCGGGGACA
CTGTACGACA GAGTGGAGGG CGGCGAGCAC AATCTGGTGA CGATGGCACT GAGTGGAGGA
TACATTGCCA ACAACACGCT TGCATCGACC CTTCACAGCC AAAACTATGC GCCCGACGGT
GGGAGCAATT ACTCGCTGTA CCAGAGCGAC GAGTATGACG AGATCATCGA TCAGGCGGAG
GTCGAACCCG ACGATGCCGA GCGAGAGGCG TTGCTTCACG AGGCACAGGA GCACATCCTC
GAGGAGGTCC CCGTCGTGCC ACTTGTAGGC TTCGTCAAGT TCTACGCGGC CAAAAACGAG
ATCAGCGTGG ATGCCTGGAC CGATCACCCA TGGTGGCCAT CTCCTGATCA GTACAACCTC
CATGCGGTGG ATGTTGATCG GAGCTAA
 
Protein sequence
MEPDGNMQFN RRELISALSA GGVLALAGCA DQADGDGNDE IFVDALDSDP GTLDLHETNR 
VPESMCLTPV HERLFTIDPD LEPQPWLATE YETNDDETEY VIQLEEGVEF HDGTEFNADV
AKWNLERAEE NSPDAWQFGT LEEIDATGDY ELTFHFEEPH PLFPQYLANV QMGFASREAV
EAAGDDYGQE EVVGTGPLVF EEWVRDDEIV YSRNEDYDRG PDFLSNDGPI NFEEYHVRIV
PEPTTLLNEV TVGNVDGSMM IAASDAEDVE AHDNTQLERV DDAHPTFLSI NVEAEPTDEV
EVRQAMAYAV DQEAIVNAAF HGEGYPIYSL CPPMAVGGLD EATARETGYE QDLDTARELL
DDAGWENDEE GEVRTRNGDD LSVSFFAFEM EPYSSIGEVT QDMLSQVGFE ANLEILESGT
LYDRVEGGEH NLVTMALSGG YIANNTLAST LHSQNYAPDG GSNYSLYQSD EYDEIIDQAE
VEPDDAEREA LLHEAQEHIL EEVPVVPLVG FVKFYAAKNE ISVDAWTDHP WWPSPDQYNL
HAVDVDRS