Gene Nmag_3203 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmag_3203 
Symbol 
ID8826066 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatrialba magadii ATCC 43099 
KingdomArchaea 
Replicon accessionNC_013922 
Strand
Start bp3320597 
End bp3322522 
Gene Length1926 bp 
Protein Length641 aa 
Translation table11 
GC content64% 
IMG OID 
Productextracellular solute-binding protein family 5 
Protein accessionYP_003481315 
Protein GI289582849 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGACC CCGCCCCGTC AACGAGCCCC GACAACCACA CACGCGACAG CGACGACGCC 
GCTCGCGGCG GTCCCAGTCG CCGGTCGATG CTCGCCGCGA CAGCCGGTCT CACCGTCTCG
ATGAGCGGCT GTATCCGCGA AGTACGAAGC ATCGTCAACC GCGACGAGGT TGCCCCACTC
TCACTGACGA TTTCGACGGT TCCCGCCGAC GGCGACCGAG AGAGTATCCA GCTCGCCCGC
GCCGTCGGCT CGGCACTCGA GTCCGTTGGC GCCGACATCG CGATCGATAT GCTGTCTCAA
GAGGAGTTTC AGCGTGCGAT CCTCGTCAAT CATGACTTCG ATCTCTACGT CGGTCCCCAT
CCCGGGGACA CGACGCCTGA CTTCCTTTAT GAACTCCTGC ACTCGCGATT CGCGGAGGAA
GCCGGCTGGC AGAATCCCTT CGGACTGACG AATCTCGCTA TCGACGACCG GCTGGACGAG
CAGCGTGTCG CCACAGGCGT AGCCGCAGAC GCAGACGGGA ACGGGAACGG GAACGGGAAC
GGAGACGAAG GCGAAAATGG AGACGAAAAC GAGACCAGCG ACCGCGAGGA CGCGATCACG
GCAACACTCG AGACGGTTGC GCGCGAGCAG CCGTTCGTCC CGATTTGTCG GCCGACGGAG
ATCAGACTCG CGCGCTCGGA TCGATTCCAG GGCTGGGGAG AGGGGCATCC AGCGACTCGA
TTTGGCTATC TTGGTCTCGA GCCACAGTCT GATGGGGCGT CTGCGGAGCT TCGGGCAGTA
CACACGGACG CACGGCCGTC ACAGAATCTG AATCCGCTGT CGGCGGAGTA TCGGGAGCGT
GGTCTCTCGA CCGAGTTGCT GTACGACTCG CTTGCAGTGG CGCAGTGGCG TGGTGCTGGC
GACCGTGAGT CGGAGACCGA CACCGCAACC GATGTCAGTC CCTGGCTCGC GAGTGACTGG
GAGTGGAGCG ACGACGACAC CCTCCTCGTC ACCCTCCGGG AGGACTGTGA ATTCCACGAC
GGCACGCGAC TCACTGCCGA GGACGTCGCC TTTACCTACG AGTTCCTCGC GGACACGACC
CGCGGCAGTC GGCGCTTTGC CGCACCAGCC CCCCACTACC GCGGCCGCGT CGCCGCCGTC
GACGACGTGA CCGACCTGAG CGAGACCGAA CTCGAGTTCA CCGTCGCAGC GAGCCAGCCG
GTCGCCGAGC GCGCACTGGA GGTGCCGATC CTCCCGAAAC ACATCTGGGA CGAGCGCACG
GACCAGGCGA GCCTCCGCGG CGTCACGGTT TCGGAGGGGA CGACCGAAGC GCTCGTCACC
GACAACGTGC CCGCAATCGG CAGCGGCCCC TATCAGTTCG ACGAGCGGAC CGACCGTGAT
CACCTTACGC TCGCACGCTT TGACGCACAC TTTACGCGCC GCGACGGCGT CGATCTGCCG
GAACCGACAA GCGAGCACCT GCGCATCCAG ATCGACCCGC GGAGCACGTC CGCGATCGAA
CTCGTCGAAA GCGACAGTGC CGACGTGACG TCTTCGCCAC TCGAGTCCTA CGTCGTCGAT
GACGTGTTCG AAGACGACGA GGCAATTTCG GAAACGACGG TACTCGAGTC ACCATCCTGG
TCGTTCTACC ATATCGGGTT CAACACGCGT CGAGCGCCGC TTTCCAATCC GCGGTTTCGC
AGGGTCGTTG CCCGACTCCT CGACAAGGAA CAGTTGGTCG AGGAGGTGTT CCACGGCCAT
GCACAGCCGA TCGCAACGCC GGTGGTCGAG GAGTGGGTTC CCGACTCGCT CGCCTGGAAC
GGCAGCGATC CGGAGACGCC GTTTTTCGGG ACTGACGGGG AGCTCGATGT CGCGGCTGCG
AGGGATGCGT TCGAAGATGC GGGACTGCGC TACGACGATG ACGGAAGACT CCGGGTGAGA
CACTGA
 
Protein sequence
MSDPAPSTSP DNHTRDSDDA ARGGPSRRSM LAATAGLTVS MSGCIREVRS IVNRDEVAPL 
SLTISTVPAD GDRESIQLAR AVGSALESVG ADIAIDMLSQ EEFQRAILVN HDFDLYVGPH
PGDTTPDFLY ELLHSRFAEE AGWQNPFGLT NLAIDDRLDE QRVATGVAAD ADGNGNGNGN
GDEGENGDEN ETSDREDAIT ATLETVAREQ PFVPICRPTE IRLARSDRFQ GWGEGHPATR
FGYLGLEPQS DGASAELRAV HTDARPSQNL NPLSAEYRER GLSTELLYDS LAVAQWRGAG
DRESETDTAT DVSPWLASDW EWSDDDTLLV TLREDCEFHD GTRLTAEDVA FTYEFLADTT
RGSRRFAAPA PHYRGRVAAV DDVTDLSETE LEFTVAASQP VAERALEVPI LPKHIWDERT
DQASLRGVTV SEGTTEALVT DNVPAIGSGP YQFDERTDRD HLTLARFDAH FTRRDGVDLP
EPTSEHLRIQ IDPRSTSAIE LVESDSADVT SSPLESYVVD DVFEDDEAIS ETTVLESPSW
SFYHIGFNTR RAPLSNPRFR RVVARLLDKE QLVEEVFHGH AQPIATPVVE EWVPDSLAWN
GSDPETPFFG TDGELDVAAA RDAFEDAGLR YDDDGRLRVR H