Gene Nmag_3647 details


Gene Information

Locus tag: Nmag_3647
Symbol:
ID: 8826515
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Natrialba magadii ATCC 43099
Kingdom: Archaea
Replicon accession: NC_013923
Strand:
Start bp: 30363
End bp: 32033
Gene Length: 1671 bp
Protein Length: 556 aa
Translation table: 11
GC content: 60%
IMG OID:
Product: extracellular solute-binding protein family 5
Protein accession: YP_003481757
Protein GI: 289583347
COG category:
COG ID:
TIGRFAM ID:
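
The coordinates and lengths in the table above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 30363
END_BP = 32033
GENE_LENGTH_BP = 1671
PROTEIN_LENGTH_AA = 556


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)  # True
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)  # True
```

Under those assumptions the listed gene length (1671 bp) and protein length (556 aa) agree with the coordinates.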


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0862095
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTCAATG AAGACACGAC ATCCGACATC GATCGACGCC GATTCATCAA ATCGACGGCT 
GCTATCGGCG CAGCAGGGCT CTTTGCCGGG TGCGTTGGAA CCGATCCAGA TGAAACCGGT
GATGGGGGTG GGACGTTCCG GATCGCGACA CCAGACGAGG TCGAGACGTT CGATCCGCGA
ATGAATCAGA TGGTGTGGTA CAGTACGGCT GCACACTACC TGTTCGACTC GCTTTTCATG
TTGGAACCGG ACGGCAGTGG CGCGGTCCCA CACCTTGCCG ACGGCGAACT CGAGGAGGTC
GACGAAACGA CGTTCGTCGT CGACATCCGC GATGACGTCA CCTTCCACAA CGGTGACCAG
CTGACCGCGG AGGACGTGGC GTACTCGCTC AACTGGGTCC GTGACCCGGA CAACGACTCG
CCCAACCTCT CGAACGTCGA GTTCATCGAC GAAGCCGAGG CGACTGGTGA GTTCGAGGTC
ACACTCCACC TCGACTTCCA GTTCGCGCTG ATGGAACGGG AGTTATCCTC GATGAACGCC
GCGATCGTTC CAATGGATGC CGCCGAGGAG ATGGGACAGG AGGAGTTCGC TCAGAACCCA
ATCGGCAGTG GGCCGTTCGA ACTCGCGAAC CACGATCCGT CGGCCAGCGT CGAACTCACG
GCCAACGACG ACTACTTCCT CGGCGAACCC ACCCTCGGCG GGGTCGAGTA CCGGATCATT
CCAGAGGCAG AGGTTGGGTT CGTCGAACTG GCGGACGGTA CTATTCACCA GTCGAGTGTG
ACGGAAGCCC TCGTCGACGA AGCTGAATCG AACGACAACG TCAACACGTA CCAGATCAGT
GATTTCAACT TCCAGGGGTT CATCATCAAT TGTCTGGAAG GCCCGTTCGT GGACACCAGG
GCTCGCGAAG CCGTTCAGTA TCTCGTCGAC TATGACGAAC TGCTGACTGG TGCGGTCGGT
GACCTCGGTA GTCGAAGTGT TGCCCACATG CCCCCGGGTG TCGCCGAAGC ATGGGATTTC
CCAGCCGACG AGTGGCGAGA ACAGTACTAC CCGGAGCAAG ACCACGACCG TGCCGTCGAA
CTGTTCGAGG AGGCTGGACT CGGCACCGAC TTTGAAGTTG ACATTGTGAC CATGTCTGGC
GAGTCGACGA CGGGACGATC CACTGTCTTA CAGCACGAGT TCCAGGAAGT CGGAATCGAT
GCATCGGTCA GAGAGGTCTC AGACGGCGAA TGGCTGGATG CACTCGATAC CGGGGACTAC
GACATCAACA CCTATGGCTG GGGCGGCGGC GACGACCCGG ATGGCTACTA CTACCGGATG
TTCCGTGACC TCGCAAACGA CGACGGTGGC ATGAGCGACG ATGTCGTCGG CCATTCGTCG
ATCGGCTACC TCTACGAGGG CGCTCGAGAT CGCGGCGACG ACGAATTACT CGACGAGCTC
GAACGACTGG ACGAACTCGT TCGTGCAGCG CGAGAGACGA CGGATCGAGA CGAACGCTAC
GAGTACTACG TCGAGGCAGT CGACCTCCTC ATGCCACTGC ATCCGGTCAT CGGCGTCTAC
TCCGCCGAGG GTATAACTGG CGTCCACACG GACGTACAGG ACTACGAGCC GAGTCCGTTC
GGCGAGCAGG AGGCGTTCAA CCAGTGGCAA GAGGCGCGGA TCGACGACTA G
 
Protein sequence
MFNEDTTSDI DRRRFIKSTA AIGAAGLFAG CVGTDPDETG DGGGTFRIAT PDEVETFDPR 
MNQMVWYSTA AHYLFDSLFM LEPDGSGAVP HLADGELEEV DETTFVVDIR DDVTFHNGDQ
LTAEDVAYSL NWVRDPDNDS PNLSNVEFID EAEATGEFEV TLHLDFQFAL MERELSSMNA
AIVPMDAAEE MGQEEFAQNP IGSGPFELAN HDPSASVELT ANDDYFLGEP TLGGVEYRII
PEAEVGFVEL ADGTIHQSSV TEALVDEAES NDNVNTYQIS DFNFQGFIIN CLEGPFVDTR
AREAVQYLVD YDELLTGAVG DLGSRSVAHM PPGVAEAWDF PADEWREQYY PEQDHDRAVE
LFEEAGLGTD FEVDIVTMSG ESTTGRSTVL QHEFQEVGID ASVREVSDGE WLDALDTGDY
DINTYGWGGG DDPDGYYYRM FRDLANDDGG MSDDVVGHSS IGYLYEGARD RGDDELLDEL
ERLDELVRAA RETTDRDERY EYYVEAVDLL MPLHPVIGVY SAEGITGVHT DVQDYEPSPF
GEQEAFNQWQ EARIDD
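
The sequences above are printed in space-separated blocks of 10 residues, so they need to be joined before any analysis. The following is a minimal sketch, assuming the blocks are pasted in as plain text. The helper names are hypothetical, and the start-codon check is deliberately simplified to ATG, although translation table 11 also permits alternative start codons.

```python
def clean_sequence(text: str) -> str:
    """Join block-formatted sequence text into one uppercase string."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def looks_like_complete_cds(seq: str, stops=("TAA", "TAG", "TGA")) -> bool:
    """Simplified check: length divisible by 3, ATG start, stop codon at end."""
    return len(seq) % 3 == 0 and seq.startswith("ATG") and seq[-3:] in stops


# First and last lines of the gene sequence listed above.
first_line = "ATGTTCAATG AAGACACGAC ATCCGACATC GATCGACGCC GATTCATCAA ATCGACGGCT"
last_line = "GGCGAGCAGG AGGCGTTCAA CCAGTGGCAA GAGGCGCGGA TCGACGACTA G"

print(len(clean_sequence(first_line)))  # 60
print(clean_sequence(last_line)[-3:])   # TAG
```

Passing the full gene sequence through `clean_sequence` and then `gc_fraction` gives a value that can be compared with the GC content reported in the Gene Information table.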