Gene Nmag_0005 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmag_0005 
Symbol 
ID8822819 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatrialba magadii ATCC 43099 
KingdomArchaea 
Replicon accessionNC_013922 
Strand
Start bp6750 
End bp8549 
Gene Length1800 bp 
Protein Length599 aa 
Translation table11 
GC content60% 
IMG OID 
Productextracellular solute-binding protein family 5 
Protein accessionYP_003478167 
Protein GI289579701 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.369822 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACGGG TCGCTCCGTC GGTGACACGA CGGTCCCTCC TTGCCGGCGC AAGCGGTGTC 
GGGTTGAGCG CACTCGCCGG GTGCTCGGAG CGATTCTGGT CGCGAGCGGA GAACACGGGC
CCTGATCAGG TCGAACTCAC GATTAAGACG GTCCCTGCCG ACGACGACGC GATCGCGGCG
ACGATCATGA GTCAACTTCG CGAAAATTTC CAGGCCGCTG GCATCGATGC GACCCACGAG
CCGATCGCCG AAGCGGAACT GTACCGTGAC GTTCTCATCG ACGGCGATTA CGACGTCTTC
GTCCTCAAAC ATGCCGGACT CGACGAGTAC GACGCCCTCC ATGGACTGCT TCACTCGCAG
TTTGTCACCG AACGGGGCTG GCAGAATCCG TTCCAGTTCT CCGATATTCA CGCAGACGAG
TTGCTCGACG AGCAACGGGC AACTGACAGG GCAGAGCGCG AAGAGACACT GACTGAACTG
TTCAACTATC TGTTAGAGAC GGTGCCGTTC ACCGTCGTCG CCTACCCCGA CCGCCTCGGC
GGTACTCGCG AGACACTCTC GGTCTCACGA CCACCGCGGC GCGCAATCGA AATCATCGAT
ATTCTCTCAC AGGAGTTCGA GGATGGCCCC TTCGACCGGC CGCTCGAACT CGGCATCTAC
GGTGAGGCGC TGACGGACCG ATTGAATCCT CTCATCGTCG ACCGTAACCG AGTGCAGGGG
TTGATGGAAT TACTGTACGA CCCGCTCGCC CGCCAGTCAC TAACTGACGA GTCAGAGCCG
GGCGAGCCAG CGACGTACAG CCCGTGGCTC GCCGAAGAAA TCGAGTGGGA CGAGGAGAGT
TCGCTCACCG CGACGGTGAC GCTTCGTTCT GACCTTCACT GGCACGATGG CGAACCGCTC
GATGCCGACG ACGTCGATTT CACGTTTCGG TTCCTCGGGG ATACGTCCCT CGGAGCCGTC
GATGGAACGA TCCCGGCACC CCGGTATCGG GACAGACAGA CGCTTATCGA CGACAGTGAT
CGGATCGAGG TGCTCGACGA TCGAACAGTT GTGCTCCCGT TCGGCTCTAC CGCTCGCCCG
GCTGCGACGC GTATCCTCTC GGTCCCGATC CTGCCCCAGC ACATCTGGGA GCCACTATCC
GAGGTCATCG CCGAGCGCCA AACCAGGGCA CTCGTCCACG ACAACGAGGA ACCAGTCGGC
TCCGGTCTCT TCGAACTCGT CGAAACATCG GCGGACGAAC TCGTCCTCGA GCCGTTCGAA
GATCACGTGC TTCGCTCGTC TACTGTTCCA AATCGCCCCA GTATCCTGGA ACACTTCTCG
CAGTTCGAGG GCATTCGATA CCGAATTGAT CCGAACGTCG GTGCCATGCT CGACGCGCTC
GAAGACGGAG AGATCGATGT CACGGCCGGT GCAGTCCCTC CCGACTCGGC CGCACAACTC
ACCGATGCAG ACGATGTCTC GATGGTCACC ACCTCGACGA GTTCGTTCTA CATGGTCGGG
TACAACATCC ACCACCCCGA ACTCGGCAAT CCAAACTTCC GACAGATCGT CTCACAGTTA
ATCGACCGCG AGTATGTCGT CTCTGAGTTC TTCAACGGCT TTGCATCAGA ACCCACACGG
AGCACTGGAC TCTTTGGCTA TCAGCAGTAC AGTCAGGACG ACGCTACCCT CGAGGGCACT
ACATCGACCA ACCCCATCTC GGTATTCCCG GGATCTGACG GCGAAATCGA TCCCGAACGC
GTCAGGCAAC TGTTCACCGA GGCCGGCTAC CAGTACGACG ATGACGTGTT ACTCGAGTAA
 
Protein sequence
MKRVAPSVTR RSLLAGASGV GLSALAGCSE RFWSRAENTG PDQVELTIKT VPADDDAIAA 
TIMSQLRENF QAAGIDATHE PIAEAELYRD VLIDGDYDVF VLKHAGLDEY DALHGLLHSQ
FVTERGWQNP FQFSDIHADE LLDEQRATDR AEREETLTEL FNYLLETVPF TVVAYPDRLG
GTRETLSVSR PPRRAIEIID ILSQEFEDGP FDRPLELGIY GEALTDRLNP LIVDRNRVQG
LMELLYDPLA RQSLTDESEP GEPATYSPWL AEEIEWDEES SLTATVTLRS DLHWHDGEPL
DADDVDFTFR FLGDTSLGAV DGTIPAPRYR DRQTLIDDSD RIEVLDDRTV VLPFGSTARP
AATRILSVPI LPQHIWEPLS EVIAERQTRA LVHDNEEPVG SGLFELVETS ADELVLEPFE
DHVLRSSTVP NRPSILEHFS QFEGIRYRID PNVGAMLDAL EDGEIDVTAG AVPPDSAAQL
TDADDVSMVT TSTSSFYMVG YNIHHPELGN PNFRQIVSQL IDREYVVSEF FNGFASEPTR
STGLFGYQQY SQDDATLEGT TSTNPISVFP GSDGEIDPER VRQLFTEAGY QYDDDVLLE