Gene Nmag_1462 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmag_1462 
Symbol 
ID8824295 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatrialba magadii ATCC 43099 
KingdomArchaea 
Replicon accessionNC_013922 
Strand
Start bp1491359 
End bp1492537 
Gene Length1179 bp 
Protein Length392 aa 
Translation table11 
GC content64% 
IMG OID 
Productextracellular solute-binding protein family 1 
Protein accessionYP_003479602 
Protein GI289581136 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGCCGG CTGCCGTGAT ATTGTCCGTC ATGCACGGAT CGGACGGGAC TCGAGTACCT 
GGCCATGATC ACGGCGGTCG ACGCAGTCGC AGCTACGATC GGCGACGATT TCTCGCAGCA
GCGGGGGTCG TCTCGGCGGG AGTCGTGAGC GGCTGTCTCG GACTCGGAGA CGACGAATCG
GACGTGCTCG GGGATCCCGA GTATCGGGAG GGGCGACCCG ATCCGGGTGG TGTCTCGATA
GAAGAGATGC CGGATCTGAA CGGAGATTTG ACGATCTACT CTGGGCGCAG CCAGCCGCGG
ATCGGCGAAC TGATCGAGTA CGTCGAGGCA CAGTACGACG AACTGACCAT CGAGGTCAGA
TACGACGATA CCGCGGACCT GATCAGTACG ATCGAGACGG AGGCCGAAAC GCCGGCGGAC
GTCTTCTACG GCAGCGAGAC ACAGTCGATG ACCCACCTCA AGGACGAGGG TTACACCGTC
GAGTTGCCTG ACGAAGTCAT CGATTTGGTC GACACGGGCT CAATCGATCC GGACGGCCAC
TGGACGGGTT TCACCCGCCG ATTTCGGGCG ATGGCGTACA ACAGAGACGC GTACGATGCG
GACGAGCTAC CGGACGACAT CTTCGCCTAT GCGGAGGACG AACGATTCCA GGACGAGATC
ATGTGGCCGC CGGATCAGGG CTCGTTCCAG GCGTTTCTCA CCTCGATGCG GCTGCTCCAC
GGCGAGGAGG AGACCCGCTC GTGGGTCCAA TCGATGACCG ACGACCAGGG TGTCGAGGCG
TCTCCGGGCG GCGACAGCGC GCTGGCACAG GCCGTCGGCG ACGGGGAGGT CAGCGTCGGG
CTGACGAACC ACTACGTCGT CCGCGACCAC GGCGGCGACT CCGTCGGCCT GGCGTTCACC
AGCGACGACG CGGGGGCGAT GTACAACGTC ACCGGCGGTG CGGTGATGGC CGACAGCGAC
GACACCGAGA CCGCCGCGAA CTTCGTCCAG CACATGCTCT CGGCGGAAGC CCAGGAGTAC
TTCGCGACGA CCACCTGGGA GTACCCCGTC ATCGACGGCG TCGCCCCACT CGAGGAACTC
CCTGGCACAG ACGAGTTCGA GCCACCCGAG TTCGACCTGA ACGAGCTCGA CGATCCCGAC
CCGACGCTCG AACTCCTGCG CGAGGAGGAC GTTCTCTGA
 
Protein sequence
MQPAAVILSV MHGSDGTRVP GHDHGGRRSR SYDRRRFLAA AGVVSAGVVS GCLGLGDDES 
DVLGDPEYRE GRPDPGGVSI EEMPDLNGDL TIYSGRSQPR IGELIEYVEA QYDELTIEVR
YDDTADLIST IETEAETPAD VFYGSETQSM THLKDEGYTV ELPDEVIDLV DTGSIDPDGH
WTGFTRRFRA MAYNRDAYDA DELPDDIFAY AEDERFQDEI MWPPDQGSFQ AFLTSMRLLH
GEEETRSWVQ SMTDDQGVEA SPGGDSALAQ AVGDGEVSVG LTNHYVVRDH GGDSVGLAFT
SDDAGAMYNV TGGAVMADSD DTETAANFVQ HMLSAEAQEY FATTTWEYPV IDGVAPLEEL
PGTDEFEPPE FDLNELDDPD PTLELLREED VL