Gene Nmag_0100 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmag_0100 
Symbol 
ID8822919 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatrialba magadii ATCC 43099 
KingdomArchaea 
Replicon accessionNC_013922 
Strand
Start bp123734 
End bp124945 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content55% 
IMG OID 
Productextracellular solute-binding protein family 1 
Protein accessionYP_003478261 
Protein GI289579795 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGGGTGT TGAATGTCAC TATGACAAAT AGACGCGCGA TTCTCAAAAC CGCAGGTGTG 
CTCGGGACAG GGTGTCTCGC AGGCTGTCTC GGTGGGGACA GCGACGGGGA GTCGATTCGC
CCCGTTGCTA TCGAGGAGTG GCCGCCAGAA TCATACAACA ACGAACTCAA TACCTGGAAC
TGGTACGCCG AGTGGAACGA ATGGGGGACC GAAGCGTTTG CCGAGGAGTA CGATCTCGAT
AGCTACTCCA CTGAGACATA CGCGTCACCC GGTGATTGGT TTACCAATCT CCAGTCGAAT
CCGGAGAATC ACGGAATCGA TCAAATCGGT GCCTTCTCCG AATGGCAGTA TCGGGCCGTT
GAAGAGGACC TGTTAGAACC GATTCCTATC GATATGATGC CGAACGTAGA GATTCCCGAC
CAGTACCTCG ACGTCCACCG GGAGCAGTTC TGGAGCGACG ACGGTGCAGG TGGACTCTAC
GGTATCCCTC ACTCAATCGT GATCAGCCCG ATCGTCATGT ACAACACGGA GGAATTGCAG
GACCCAGGCG AATCGATCGA CATCCTCTGG GACGAGGAGT TTGCCGACGA AATCTCCATG
ATCGCTCACC AACCAGCGAT CCTCTGTGAG GCTGGTGCAC TTTACACCGG TCAGGATCCG
ACCGATCCGG ACGATTTCGA AGAGATTCAG GAGGTACTCG AACAACAGCG CGACCTCGTG
TTCACCTACG CCGATGACCA CCAGACACAG ATGCAACTCG TGATGAGCGG TGATGCAACG
CTCGGTTCAC ACATGGACGG CCGAGCGTTC AGGGCGATAT ACAACCATGG CGGTGACGTC
GACTGGTTTA TTCCGGAGGA GGGTGCGACC TGGGGGACGG ATACGCTCGT TATCCCGAAA
AACGCGCCAA ACCCGGTTAC GAGCACGATG TATCTGGACT ACCTCTTCAG CGACGAAGGG
ATGGAGCAGC TGATCGACAC GTCCCTGTAT CGTCCACCGG TTGCCAACGA CGAATTCACT
GACGGTGAAC TCGGAGAGAT AATTCGCGAA AACTGGACCG ACGAGTGGGA GAAAGAAGGC
GATGCCGAGG ACTTCATCGA CGACCTCGTG TTGACAGAAG AAACAATGGA CAATCTGTAC
CACAACTGGC CCCGTTCAGA CGAAGTCATC GAGCGATACG ACGAGATCTG GACGGCAGTT
ACCGCGGGAT AA
 
Protein sequence
MGVLNVTMTN RRAILKTAGV LGTGCLAGCL GGDSDGESIR PVAIEEWPPE SYNNELNTWN 
WYAEWNEWGT EAFAEEYDLD SYSTETYASP GDWFTNLQSN PENHGIDQIG AFSEWQYRAV
EEDLLEPIPI DMMPNVEIPD QYLDVHREQF WSDDGAGGLY GIPHSIVISP IVMYNTEELQ
DPGESIDILW DEEFADEISM IAHQPAILCE AGALYTGQDP TDPDDFEEIQ EVLEQQRDLV
FTYADDHQTQ MQLVMSGDAT LGSHMDGRAF RAIYNHGGDV DWFIPEEGAT WGTDTLVIPK
NAPNPVTSTM YLDYLFSDEG MEQLIDTSLY RPPVANDEFT DGELGEIIRE NWTDEWEKEG
DAEDFIDDLV LTEETMDNLY HNWPRSDEVI ERYDEIWTAV TAG