Gene Nmag_3165 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmag_3165 
Symbol 
ID8826026 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatrialba magadii ATCC 43099 
KingdomArchaea 
Replicon accessionNC_013922 
Strand
Start bp3278673 
End bp3280334 
Gene Length1662 bp 
Protein Length553 aa 
Translation table11 
GC content63% 
IMG OID 
Productputative sodium symporter protein 
Protein accessionYP_003481279 
Protein GI289582813 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATCGAGC TACTCGAGCC GTTCGTCCTC CAGGAGACGG CGCTCGACGT CGGCGAGTTC 
AAACTCGTGC CGGCGCTGAC CGTCGCGGCG ATGCTGGCGC TGTTCCTCGG CGTCGGCTAC
TTCTTTCGGG TCGCGGCCGT CGACGATCTG TGGGTTGCCG GCCGATCGAT CGGTGCAGTT
GAGAACGGGA TGGCGATCGG TGCAAACTGG ATGAGTGCGG CATCCTACCT CGGTGTCGCG
TCGATTATCG CCCTCTCGGG CTACTTCGGA CTGGCGTACG TCGTCGGCTG GACGACGGGC
TACTTCATCC TGCTCATCTT CCTGGCAGCC CAGTTCCGTC GGTTCGGGAA GTACACGGCA
CCTGACTTCG TCGGCGACCG GTTCTACTCC GACTGGGCAC GCGGTATCGC GGCGTTCACC
ACACTTGCGA TCGCGTTCAC CTACGCCATC GGGCAGGCCA GCGGCATGGG GTTGATGGCC
CAGTACATCT TCGGAATCTC CTACGAGATG GGTGTGATCG TCCTGATGGG AGTCACTATC
GGCTACGTCG CCCTCTCGGG GATGCTGGGG ACGACGAAGA ACATGGCGAT CCAGTACGTG
ATTCTCATCA TTGCCTTCAC CATCGGCCTC TACGCCGTCG GCTGGAGCCA GGGCTGGTCG
ACCGTTCTGC CGTACTTCGA ATTCGGCAGC GAAGTCGCGG CGGCGGCTGA GATCGAAGCG
CAGTTCGTTG AGCCGTTCGC CGACGGTTCC TACTACGCCT GGATCGCGCT AGCGTTCAGT
CTGATCGTCG GCACCTGCGG TCTGCCGCAC GTTCTCGTCC GGTTCTACAC CGTCGAGAAC
GAGCGAACGG CCCGCTGGTC GACCGTCTGG GGCCTGTTCT TCATCTGCCT GCTGTACTGG
GGAACGGCCA CCTACGCCGC GTGGGGCGGA TTGCTCTACG ACAGCGAGGT AACCGGCGGC
GGTGGCTTCG CCGACATGCT CCCCATCGAA GCCGACGCGC TAGTCGTTCT GACTGCACAA
CTCGCTGAAC TGCCGACGTG GCTCGTCGGA CTGGTGGCTG CCGGTGCCGT CGCCGCAGCG
CTTGCGACCA CCGCGGGGCT GTTCATCTCG GCTTCCTCGG CCGCTGCACA CGACATCTAC
ACGAATCTCT ACAAGGAGAA CGCAACCCAG CGCGAGCAGA TGCTCGTGGG CCGCGTGACG
ATCCTGGGAA TCGGTATCCT GGTGACGCTC ATCGGTCTCA ACCCGCCTGC GCTCATTGGC
GAACTCGTCG CGATGTCGTT CGCCATCGCG GGCACCGTCT TCTTCCCGGT GTTCTTCCTC
GGGCTTTGGT GGGAAAACAC CACGAAAGAG GGCGCACTCG CGGGAATGCT CACCGGCATC
CTGCTCTCGT TCGGTGCGAT CCTCAACGAC ACGGCGATCC CGATGTACAC CGGCGTCGAA
GATGCGGTAA TTCCCGCGCT CGCGACCTGG CTGCCGGGGA CCTCCTCGGC ACTGGTCGGC
GTGCCGGTCG TCTTCGGGGT CATCATCGTC GTTTCGATCG TGACGGACAA CCCACCACAG
CACGTCAAAC GACTCGTCCG GCAGTGTCAC AGCCCGGAAC CGATGAGTCA GACGGAGTCG
GCGGAAGACG CTGCGAACGG TGGGGCACCG GCTGACGACT GA
 
Protein sequence
MIELLEPFVL QETALDVGEF KLVPALTVAA MLALFLGVGY FFRVAAVDDL WVAGRSIGAV 
ENGMAIGANW MSAASYLGVA SIIALSGYFG LAYVVGWTTG YFILLIFLAA QFRRFGKYTA
PDFVGDRFYS DWARGIAAFT TLAIAFTYAI GQASGMGLMA QYIFGISYEM GVIVLMGVTI
GYVALSGMLG TTKNMAIQYV ILIIAFTIGL YAVGWSQGWS TVLPYFEFGS EVAAAAEIEA
QFVEPFADGS YYAWIALAFS LIVGTCGLPH VLVRFYTVEN ERTARWSTVW GLFFICLLYW
GTATYAAWGG LLYDSEVTGG GGFADMLPIE ADALVVLTAQ LAELPTWLVG LVAAGAVAAA
LATTAGLFIS ASSAAAHDIY TNLYKENATQ REQMLVGRVT ILGIGILVTL IGLNPPALIG
ELVAMSFAIA GTVFFPVFFL GLWWENTTKE GALAGMLTGI LLSFGAILND TAIPMYTGVE
DAVIPALATW LPGTSSALVG VPVVFGVIIV VSIVTDNPPQ HVKRLVRQCH SPEPMSQTES
AEDAANGGAP ADD