Gene Nmar_1093 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmar_1093 
Symbol 
ID5773859 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosopumilus maritimus SCM1 
KingdomArchaea 
Replicon accessionNC_010085 
Strand
Start bp996775 
End bp998052 
Gene Length1278 bp 
Protein Length425 aa 
Translation table11 
GC content36% 
IMG OID641316735 
Productsodium:dicarboxylate symporter 
Protein accessionYP_001582427 
Protein GI161528601 
COG category[C] Energy production and conversion 
COG ID[COG1301] Na+/H+-dicarboxylate symporters 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAATTGA AAAATTATAT TTTCAAACAG TTGTGGCTTC AAGTTGTAAT CTCTTTACTA 
GTGGGATTAG GAGTTGGTTT AGTTTTAGGA GATGATGTAG GGGTCAGTCT AGATGACAAC
ACTCTAGATA CTGTTTCATC ATATCTCAAA ATTCCAGCCA ACATATTTTT GAGTGTAATT
TTGATGATCA TAGTTCCATT AATTTTTGCA TCAATTGTGG TTGCCATTAC TAATTTAGGT
ACAAAAGAAA AAATGAAAAC TCTAGGATTA GGTATTGGAG TTTATTTTGT AATTACTACA
ACAATTGCAA TTCTAATTGC AATAACTCTT GCATCAGTTA TTGCCCCTGG AAGTATTCTA
GACCTTACTG CACTTCAAGA GACTCATGAT CTTTCTGAAG ATGATTTGAA AGTTACAGAA
GGTTTCTCAG TAGATGAAAT TCCAGATATT GCATCAAACA TTATCCCAAG AAATCCAATT
GTATCTTATC TTGAAGGACA AATGTTAAGC ATTTTGATAA TGGCGATGAT TGTAGGATTG
GCAATGGCTG CTCTTCCAAA AGAGTCAGTC AAACCACTAC TGGATTTACT TGAATCTGTT
CAAAAAATCA CATTATACAT TTTGATTATG GCAATGAAGA TTGTACCATT TGCAGTCTTT
GGTTTGATTG TAGGCATGGT TGCTAAAGTA GGAGTAGAAA CAATGGCAGG ACTTGGAGTA
TACATGGCAA CAGTGATCCT AGGGCTTGGA GCCATGTTGT TGGTTTATAC AATGATCATA
AAATTTGTAG CAAAAAGACC AATCTCATCT ACATTCTCAA AGTTCCGTAA CCCACAGACT
TTGGCATTCT CAACTGCAAG TTCAATGGCA ACAATGCCTA TGACATTAAA GACTGCAGAA
GAGGATCTGA AGATAGATAC ACGTGTATCA AAATTTGTCA TTCCACTTGG AACTACAGTA
AACATGGATG GAACTGCATT GTATCAAGTA ATAGCAGTCT TCTTTTTGGC ACAGCTCTTT
GCAATTGAAT TGAGTATATT CTCCATACTT GTGATTATCA TTACTTCACT TTTGGCATCA
ATTGGAACAC CTGCAGTTCC AGGAGCAGGA ACAGTTGTTT TAGCTACAAT ACTTGTAACA
GTTGGAATTC CTCCAGTAGG AATTTTGTTA TTGCTTTCAG TAGATAGAAT TTTGGACATG
ATCAGGACCA TGGTAAATGT CACAGGTGAT CTTACTGCAT CTTGCGCGTT TAATGAAATA
ACGCGAGAAA AGAATTGA
 
Protein sequence
MELKNYIFKQ LWLQVVISLL VGLGVGLVLG DDVGVSLDDN TLDTVSSYLK IPANIFLSVI 
LMIIVPLIFA SIVVAITNLG TKEKMKTLGL GIGVYFVITT TIAILIAITL ASVIAPGSIL
DLTALQETHD LSEDDLKVTE GFSVDEIPDI ASNIIPRNPI VSYLEGQMLS ILIMAMIVGL
AMAALPKESV KPLLDLLESV QKITLYILIM AMKIVPFAVF GLIVGMVAKV GVETMAGLGV
YMATVILGLG AMLLVYTMII KFVAKRPISS TFSKFRNPQT LAFSTASSMA TMPMTLKTAE
EDLKIDTRVS KFVIPLGTTV NMDGTALYQV IAVFFLAQLF AIELSIFSIL VIIITSLLAS
IGTPAVPGAG TVVLATILVT VGIPPVGILL LLSVDRILDM IRTMVNVTGD LTASCAFNEI
TREKN