Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmar_1093 |
Symbol | |
ID | 5773859 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosopumilus maritimus SCM1 |
Kingdom | Archaea |
Replicon accession | NC_010085 |
Strand | + |
Start bp | 996775 |
End bp | 998052 |
Gene Length | 1278 bp |
Protein Length | 425 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 641316735 |
Product | sodium:dicarboxylate symporter |
Protein accession | YP_001582427 |
Protein GI | 161528601 |
COG category | [C] Energy production and conversion |
COG ID | [COG1301] Na+/H+-dicarboxylate symporters |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 42 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAATTGA AAAATTATAT TTTCAAACAG TTGTGGCTTC AAGTTGTAAT CTCTTTACTA GTGGGATTAG GAGTTGGTTT AGTTTTAGGA GATGATGTAG GGGTCAGTCT AGATGACAAC ACTCTAGATA CTGTTTCATC ATATCTCAAA ATTCCAGCCA ACATATTTTT GAGTGTAATT TTGATGATCA TAGTTCCATT AATTTTTGCA TCAATTGTGG TTGCCATTAC TAATTTAGGT ACAAAAGAAA AAATGAAAAC TCTAGGATTA GGTATTGGAG TTTATTTTGT AATTACTACA ACAATTGCAA TTCTAATTGC AATAACTCTT GCATCAGTTA TTGCCCCTGG AAGTATTCTA GACCTTACTG CACTTCAAGA GACTCATGAT CTTTCTGAAG ATGATTTGAA AGTTACAGAA GGTTTCTCAG TAGATGAAAT TCCAGATATT GCATCAAACA TTATCCCAAG AAATCCAATT GTATCTTATC TTGAAGGACA AATGTTAAGC ATTTTGATAA TGGCGATGAT TGTAGGATTG GCAATGGCTG CTCTTCCAAA AGAGTCAGTC AAACCACTAC TGGATTTACT TGAATCTGTT CAAAAAATCA CATTATACAT TTTGATTATG GCAATGAAGA TTGTACCATT TGCAGTCTTT GGTTTGATTG TAGGCATGGT TGCTAAAGTA GGAGTAGAAA CAATGGCAGG ACTTGGAGTA TACATGGCAA CAGTGATCCT AGGGCTTGGA GCCATGTTGT TGGTTTATAC AATGATCATA AAATTTGTAG CAAAAAGACC AATCTCATCT ACATTCTCAA AGTTCCGTAA CCCACAGACT TTGGCATTCT CAACTGCAAG TTCAATGGCA ACAATGCCTA TGACATTAAA GACTGCAGAA GAGGATCTGA AGATAGATAC ACGTGTATCA AAATTTGTCA TTCCACTTGG AACTACAGTA AACATGGATG GAACTGCATT GTATCAAGTA ATAGCAGTCT TCTTTTTGGC ACAGCTCTTT GCAATTGAAT TGAGTATATT CTCCATACTT GTGATTATCA TTACTTCACT TTTGGCATCA ATTGGAACAC CTGCAGTTCC AGGAGCAGGA ACAGTTGTTT TAGCTACAAT ACTTGTAACA GTTGGAATTC CTCCAGTAGG AATTTTGTTA TTGCTTTCAG TAGATAGAAT TTTGGACATG ATCAGGACCA TGGTAAATGT CACAGGTGAT CTTACTGCAT CTTGCGCGTT TAATGAAATA ACGCGAGAAA AGAATTGA
|
Protein sequence | MELKNYIFKQ LWLQVVISLL VGLGVGLVLG DDVGVSLDDN TLDTVSSYLK IPANIFLSVI LMIIVPLIFA SIVVAITNLG TKEKMKTLGL GIGVYFVITT TIAILIAITL ASVIAPGSIL DLTALQETHD LSEDDLKVTE GFSVDEIPDI ASNIIPRNPI VSYLEGQMLS ILIMAMIVGL AMAALPKESV KPLLDLLESV QKITLYILIM AMKIVPFAVF GLIVGMVAKV GVETMAGLGV YMATVILGLG AMLLVYTMII KFVAKRPISS TFSKFRNPQT LAFSTASSMA TMPMTLKTAE EDLKIDTRVS KFVIPLGTTV NMDGTALYQV IAVFFLAQLF AIELSIFSIL VIIITSLLAS IGTPAVPGAG TVVLATILVT VGIPPVGILL LLSVDRILDM IRTMVNVTGD LTASCAFNEI TREKN
|
| |