Gene Information |
Locus tag | Nmar_0620 |
Symbol | |
ID | 5774200 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosopumilus maritimus SCM1 |
Kingdom | Archaea |
Replicon accession | NC_010085 |
Strand | - |
Start bp | 561296 |
End bp | 563041 |
Gene Length | 1746 bp |
Protein Length | 581 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 641316255 |
Product | membrane protein-like protein |
Protein accession | YP_001581954 |
Protein GI | 161528128 |
COG category | [S] Function unknown |
COG ID | [COG3356] Predicted membrane protein |
TIGRFAM ID | |
|
|
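The coordinates and lengths listed above are mutually consistent. The short check below is a sketch under two assumptions the record does not state: that Start bp and End bp are 1-based and inclusive, and that the gene length counts the terminal stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 561296
end_bp = 563041

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the gene length includes one stop codon,
# which does not contribute an amino acid to the protein.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa)
```

Under those assumptions the result matches the reported Gene Length (1746 bp) and Protein Length (581 aa).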
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 42 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAAAAG AATCAGACGA TGTTTCAAAT ATTCACAATA GGTTTTCCCT TACCCTAGTA AATCCATCAT CACATTACTT TTCACTAGTG GTATCACTGG GAGTGGCCGC AATAATTTCA CTTGCAACAC TCTTGGGGTA TTTGGACAGT AGTATTGAAG AAGCATGGTA TATTGTACCT GCAGTCATGA CAGTACTTTT CTTAACACAA TTACTTGATA CACGATTCAC AAAGAAAAAA GAATACTCAA AGTCATTACA TTCCTCATTA TTTGCAAATA TGTTATGGGC TGCAACAGTA TTGTTAGGAT TACTAGCAAG TTTTGTCTTA TCTAAAGAAA CATCATTGTT CTTTGTAACT TTTGGAATGT TTCTCTTTGC AAGTTTTAGA ATTGGGATTT ACACTACAAC GCTTGGAGTA AGTCTCAAGA AAGCATGGGC GATTTGTTTT GTTCAACCAT TAGCAATGTT TGCAGTTTTA ATCCCACAAC ATTTGTGGAG TCCAATGTTC AGTGACCCAA TGACATTGTT TTACGGGATT TCATTTATGA TAATTGCTAG TGTGTGGTCA GTACTTACAG ACAGAGCAGG AAGACCAGGA ATGGAAAGTA CTCATAAAAC AATTCAAGCA TATCTTGCAT CACAAAAAAA TGATCATACA GAAGCAGAAG AAATCATGGA AGAGCGCTCT AGTGAAACCA AGGTTGCAAC TTCACAAATA AGATTGTCAG CTAATGACGG AGGTAAAGAA TTCAGAATGG TTCTACCTGA AATTCATCCA GGTCCATATC ATCCAGTAGG GGGTAGCAAC ATACCATATT TGATTTACAA AAATTTGTCA TCATCAGCCA TGGTGATGCA CAGTATATCA GACCATGCAC TCAATCTTCC ATCAAAAAAT GAAGTTGAAA ATTATCTAAA GAATCTAGAA AACAGCAAAG TCAAAGAAGA AGGATTGCAG TGTACAGAAC CAGTAACAGT CCAAATCAAC AAAGCAAGAG TGATTGGATT GCTTTTTGGA AAAAACCCAT TACTGTTCTT GTCACTGTCA CCACATGGAA TGGAAGACAT TCCAAACTAC ATGAAAAATG ATATTGAGCA ATATGCTAAA AACCGAAATT ATTCAAAACC GCTAATTGTT GATTGTCACA ATGCAATGGG AGAAGAAATT TCAAGTGAAG ATGGGGAAGA TATGCTAAAA GCTGCCAAAT CCTGTCTAGA CACTCTAATC ACAAAAGATA GTTTCCCAAT TGAATTTGGA TATGCAAATT CAGATGACAT GGATGTTTGG ACTGAAGACT TGGGAATGGG TGGACTAGGA ATAGTATGTC TCAAAATCAA TGAAAAAAAA TACTTTCTTG GATGGGCAGA TGCAAATAAT ATGGAAAATG GAGTTAGAGA GAAGATCATA GATTTGTTTG CAAGAAAAGA CCTTCAACTA CTTGAGATTT GCACATCAGA TACTCATTAT GCACCAGTGA AAGCCAGAAA TAGAAATGGA TACTACCAAT TAGGATTGAT TACCAGTGCA GATAAACTTG CAAAATGGTT TTTGGAGATT GCAAGCAATG CTGAATCTAA CACATCAACT GCAAAGTTTG AGATTTTAGA AAATGAAACA GAAGTTAAAG TGATGGGACA AGGGATCTAT GAAGATTATT CAAAGGCATT AGATAACTCA TTAAAAATCA CTAAAGGGTT TCTAATTGGA GGAGTAATAT TCTTCATAAC TAGTCTTTTT CTATAG
|
Protein sequence | MEKESDDVSN IHNRFSLTLV NPSSHYFSLV VSLGVAAIIS LATLLGYLDS SIEEAWYIVP AVMTVLFLTQ LLDTRFTKKK EYSKSLHSSL FANMLWAATV LLGLLASFVL SKETSLFFVT FGMFLFASFR IGIYTTTLGV SLKKAWAICF VQPLAMFAVL IPQHLWSPMF SDPMTLFYGI SFMIIASVWS VLTDRAGRPG MESTHKTIQA YLASQKNDHT EAEEIMEERS SETKVATSQI RLSANDGGKE FRMVLPEIHP GPYHPVGGSN IPYLIYKNLS SSAMVMHSIS DHALNLPSKN EVENYLKNLE NSKVKEEGLQ CTEPVTVQIN KARVIGLLFG KNPLLFLSLS PHGMEDIPNY MKNDIEQYAK NRNYSKPLIV DCHNAMGEEI SSEDGEDMLK AAKSCLDTLI TKDSFPIEFG YANSDDMDVW TEDLGMGGLG IVCLKINEKK YFLGWADANN MENGVREKII DLFARKDLQL LEICTSDTHY APVKARNRNG YYQLGLITSA DKLAKWFLEI ASNAESNTST AKFEILENET EVKVMGQGIY EDYSKALDNS LKITKGFLIG GVIFFITSLF L
|
| |
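Although the gene lies on the minus strand, the gene sequence as listed begins with ATG and ends with TAG, so it appears to be given in coding orientation. The sketch below translates the first 30 bases of that sequence to show how it relates to the protein sequence. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in which codons may act as starts). The `translate` helper and `CODON_TABLE` are hypothetical names written for this illustration, not part of any database tool.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
# Assumption: amino-acid assignments of the standard code, which
# translation table 11 also uses.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the 10-base spacing used in the record
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record.
prefix = "ATGGAAAAAG AATCAGACGA TGTTTCAAAT"
print(translate(prefix))
```

The printed residues can be compared against the first block of the protein sequence (MEKESDDVSN). Passing the full gene sequence to the same function should give a way to cross-check the whole protein.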