Gene Information |
Locus tag | Nmar_0119 |
Symbol | |
ID | 5773788 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosopumilus maritimus SCM1 |
Kingdom | Archaea |
Replicon accession | NC_010085 |
Strand | - |
Start bp | 108318 |
End bp | 109481 |
Gene Length | 1164 bp |
Protein Length | 387 aa |
Translation table | 11 |
GC content | 30% |
IMG OID | 641315739 |
Product | polysaccharide biosynthesis protein |
Protein accession | YP_001581457 |
Protein GI | 161527631 |
COG category | |
COG ID | |
TIGRFAM ID | |
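The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it matches the listed gene length) and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 108318
end_bp = 109481

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Unspliced CDS: every codon except the terminal stop codon encodes a residue.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(f"Gene length: {gene_length} bp")
print(f"Leftover bases after splitting into codons: {remainder}")
print(f"Protein length: {protein_length} aa")
```

Under these assumptions the computed values agree with the Gene Length (1164 bp) and Protein Length (387 aa) fields of the record.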
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.000000000173585 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGTTTCAT TAAAGGATAA AATTAAAAAC ATCAAAGGAA TTAAAGATTT AACATCAATT GGTTTTGCCA ATATTTCTGG AAATGCAATT AGTGCATTAT TTTGGTTTTA TTTAGCAAGT TTGTTAACAA CTACAGAATA TGGTGAATTA TCTTATTTAA TATCAATAGC AGGGCTTGCA TCTGTTTTAT CATTTGTAGG TGGTAGTCAT ACAATAACAG TTCTTACAGC AAAAAAAATC AACATCCTAT CAACACTTGT AACAATTATT CTTCTGATTG GAGTTACAAT AATGTTTGTA CTATATCTTC TTTTTAGTAA TTTATCAATA AGTATGTATG CTATCAGTTA CCTAATCTAC GGAGTTAGTA TTGGAGAAAT TTTAGGCAAT AAACTTTACA GAACATATTC AATTTTATTT ATTGTTCAGA AAGCTACAAT GGTTTTAGCA AGTATTCTAC TTTACCCCAA TTTGGGAATT GACGGAGTAA TTCTTGGATA CGCAATATCA CATTTTGTTC CAGGTTACAG AATTTTTAAA ATGTTAAAAG GAAAATTCTC TTTTTCGTCA TTAAAGCCTC AATGGAGTTT TATTTCAAAT AATTATGGGT TAGTAATTAG TAGGGCATTT ACTGGACAAA TTGACAAAAT TATAATTGCA CCCATACTAG GATTTGCATT ATTAGGAAAT TATCATTTAG GAATTCAATT TCTTTCACTT TTGAGTATAT TACCTATGAG TATGATGCAG TATATCTTAC CCCAAGAATC CACAGGTCAT TCTCATGTGA TTTTAAAGAA ATTAGCAGTT ATCGTAGCTG TTTTGTTTGC AGTTTTAGGA ATCTTCCTAG GTCCAATAAT TCTGCCACAT TTCTTTCCAA AATTTGAGAG TGCTGCAGAA ATAATTCCCA TAATGAGTTT AGTAGTAATT CCGAGAACAA TAACAGCTAT CACAAATGCA AAATTGTTAG GAATTTTATC AACTAGGTTT ATTGTTATAG GTGTTGCAAT ATATCTTACA ATACAAGTTT CAGGAATTTT GATCTTAGGG GAATTGTATT CGCTTAATGG AGTTGCGTGG GCATTAGTTT TAGCAGAAGC TGGTCAAGCT ATTTTCCTAT ATGTTTCAAG TAAGTATTTC TTAAAAAAAA CAAGTAATGC ATAA
|
Protein sequence | MVSLKDKIKN IKGIKDLTSI GFANISGNAI SALFWFYLAS LLTTTEYGEL SYLISIAGLA SVLSFVGGSH TITVLTAKKI NILSTLVTII LLIGVTIMFV LYLLFSNLSI SMYAISYLIY GVSIGEILGN KLYRTYSILF IVQKATMVLA SILLYPNLGI DGVILGYAIS HFVPGYRIFK MLKGKFSFSS LKPQWSFISN NYGLVISRAF TGQIDKIIIA PILGFALLGN YHLGIQFLSL LSILPMSMMQ YILPQESTGH SHVILKKLAV IVAVLFAVLG IFLGPIILPH FFPKFESAAE IIPIMSLVVI PRTITAITNA KLLGILSTRF IVIGVAIYLT IQVSGILILG ELYSLNGVAW ALVLAEAGQA IFLYVSSKYF LKKTSNA