Gene Nmar_0119 details


Gene Information

Locus tag: Nmar_0119
Symbol:
ID: 5773788
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosopumilus maritimus SCM1
Kingdom: Archaea
Replicon accession: NC_010085
Strand:
Start bp: 108318
End bp: 109481
Gene Length: 1164 bp
Protein Length: 387 aa
Translation table: 11
GC content: 30%
IMG OID: 641315739
Product: polysaccharide biosynthesis protein
Protein accession: YP_001581457
Protein GI: 161527631
COG category:
COG ID:
TIGRFAM ID:
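
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes a single terminal stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(108318, 109481)
print(length)                     # 1164
print(protein_length_aa(length))  # 387
```

Under these assumptions, both results agree with the Gene Length and Protein Length fields listed in the record.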


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000000000173585
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGTTTCAT TAAAGGATAA AATTAAAAAC ATCAAAGGAA TTAAAGATTT AACATCAATT 
GGTTTTGCCA ATATTTCTGG AAATGCAATT AGTGCATTAT TTTGGTTTTA TTTAGCAAGT
TTGTTAACAA CTACAGAATA TGGTGAATTA TCTTATTTAA TATCAATAGC AGGGCTTGCA
TCTGTTTTAT CATTTGTAGG TGGTAGTCAT ACAATAACAG TTCTTACAGC AAAAAAAATC
AACATCCTAT CAACACTTGT AACAATTATT CTTCTGATTG GAGTTACAAT AATGTTTGTA
CTATATCTTC TTTTTAGTAA TTTATCAATA AGTATGTATG CTATCAGTTA CCTAATCTAC
GGAGTTAGTA TTGGAGAAAT TTTAGGCAAT AAACTTTACA GAACATATTC AATTTTATTT
ATTGTTCAGA AAGCTACAAT GGTTTTAGCA AGTATTCTAC TTTACCCCAA TTTGGGAATT
GACGGAGTAA TTCTTGGATA CGCAATATCA CATTTTGTTC CAGGTTACAG AATTTTTAAA
ATGTTAAAAG GAAAATTCTC TTTTTCGTCA TTAAAGCCTC AATGGAGTTT TATTTCAAAT
AATTATGGGT TAGTAATTAG TAGGGCATTT ACTGGACAAA TTGACAAAAT TATAATTGCA
CCCATACTAG GATTTGCATT ATTAGGAAAT TATCATTTAG GAATTCAATT TCTTTCACTT
TTGAGTATAT TACCTATGAG TATGATGCAG TATATCTTAC CCCAAGAATC CACAGGTCAT
TCTCATGTGA TTTTAAAGAA ATTAGCAGTT ATCGTAGCTG TTTTGTTTGC AGTTTTAGGA
ATCTTCCTAG GTCCAATAAT TCTGCCACAT TTCTTTCCAA AATTTGAGAG TGCTGCAGAA
ATAATTCCCA TAATGAGTTT AGTAGTAATT CCGAGAACAA TAACAGCTAT CACAAATGCA
AAATTGTTAG GAATTTTATC AACTAGGTTT ATTGTTATAG GTGTTGCAAT ATATCTTACA
ATACAAGTTT CAGGAATTTT GATCTTAGGG GAATTGTATT CGCTTAATGG AGTTGCGTGG
GCATTAGTTT TAGCAGAAGC TGGTCAAGCT ATTTTCCTAT ATGTTTCAAG TAAGTATTTC
TTAAAAAAAA CAAGTAATGC ATAA
 
Protein sequence
MVSLKDKIKN IKGIKDLTSI GFANISGNAI SALFWFYLAS LLTTTEYGEL SYLISIAGLA 
SVLSFVGGSH TITVLTAKKI NILSTLVTII LLIGVTIMFV LYLLFSNLSI SMYAISYLIY
GVSIGEILGN KLYRTYSILF IVQKATMVLA SILLYPNLGI DGVILGYAIS HFVPGYRIFK
MLKGKFSFSS LKPQWSFISN NYGLVISRAF TGQIDKIIIA PILGFALLGN YHLGIQFLSL
LSILPMSMMQ YILPQESTGH SHVILKKLAV IVAVLFAVLG IFLGPIILPH FFPKFESAAE
IIPIMSLVVI PRTITAITNA KLLGILSTRF IVIGVAIYLT IQVSGILILG ELYSLNGVAW
ALVLAEAGQA IFLYVSSKYF LKKTSNA