Gene Nmar_0116 details

Gene Information

Locus tag: Nmar_0116
Symbol:
ID: 5773206
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosopumilus maritimus SCM1
Kingdom: Archaea
Replicon accession: NC_010085
Strand:
Start bp: 104553
End bp: 105578
Gene Length: 1026 bp
Protein Length: 341 aa
Translation table: 11
GC content: 30%
IMG OID: 641315736
Product: glycosyl transferase group 1
Protein accession: YP_001581454
Protein GI: 161527628
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
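
The Start bp, End bp, Gene Length and Protein Length fields agree with one another, which a short calculation can confirm. This is a minimal sketch that assumes the coordinates are 1-based and inclusive and that the gene length includes one stop codon. The function names are illustrative and do not come from the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming its length includes one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(104553, 105578)
print(length_bp)                  # 1026
print(protein_length(length_bp))  # 341
```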


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000000347461
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAAAATTC TGATCATTTC TCCGACTCAA GAAGGAATTG GTGGTGTTGC TAGACATGTA 
CAAGGTCTTA CAAAATTTTT AAAAAATGAT GGGCATGAAG TAGATGTAAT CTCTTCTGAA
AATACATTCA CAATTCCTAT ACGAAAATTA AAAAATCCTA GCTTCATGCT ATCTTCTTTT
TTAAAAACAA AATTTTCAAA AAAATATGAT GTAGTACATG CTCAAAATGT TGTATCTGCG
TTTGCAATGA AAAATGTTTT GGGAAAAAAA TTATTGGCAA TACACGGAAT TCATCATGAA
CAAGTAGATC ATTTACATGG AAAAACTGCT GGAAATGTAG CAAAGGATTA TGAAGATAAA
GCGTTAAACT GGGTTGACGC AATAACGGTT TCTTCAAAAG AAATGCTTGA TTATTACTCT
CAAAAAGGAT TGAACACGTT TTTTCTCCCA AATGCATTGG ATATACAATC AATTACCAAA
AAATCTAATC GAAAATTTGA CAAACAAATT GTTTATGCTG CTAGATTATC AAAAGAAAAA
GGTATTCTTG AAGTATTGGA TGTTGCAGAA AAATTACCAC AAGACATTCA TCTTTTAATT
TTAGGATCAG GGGTAGAAGA GAATAAAGTA AAAGAATTAT CAGAACTACA AAAAAATATT
CATTTTTTAG GATATCAAAA TAGAGAAAAC ACTCTATCTA TAATTCGTGG TTCTGATTTG
TTAATACAAC CATCTAGAAT GGAAGGTGGA CTAAGTTATA CTTTGTTGGA ATCTATGGCA
TGTGGAACTC CAATTATATG TACTGATGTT GGTGGTGCTA AAGATACTTT ATCTCATATG
AAAAATGCAT TTATTATCAA ACCTGAAAAT TCAACAGAAT TAAAAAATGC TATTAATCAA
TTAATGAACA ACTCAAAACA AAGAGAGGAA CTAAAGAACA ATGCTTTGGA TGAAATCAAA
AATCATGATT GGTCCGTTGT AGGGCCAAAA TATGTAGAAA TTTATCAAAA ATTACTTTCA
TCATAA
 
Protein sequence
MKILIISPTQ EGIGGVARHV QGLTKFLKND GHEVDVISSE NTFTIPIRKL KNPSFMLSSF 
LKTKFSKKYD VVHAQNVVSA FAMKNVLGKK LLAIHGIHHE QVDHLHGKTA GNVAKDYEDK
ALNWVDAITV SSKEMLDYYS QKGLNTFFLP NALDIQSITK KSNRKFDKQI VYAARLSKEK
GILEVLDVAE KLPQDIHLLI LGSGVEENKV KELSELQKNI HFLGYQNREN TLSIIRGSDL
LIQPSRMEGG LSYTLLESMA CGTPIICTDV GGAKDTLSHM KNAFIIKPEN STELKNAINQ
LMNNSKQREE LKNNALDEIK NHDWSVVGPK YVEIYQKLLS S
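
The gene and protein sequences above are printed in space-separated blocks of ten characters. This minimal sketch shows one way to strip that layout, compute a GC fraction, and translate the nucleotide sequence. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (the two tables differ in their permitted start codons). The helper names `clean`, `gc_fraction` and `translate` are hypothetical, and only the first 60-base line of the gene sequence is used as sample input.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    seq = clean(seq)
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon.

    Alternative start codons are not treated specially in this sketch.
    """
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_line = "ATGAAAATTC TGATCATTTC TCCGACTCAA GAAGGAATTG GTGGTGTTGC TAGACATGTA"
print(translate(first_line))  # MKILIISPTQEGIGGVARHV
```

The translated residues match the first two blocks of the protein sequence listed above. Passing the full gene sequence to the same functions would allow the GC content and Protein Length fields to be checked in the same way.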