Gene Nmar_0120 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmar_0120 
Symbol 
ID5774767 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosopumilus maritimus SCM1 
KingdomArchaea 
Replicon accessionNC_010085 
Strand
Start bp109495 
End bp110676 
Gene Length1182 bp 
Protein Length393 aa 
Translation table11 
GC content28% 
IMG OID641315740 
Productglycosyl transferase group 1 
Protein accessionYP_001581458 
Protein GI161527632 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000000000209107 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAAATTG CTTTAACGTG TCCCGCATTT TTACCTGCTA CTCAGTTTGG AGGGATTTTA 
TTTCTATGTT TAGACATAGC AAGATTTGTT TCAAAAAACC ATGAAACAAC TGTATATACC
ACAGATTTAG ATTTTGATAA CAGCGTCAAC AAATTTAATT CAGAGTTACC AAAGATTGAA
AAATACGAAA AATTTGTAAT TAAAAGAAAC CATGTTTTTT TTAAAATAAA ATTATTTTTT
ATAAATCCAG GATTGTTTTT TCAACTAAAG AAAGATAAGC CTGACGTAAT TCATGCCATA
GGAATTAGAG GGTTTCAAGC ATTTGTTTCT GCAGTATACT CAAAAATTTA CAAAATTCCA
TTATTATTAA GTGATCAAGG AGGATTACAT ACACATCCTG AATATCAAAA AGGGGCGGGT
AAAATTCTAA ATAAAATTCA GGAGCCTTTA GTAAAATTTG TAATTAATCA AGCTTCACAT
ATTATTGCAG CAAATGAATA TGAAAAATCA GTTTTTTTAA AATACTCAAA TGAGAAAAAA
ATTACCATAG TGCATAATGG AATAGATTAC AGAAACTTTG CTGCAAATAA TATAGATTTT
AAAGACAAAT ACAACATTAG TGAGTCATTT ATTTTATTTC TTGGCAGATT CACCAAAATT
AAAGGGATTG ATTTACTGCT TTTATCATTC AAAAAAATTG TTGACAAGAA AAAATTTCAA
GATTTAAAAC TTGTGATTTT AGGAGCCAAT TTTGGATATG AAAGAGAGAT GAATTCAATG
ATAGAAAAAT TAAATTTAAA AGAAAATATT TTGGTTATAG AAAAACCTAC AAGAGGAGAG
GTAATTTCAG CATATCATGC ATGTAAATTT CTTGTACTAC CATCAAGATG GGAAATGTCT
CCACTAACTC CTTTGGAAGG ATTCGCTTGT AAAAAACCAA CAATTAGTAC GAATATCTTT
GGAATACCAT ATGTTGTTTT GAATAACAAA AACGGTTTAC TTTTTGAACC AGAAAGTGTT
GATGATTTAA AAGAAAAAAT TGAAATTTTG TTAGAAGACA AAGAATTGGT AAAAAAACTA
GGAAGTAATG GTTACGAGTT TGTCAAAAAA GAATATTCTT CTGATAATAT GGGCAATCAA
ATTTTGAAAC TATATGAAAA ATCTCAGAAA AAAATGGAAT GA
 
Protein sequence
MKIALTCPAF LPATQFGGIL FLCLDIARFV SKNHETTVYT TDLDFDNSVN KFNSELPKIE 
KYEKFVIKRN HVFFKIKLFF INPGLFFQLK KDKPDVIHAI GIRGFQAFVS AVYSKIYKIP
LLLSDQGGLH THPEYQKGAG KILNKIQEPL VKFVINQASH IIAANEYEKS VFLKYSNEKK
ITIVHNGIDY RNFAANNIDF KDKYNISESF ILFLGRFTKI KGIDLLLLSF KKIVDKKKFQ
DLKLVILGAN FGYEREMNSM IEKLNLKENI LVIEKPTRGE VISAYHACKF LVLPSRWEMS
PLTPLEGFAC KKPTISTNIF GIPYVVLNNK NGLLFEPESV DDLKEKIEIL LEDKELVKKL
GSNGYEFVKK EYSSDNMGNQ ILKLYEKSQK KME