Gene Nmar_0194 details


Gene Information

Locus tag: Nmar_0194
Symbol:
ID: 5772997
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosopumilus maritimus SCM1
Kingdom: Archaea
Replicon accession: NC_010085
Strand:
Start bp: 176568
End bp: 177590
Gene Length: 1023 bp
Protein Length: 340 aa
Translation table: 11
GC content: 30%
IMG OID: 641315812
Product: glycosyltransferase family 28 protein
Protein accession: YP_001581528
Protein GI: 161527702
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0707] UDP-N-acetylglucosamine:LPS N-acetylglucosamine transferase
TIGRFAM ID: [TIGR00661] conserved hypothetical protein
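
The coordinate and length fields above are mutually consistent, which a short sketch can confirm. It assumes that the coordinates are 1-based and inclusive and that the gene length includes the stop codon. Both are assumptions about how this record defines its fields, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 176568
end_bp = 177590

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted as a residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1023 340
```

Under these assumptions the computed values match the listed Gene Length (1023 bp) and Protein Length (340 aa).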


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value: 0.775118
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGTTCTTG TAAATTTTTT TTCTAGTCCA ATAGGATTAG GTCATGTAAC AAGAGATATT 
GCAATCAAGA ATAATTTCCA AAATATTACA ACTAATTTTG TTACAGGTAG TGGTGCTGCT
AAAATTCTAA AGAAATTAGA AATTCAGGTT GATGATGTAT ATCATCCACC ATCATTCATT
GTTGAGAATG GTACATTGAA AAGTCCTGCA AAATGGCTTT GGAATTACTA TCAATACTAT
AAAGATTGTA AAAACATTTC ACGAAATATT TTAGAAAAAA ATAGATCTAA TATTGTGATT
AGTGATGAGG ATTTTGCTTC ACTAACAGTA GCTCAAGAAA TGAAAATTCC AACTATTTTG
GTTACTGATA TTTTAGAGAC ACATTTTACA AAAGGTCTAG CATCATTTAT CGAAAAAAAG
ATGAATAAAT CAATGCAAGA GATCATAAAA AAATGTGAAA TTGTCATATT GCCAGAAATA
GGTGATGCAC AAGACAACAT ACAAAGAGTA GGACCCATAG TACGACAAAC AGATCACACT
AGAGAACAAT TACGAGAAAA ATTTTCATTT GATAAAAAAA CAATTGTTAT TTCAATTGGT
GGAACTGATG CAGGATTGTT TTTAATTGAA AAAGCACTAG AGGCAATTAC AAAAATCAAT
CAAGATGTTA AAATTGTACT AGTTTCAGGT CCATCAGTTG AAAAAAAATT TGAGAATGTA
GAAAATTTGG GATTTGTAGA AAATTTGCAT GAAATAATTT TTGCAGCTGA TGTGTTAATT
TCACTTGCAG GAAAATCAAC AATTGATGAG GCTAATGCAT ATGGTACGCC CGCAATATTC
ATTCCAATTA AAGGTCATTT TGAACAAGAG GATAATGCGA AAGAACAAGG ATTTGTTTTT
GAAGATATCA AAAGACTTGA CAAGTTAATT CTATCAAAAT TAGAAGAAAA GAGAAATAAA
GTCAATACCG AAGGTGCAGT AAAAGCTGCA AAAATCATTC AAAGCTTAAT AGATAACTAT
TGA
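
A GC content value like the one listed for this gene can be recomputed from the sequence text. The sketch below is one plausible way to do so; how this page derives its own figure is not stated, and the function name `gc_content` is hypothetical. It strips the spaces and line breaks used in the layout above before counting.

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    # Remove the spaces and line breaks used to group bases in blocks of ten.
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First line of the gene sequence, in the same grouped layout as above.
first_line = "GTGGTTCTTG TAAATTTTTT TTCTAGTCCA ATAGGATTAG GTCATGTAAC AAGAGATATT"
print(f"{gc_content(first_line):.1f}% GC in the first 60 bases")
```

Passing the full gene sequence instead of `first_line` gives a figure to compare against the reported GC content.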
 
Protein sequence
MVLVNFFSSP IGLGHVTRDI AIKNNFQNIT TNFVTGSGAA KILKKLEIQV DDVYHPPSFI 
VENGTLKSPA KWLWNYYQYY KDCKNISRNI LEKNRSNIVI SDEDFASLTV AQEMKIPTIL
VTDILETHFT KGLASFIEKK MNKSMQEIIK KCEIVILPEI GDAQDNIQRV GPIVRQTDHT
REQLREKFSF DKKTIVISIG GTDAGLFLIE KALEAITKIN QDVKIVLVSG PSVEKKFENV
ENLGFVENLH EIIFAADVLI SLAGKSTIDE ANAYGTPAIF IPIKGHFEQE DNAKEQGFVF
EDIKRLDKLI LSKLEEKRNK VNTEGAVKAA KIIQSLIDNY
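
The protein sequence corresponds to the gene sequence read under translation table 11. The sketch below illustrates this for the first ten codons only. It builds the codon table from the standard 64-letter amino acid string (table 11 uses the same codon assignments as the standard code) and assumes that the first codon, here GTG, is an initiation codon read as methionine. The names `CODON_TABLE` and `translate` are illustrative and do not come from any particular library.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    residues = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    # Assumption: the first codon is a start codon, so an alternative
    # start such as GTG is read as Met rather than Val.
    if residues:
        residues[0] = "M"
    return "".join(residues)


first_codons = "GTGGTTCTTG TAAATTTTTT TTCTAGTCCA"
print(translate(first_codons))  # MVLVNFFSSP
```

The output matches the first ten residues of the protein sequence above.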