Gene Nmar_0062 details


Gene Information

Locus tag: Nmar_0062
Symbol:
ID: 5774179
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosopumilus maritimus SCM1
Kingdom: Archaea
Replicon accession: NC_010085
Strand:
Start bp: 49278
End bp: 50438
Gene Length: 1161 bp
Protein Length: 386 aa
Translation table: 11
GC content: 30%
IMG OID: 641315679
Product: glycosyl transferase family protein
Protein accession: YP_001581400
Protein GI: 161527574
COG category: [M] Cell wall/membrane/envelope biogenesis; [S] Function unknown
COG ID: [COG0463] Glycosyltransferases involved in cell wall biogenesis; [COG2246] Predicted membrane protein
TIGRFAM ID:
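
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon; the function names are illustrative and are not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 49278
END_BP = 50438
GENE_LENGTH_BP = 1161
PROTEIN_LENGTH_AA = 386


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length: int) -> int:
    """Number of amino-acid codons, assuming one trailing stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Both comparisons hold for this record under the stated assumptions.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the 1161 bp span corresponds to 387 codons, of which 386 encode amino acids and one is the stop codon.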


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value: 0.963311
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTGGTAG AAAAGCCAAA TAATCAGATC TCAATCATAA TTCCTACATA TAATGAATCT 
CAAAATATTC TCAACATTTT AAAATCAATT AAAGAAAATT TACCCAAAAA TATTTCGGCT
CAAGCAATTG TTGTTGATGA TAATTCTCCT GATGGAACAG GGAAAATTGT TGATGATTAT
CTAAAAAATT TGAAGAAAAT TACAAATTAT ACAATTGAAG TCATTCATAG AAAAACAAAA
GATGGTTTAG GTTCTGCAAT TCTTAAAGGA ATTCAGCAAG CAACAGGCGA TACAATTGTT
GTCATGGATT CTGATTTTTC TCATCCACCA CAAATTATTC CAAAATTAGT TGAATCAATA
AAAAAATACC AATACGACAT TGCAGTTGCA TCACGTTACA TTAAAGGTGG TAAAATTGAA
AATTGGTCTG CAAAAAGAAA ACTAATTAGT AAATTTGCAA CACTTATTGC AAAAAAAGGA
TTGGGAATTA ATACAAAAGA TCCAATGTCT GGGTTTTTTG CATTCAAAAA AAATATTCTT
AATGGACTAA ATATTGACGC AATTGGTTAC AAAATCCTTT TGGAAATTCT TGTTAAAACA
AAAAATGTTT CAATTACAGA AATTCCATAC ACATTTCAAG ATAGAGAATT AGGTTCTAGT
AAACTAAGTA TGAAAACAGT CTTTGACTAT TACAAATCGG TTTGGAAGCT TTACAGATAT
GGAAAGCCAG AAGAAGAGAA AGAGAAGAGA AAGTCTGTGA AATTTCTTTA CAAAGCAGCA
AGATTCTATA CAGTTGGAGC TTCTGGATTT GTAGTAAACT ATTTGATTTC ATTATTATTT
GCAGGTGGAA TTTCAGATAT GTGGTACTTG CATGCAAATG TTATTGGAAT TATTGCATCA
ATTTCAACTA ATTTTATTCT AAACAAAGCA TGGACATTTG GAGATAGAGA TTTCAGAATT
AAAAAGACAA TGTCACAATA TGGCAAGTTT GCATTGTTTA GTTCGCTAGG TGCATTAGTA
CAATTAGGAA TGGTGTATTT CCTAGTGGAT AGTGCTGAGA TTTCATATCC ATTAGCATTA
ATTTTAGCAG TGGCTACAGC AGCTTTTGGA AACTTTGTAT TAAACAAGAA ATTTACCTTC
AAAGAAAAAT TGCTAAACTA G
 
Protein sequence
MLVEKPNNQI SIIIPTYNES QNILNILKSI KENLPKNISA QAIVVDDNSP DGTGKIVDDY 
LKNLKKITNY TIEVIHRKTK DGLGSAILKG IQQATGDTIV VMDSDFSHPP QIIPKLVESI
KKYQYDIAVA SRYIKGGKIE NWSAKRKLIS KFATLIAKKG LGINTKDPMS GFFAFKKNIL
NGLNIDAIGY KILLEILVKT KNVSITEIPY TFQDRELGSS KLSMKTVFDY YKSVWKLYRY
GKPEEEKEKR KSVKFLYKAA RFYTVGASGF VVNYLISLLF AGGISDMWYL HANVIGIIAS
ISTNFILNKA WTFGDRDFRI KKTMSQYGKF ALFSSLGALV QLGMVYFLVD SAEISYPLAL
ILAVATAAFG NFVLNKKFTF KEKLLN
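
The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any analysis. The sketch below shows one possible way to strip the spacing, compute a GC fraction, and translate DNA into protein. It uses the standard codon assignments, which translation table 11 shares for sense and stop codons (table 11 differs in the alternative start codons it allows); all function names are hypothetical helpers written for this example.

```python
# Standard codon assignments, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate complete codons in frame 1, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence listed above.
first_line = "ATGTTGGTAG AAAAGCCAAA TAATCAGATC TCAATCATAA TTCCTACATA TAATGAATCT"
dna = clean_sequence(first_line)
print(len(dna))
print(translate(dna))
print(round(gc_fraction(dna), 2))
```

Translating this first line gives the first 20 residues of the protein sequence listed above (MLVEKPNNQI SIIIPTYNES). Pasting the full gene sequence into `clean_sequence` would allow the same comparison against the whole protein and against the reported GC content.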