Gene Nmar_0123 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmar_0123 
Symbol 
ID5774409 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosopumilus maritimus SCM1 
KingdomArchaea 
Replicon accessionNC_010085 
Strand
Start bp112659 
End bp113699 
Gene Length1041 bp 
Protein Length346 aa 
Translation table11 
GC content25% 
IMG OID641315743 
Productglycosyl transferase family protein 
Protein accessionYP_001581461 
Protein GI161527635 
COG category[R] General function prediction only 
COG ID[COG1216] Predicted glycosyltransferases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000000000010329 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGAATACAG AAGAGCCATT AGTCAGCATT ATTATTTTAA ATTATAATGC AGGTAAATTA 
ATTGAAAACT GTATTGAATC AATTCATAAA AGTGATTACA AGAATTTTGA AATAATATTA
GTTGATAATG TATCGACAGA TAATAGTCAA AATAAATGTA AAAAAAAATT TCCTGAAATT
AAACTAATAC AAAATCAAGA AAATTTAGGA TATTGTGGAG GAAACAATAT AGGGATAAAA
ACTGCCAAAG GGGAATTTAT TGTGATACTA AATCCAGATA CAATTGTAGA AAAATCTTGG
TTAAAGGAAT TTCTTCAAGA GTATAAAAAA ATAGGTTTAG GCCTTTACCA ACCAAAATTA
TTAGCATTAG ACGATACATC TAGAATAAAT TCTGCTGGAA ACATGATTCA AATTTTTGGT
TTTGGGTATT CTTTTGGAAA GGGTGAGAAA GAAAATTCAA ATCATGATAA AAATTATCTA
ATTAATTATG CTTCTGGTGC ATGCCTTTTT ACTACAAAGC AAGTATTAGA AAAAATCGGT
TTCTTTGATG ATTTTTTGTT TGCATATCAT GATGATTTAG AATTGGGATG GAGAGCTAGA
CAGTTAGGAA TTAAATCACA CTATGTTCCT AGGTGTGTGG TATATCATGC TGAAAGTTTT
AGTTTTGGTT GGAGCAAGAA AAAATATTTT CTTTTAGAAA GAAATAGACA TTATTGTTTA
CTGACACATT ATTCTAGAAA AACATTTTTT AAAATGCTAC CATCTTTGAT CATAATCGAA
ATAATTGTAA TAATGTTTTA CTTATCAAAA GGAATGATAA AAGAAAAAAT TGAGGGATAT
TCAAATATTT TAAAAAACTG GAATGGAATT AAGAAAAAAT ATTTAGAAAT AGAATCAAAG
AAAGAAATTA AAGATGTAGA AATCATCAAA GAATTTAAAA ATCAAATTGA AATTCCAAGC
ATAGTAACAG GAAGAATATA TTCAAAAAAA ATTAATTATA TTCTAAATAT CTTGTCAAAA
TTTTTTATTA AAATTTTATA A
 
Protein sequence
MNTEEPLVSI IILNYNAGKL IENCIESIHK SDYKNFEIIL VDNVSTDNSQ NKCKKKFPEI 
KLIQNQENLG YCGGNNIGIK TAKGEFIVIL NPDTIVEKSW LKEFLQEYKK IGLGLYQPKL
LALDDTSRIN SAGNMIQIFG FGYSFGKGEK ENSNHDKNYL INYASGACLF TTKQVLEKIG
FFDDFLFAYH DDLELGWRAR QLGIKSHYVP RCVVYHAESF SFGWSKKKYF LLERNRHYCL
LTHYSRKTFF KMLPSLIIIE IIVIMFYLSK GMIKEKIEGY SNILKNWNGI KKKYLEIESK
KEIKDVEIIK EFKNQIEIPS IVTGRIYSKK INYILNILSK FFIKIL