Gene Nmar_0117 details


Gene Information

Locus tag: Nmar_0117
Symbol:
ID: 5773852
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosopumilus maritimus SCM1
Kingdom: Archaea
Replicon accession: NC_010085
Strand:
Start bp: 105630
End bp: 106637
Gene Length: 1008 bp
Protein Length: 335 aa
Translation table: 11
GC content: 29%
IMG OID: 641315737
Product: polysaccharide biosynthesis protein CapD
Protein accession: YP_001581455
Protein GI: 161527629
COG category: [G] Carbohydrate transport and metabolism; [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1086] Predicted nucleoside-diphosphate sugar epimerases
TIGRFAM ID:
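
The length fields in this record can be cross-checked against the coordinates. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 105630
end_bp = 106637
reported_gene_length = 1008
reported_protein_length = 335


def gene_length(start, end):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(n_bases):
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    return n_bases // 3 - 1


computed_gene_length = gene_length(start_bp, end_bp)
computed_protein_length = protein_length(computed_gene_length)
print(computed_gene_length, computed_protein_length)  # prints: 1008 335
```

Under those assumptions, both computed values agree with the reported Gene Length and Protein Length.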


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00000000592797
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
TTGCAATCAA TTTTAAATAA AAAAAATATT CTTGTTACTG GTGGAACCGG TTCTATAGGA 
CAAGCTTTGG TTCAAAGAGC TATATCTGAT GGCGCAAAAC ATATCAAAGT TTTCAGTAAT
GATGAAAATG CCCTTTATGA AATGGAATTA GATTTTTCTA AACATAAAAA CATCGAATAC
ATAATTGGCG ATATTAGAGA TTTTGATAAA ATCAATTCAA TCGTAAAAAA TTGTGATATT
ATTTTCCATG CCGCTGCACT CAAACATGTA GATAGATGTG AATTATATCC ATTAGAAACA
ATGACCGTAA ACATAATTGG AACAAATAAT GTTGCAAAAG CTGCAGTCAA TGCGAATGTA
TCAAAAGTTA TTTCTATTAG TACTGATAAA GCTGTAAATC CTATAGGTGT GATGGGTGCA
ACAAAACTTC TTGCAGAAAA ATTAATTGCC GCTGAAGCAT ATCATTCAAA ATCAAAGACA
GTTTTTTCCT CTGTACGATT TGGAAATGTA TTTCATACTA GAGGTTCAAT ATTACCTAAG
ATAGAAAAAC AAATTCAAAA TGGTGGTCCT TTAACATTAA CTGATGAAAG AATGAAACGA
TTTTTTATGA CTAAAGAAGA TGCAGTTGAT TTAATTTTAA ACGCAGCTTA TACTGCTAAA
GGTGGAGAAA CTTTTATTCT CAAAATGCCT ATGCTGAATT TAAAAGATCT TTTTGAAGCA
ATGAAAATTG TAATTGGTCC AAAACATGGG TATTCATCAA CAAAAATAAA AACAAAAATT
ACTGGAATTA GACCTGGTGA AAAATTAACC GAATATCTAT TAACAAATTT TGAAATGGAA
CATTGTTTAG AAACAAAGAA TTTTTTTATA ATTCCTAAAA TGTTTGAATC TTTAGATCCC
AAAAAATATC CTGGTTCAAA AAAACCAAAG AACACAACAA AGTATTTCGA AACTGTTAAA
CCAATTTCAC AAGAACAGAT TGTCAAACTT TTAAAAAAAA TTTATTAA
 
Protein sequence
MQSILNKKNI LVTGGTGSIG QALVQRAISD GAKHIKVFSN DENALYEMEL DFSKHKNIEY 
IIGDIRDFDK INSIVKNCDI IFHAAALKHV DRCELYPLET MTVNIIGTNN VAKAAVNANV
SKVISISTDK AVNPIGVMGA TKLLAEKLIA AEAYHSKSKT VFSSVRFGNV FHTRGSILPK
IEKQIQNGGP LTLTDERMKR FFMTKEDAVD LILNAAYTAK GGETFILKMP MLNLKDLFEA
MKIVIGPKHG YSSTKIKTKI TGIRPGEKLT EYLLTNFEME HCLETKNFFI IPKMFESLDP
KKYPGSKKPK NTTKYFETVK PISQEQIVKL LKKIY
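
The gene sequence begins with TTG while the protein begins with M. This is consistent with translation table 11, which lists TTG among its alternative initiation codons; an initiation codon is read as methionine. The sketch below illustrates this in plain Python. It is an illustration under stated assumptions, not the pipeline that produced this record: the helper names `clean` and `translate` are hypothetical, and the codon table is the standard amino-acid assignment in TCAG order, which table 11 shares.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments,
# which translation table 11 also uses for elongation).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Initiation codons listed for translation table 11.
TABLE11_STARTS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq):
    """Remove the spaces and line breaks used to block the sequence."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate a coding sequence in frame 1, stopping at the first stop."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and codon in TABLE11_STARTS:
            aa = "M"  # an initiation codon is read as methionine
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
first_block = "TTGCAATCAA TTTTAAATAA AAAAAATATT"
print(translate(first_block))  # prints: MQSILNKKNI
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` in the same way would allow the whole translation to be compared against the protein listed here.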