Gene Nmar_0538 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmar_0538 
Symbol 
ID5773620 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosopumilus maritimus SCM1 
KingdomArchaea 
Replicon accessionNC_010085 
Strand
Start bp478209 
End bp479225 
Gene Length1017 bp 
Protein Length338 aa 
Translation table11 
GC content37% 
IMG OID641316171 
Productgalactose-1-phosphate uridylyltransferase 
Protein accessionYP_001581872 
Protein GI161528046 
COG category[C] Energy production and conversion 
COG ID[COG1085] Galactose-1-phosphate uridylyltransferase 
TIGRFAM ID[TIGR00209] galactose-1-phosphate uridylyltransferase, family 1 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGAGATA TGCGCAAAGA CTATGTTTCT GAGCGTTTCA TGATTGTCTC AAAAAAAGAA 
GACAAAGTAA AAGATCCAAA AAAATCTCCT TTTGCTCCTG GAAATGAATC TATGACAAAT
CCTTCTGTAT TGTCTCTTGT TGCAAAAGAT GGAATGCTAC AAAGACTACA AGACAGTGAT
GATGAATTTG TTGAAGGATG GTCAATCAGA GTTTTTGAAA GTAAAAATCC AATCGTCTCA
GTTGATACTG AAAACTCTTA CAGTGATAGA CCATTTTACA GCGAACCTGC ATATGGATAT
CATTACGTTG TTGTTGCATC TCCAAATCCA AAGGATACTT TTGCAACCAT TGACACTGAA
CAATGGTCAA ACATCTTAGT AGTAGTTCAA GATAGATTGA GATGGCTTTA CACTCAAAAA
GGTGTAACAT ATGTTTCAAT TTACGCTGAT CAAGGAGAAC TTTCTGGCAG TGCAAATTCT
CACCCTCATC TCAATATTCT TACCTTTTCA ACAATCCCTC CTATTATTGA AGAAGAGGCA
GAGGCATCTC ACAAAATTCT AAATGAAAAG GGTGTATGCC CAATGTGTCA GACTGTAAAT
GAGGAAATTG GTGGTCCTAG GCAAGTTCTT CAAACTGAAG GTTTTATTGC ATTTTGCCCT
TGGTCTCCAT CCTATCCATA TGAGTTTTGG ATTGCACCCA AGAAACACAC TACTAGCTTC
TCAAAGATTA CTCAAAAAGA AATTAACGAT TTGTCCTTGA TACTTAGAGC TACTCTTGGT
GGTTTGTCTC AAACTATCAA AAATGTGTCC TACAATCTAG TATTCCACCT TTCTCCTGAG
AAAAAGAATA GTAGACAAAT TCATTGGCAT ATTGAAATTT ACCCAATCAC AAAATCTTGG
TCTGGTTTGG AACGTGGTTA TGGAATTTTC TTAAATGATA TCTCTCCTGA AGAGGCTGCA
GAAAAACTAG GTGCTGCTTG CAGAAAGGAA CTGGCTAATC TAGTTGGAAT TGTGTGA
 
Protein sequence
MGDMRKDYVS ERFMIVSKKE DKVKDPKKSP FAPGNESMTN PSVLSLVAKD GMLQRLQDSD 
DEFVEGWSIR VFESKNPIVS VDTENSYSDR PFYSEPAYGY HYVVVASPNP KDTFATIDTE
QWSNILVVVQ DRLRWLYTQK GVTYVSIYAD QGELSGSANS HPHLNILTFS TIPPIIEEEA
EASHKILNEK GVCPMCQTVN EEIGGPRQVL QTEGFIAFCP WSPSYPYEFW IAPKKHTTSF
SKITQKEIND LSLILRATLG GLSQTIKNVS YNLVFHLSPE KKNSRQIHWH IEIYPITKSW
SGLERGYGIF LNDISPEEAA EKLGAACRKE LANLVGIV