Gene Nmul_A0680 details

Gene Information

Locus tag: Nmul_A0680
Symbol:
ID: 3784057
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 779609
End bp: 780658
Gene Length: 1050 bp
Protein Length: 349 aa
Translation table: 11
GC content: 55%
IMG OID: 637810762
Product: UTP-glucose-1-phosphate uridylyltransferase
Protein accession: YP_411379
Protein GI: 82701813
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1210] UDP-glucose pyrophosphorylase
TIGRFAM ID: [TIGR01099] UTP-glucose-1-phosphate uridylyltransferase
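
The start, end, gene length and protein length fields above are mutually consistent, which a little arithmetic confirms. The sketch below assumes that the coordinates are 1-based and inclusive and that the CDS ends in a single untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the record; the variable names are illustrative only.
start_bp = 779609
end_bp = 780658

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values agree with the 1050 bp and 349 aa listed in the record.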


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCATTA CCGCCACGCC TGTAATGAAC CGGAAAGATG TACTTTCGTT GACTGCTGGA 
CCCGCTGTTC GCAAATCCCC GCGAGCGGTC CGCAAGGCTG TGTTTCCCGT AGCCGGACTT
GGCACCCGTT TCCTGCCCGC GACCAAGGCA GTGGCCAAAG AAATGCTGCC CATTGTGGAC
AAGCCGTTGA TCCAGTATGC GGTGGAAGAA GCGGCTGCGG CAGGTATCGA AGAAATCATT
TTCATTACCC ATCGGAGCAA GCGCGCCATT GAAGACCATC TGCACCGGGC TGTGGAACTG
GAAAGTGAAT TAGCCTCGCA GGGAAAACAC GCTTCCCTGA AAATGCTGCG CCAGTTGACG
CCGGGTGGCC TTCATTTCAG CTTTGTCCGG CAGGAGGAGC CGCGGGGTTT GGGGCATGCA
ATTTACTGCG CGCGACATCT CGTGGGCAAC GAACCGTTCG CCGTACTGCT TCCGGACGAT
TTGATCGATG GAGATCCTCC TGTGCTGGCA CAGATGGTGT CCCAGTATGA ACAAGTCCAA
AGCAGCCTCA TAGCGGTGCG CGAGGTTACG CGCGAACAGA CGCGGCGGTA TGGAATTGTG
GATGCTTTTG ATGCAGAGGC AGAGAGCGAT ACGCTGAAAA TCAGGGGGGT AGTGGAAAAA
CCTTCTCCTG ACGCTGCGCC ATCCACGATG GCTATCGTAG GTCGTTACGT TCTGTCACCC
GCCATTTTTG ACTGCATCAG CAATCTCAAC CCGGGAACAG GAGGGGAAAT TCAGCTTACC
GACGGAATCT CCCGTCTTCT CAAGCTGGAA TCTGTCCTGG CCTACCGTTA CCAGGGGAAG
CATTATGATT GCGGCAGCAA GGCGGGCTTC CTGGAGGCAA CCATCGCCTA TGGTTTGCAG
CACCCGGAAG TGGCGATGGA GTTCAGGGAA ACCTTATTAA AGATAGGACA GGAACTTATT
CGCCAGGAGC TCATTCAGGA GTTTCAACAG GATCCGGAAC CTGTCGCCGC TGTTGCAAAC
GAGCCCATAT TGAAGGCGGC TCAGGCATGA
 
Protein sequence
MSITATPVMN RKDVLSLTAG PAVRKSPRAV RKAVFPVAGL GTRFLPATKA VAKEMLPIVD 
KPLIQYAVEE AAAAGIEEII FITHRSKRAI EDHLHRAVEL ESELASQGKH ASLKMLRQLT
PGGLHFSFVR QEEPRGLGHA IYCARHLVGN EPFAVLLPDD LIDGDPPVLA QMVSQYEQVQ
SSLIAVREVT REQTRRYGIV DAFDAEAESD TLKIRGVVEK PSPDAAPSTM AIVGRYVLSP
AIFDCISNLN PGTGGEIQLT DGISRLLKLE SVLAYRYQGK HYDCGSKAGF LEATIAYGLQ
HPEVAMEFRE TLLKIGQELI RQELIQEFQQ DPEPVAAVAN EPILKAAQA
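
The relationship between the gene sequence and the protein sequence can be checked by translating the DNA codon by codon. The following is a minimal sketch, not the tool the database used: it relies on translation table 11 sharing its amino-acid assignments with the standard genetic code (the two differ only in permitted start codons), and the helper names `clean`, `translate` and `gc_content` are hypothetical. It does not handle alternative start codons, which is harmless here because the gene starts with ATG.

```python
from itertools import product

# Amino-acid assignments in NCBI order (bases T, C, A, G). Table 11 uses the
# same assignments as the standard code; only the start codons differ.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks that group the sequence in tens."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(dna):
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases and last 30 bases of the gene sequence listed above.
gene_start = "ATGAGCATTA CCGCCACGCC TGTAATGAAC"
gene_end = "GAGCCCATAT TGAAGGCGGC TCAGGCATGA"

print(translate(gene_start))  # MSITATPVMN
print(translate(gene_end))    # EPILKAAQA
```

The two printed fragments match the first ten and last nine residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_content` allows the same check against the complete protein and the GC content field.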