Gene Nmul_A1238 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A1238 
Symbol 
ID3785577 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp1423800 
End bp1424789 
Gene Length990 bp 
Protein Length329 aa 
Translation table11 
GC content55% 
IMG OID637811323 
Producturease accessory protein UreD 
Protein accessionYP_411933 
Protein GI82702367 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0829] Urease accessory protein UreH 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAGTTATT CCGGGAGCGC CATCTTGAAA TCAGCCGCAA AAACCCATTT GAACCCGCCT 
GATGGGAATT CGACACCTTC ATCCGGCAGT ATTGCTACTC TGTGCCCGCC CACCTCTTCT
GAAGGAAGGC CATCGGATCC GAAGGTGAAT ATGGCTTCAG ATTCTCCATT GAGAGCTCAT
TTGCGGTTGA AGTTTGCGGA GAGTTCCGGC ATCACCCGCA TGGTGGAGCG AGATCATCAT
GGCCCCTTGT TGGTGCAGAA ACCTCTCTAT CCGGAGGGTT ATGAGGTATG CCAAGCTGTT
GTCATACACC CGCCGGGAGG CGTGGTCGCA GGGGATGAAT TGGGAATACG AGTACATGTC
GGTCCATCCG CTCATGCTCA GATAACTTCT CCCGGCGCAA CAAAATGGTA CAAATCCAAA
GGTCGGACCG CACGCCAGCA CGTTTACCTG CATGCGGAAG CAGGCGGTGT ACTGGAGTGG
ATGCCGCAGG AAACGATTTT TTTCAATAAT GCAAGAGTGA TGCTCCATCA CGAGGTCGAG
CTGGAGAAAG ATTCGGTTTA CATGAGTTGC GAGATTCTAT GCTTTGGTCG TACGGCATTC
GGAGAATCGT TCGATAGCGG TGAGATAAAA CAGCATACGA GTATCCGCCA GGAGGGAAAG
CTGGTCTGGT TTGAGAAGCT TCGTCTGGAG GGCGGAAGCA AAGCGATGAA TGGAAGGCTT
GCACTTGCCG GCCGCGCCGT TTGCGCCACT TTTATCATGA GTGGCAAACC CCTTCCAGCG
CAGGCGATCG ATCTTGTACG GGAAGAGGCG GTGCGCATCG GCGGAGAATC GGGGCAGGTG
GGGATTACCC AATTGAAATC GCTGCTGGTG GCACGTTTTC TGGGAGATTC GAGTGAAGTG
GCCAGACATG TGATGCTTTG CATCTGGCGG GCCGTACGCC CCATCACGCT CGGCCGGCCT
GCGATCGTGC CGCGCAGCTG GAATACCTGA
 
Protein sequence
MSYSGSAILK SAAKTHLNPP DGNSTPSSGS IATLCPPTSS EGRPSDPKVN MASDSPLRAH 
LRLKFAESSG ITRMVERDHH GPLLVQKPLY PEGYEVCQAV VIHPPGGVVA GDELGIRVHV
GPSAHAQITS PGATKWYKSK GRTARQHVYL HAEAGGVLEW MPQETIFFNN ARVMLHHEVE
LEKDSVYMSC EILCFGRTAF GESFDSGEIK QHTSIRQEGK LVWFEKLRLE GGSKAMNGRL
ALAGRAVCAT FIMSGKPLPA QAIDLVREEA VRIGGESGQV GITQLKSLLV ARFLGDSSEV
ARHVMLCIWR AVRPITLGRP AIVPRSWNT