Gene Nmul_A1241 details


Gene Information

Locus tag: Nmul_A1241
Symbol: ureC
ID: 3785580
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 1425586
End bp: 1427292
Gene Length: 1707 bp
Protein Length: 568 aa
Translation table: 11
GC content: 54%
IMG OID: 637811326
Product: urease subunit alpha
Protein accession: YP_411936
Protein GI: 82702370
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0804] Urea amidohydrolase (urease) alpha subunit
TIGRFAM ID: [TIGR01792] urease, alpha subunit
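
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the coding sequence ends with a single stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 1425586
end_bp = 1427292
gene_length_bp = 1707
protein_length_aa = 568

# Assumption: coordinates are 1-based and inclusive, so the span
# includes both the first and the last base.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
total_codons = span_bp // 3
coding_codons = total_codons - 1

print(span_bp, total_codons, coding_codons)  # 1707 569 568
```

Under these assumptions the span matches the listed gene length, and the codon count minus the stop codon matches the listed protein length.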


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.630268
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTTTCA AGGTTTCCCG CCAGACTTAC GCGGAATTGA TGGGCCCTAC GACCGGCGAC 
CGGATTCGCC TGGCTGATAC CGAGCTGATG ATCGAAATCG AGAAGGATTT CACCACCTAT
GGTGAAGAAG TGAAATTCGG GGGCGGCAAG GTAATACGCG ACGGCATGGG GCAATCACAG
CACAATCACG ATCAAGTGAT GGATACGGTC ATCACCAACG CGGTAATCAT CGACCACTGG
GGCATCGTCA AGGCTGATGT CGGTCTGAAG AACGGGAGGA TCGCCGAGAT CGGCAAAGCG
GGCAATCCCG ATATCCAGCC GAATGTTACG ATGTCGATCG GCGCGGCGAC GGAAATCATA
GCCGGCGAGA ACATGATTCT GACCGCGGGT GGAATCGATT CACATATCCA CTTCATTTCC
CCGCAGCAGG CAGAAGATGC GATGATGAAT GGCATTACCA CGATGCTGGG GGGAGGCACT
GGCCCGGCAG CGGGCACGGC AGCCACGACC TGTACGCCGG GCCCCTGGCA TATTCATTCC
ATGTTGCGGG CCTCGGATGG CATGGTGATG AACACCGGTT TTTACGGCAA GGGAAATGTA
AGTCTTCCGA CCCCGCTGGA AGAGCAGATT CTTGCCGGGG CATGCGGACT GAAGCTGCAC
GAAGACTGGG GTTCGACCTA CGCAGCTATC GACAATTGCC TGGCGGTGGC GGACAAGTAT
GATGTCCAGG TTGCCGTTCA CACAGATACC ATCAATGAAG GCGGATATCT GGAAAACACG
ATTGCAGCCA TGAAGGACCG TACCATCCAT ACTTTCCACA CCGAGGGGGC CGGTGGAGGG
CATGCGCCGG ACATCATCGC CGTCGTAGGT CAGGAAAACG TGCTGCCCTC ATCAACCAAT
CCAACCCGGC CTTATACCAT CAACACGCTG GATGAACATC TCGACATGCT GATGGTATGC
CATCACCTTC ATGCCAACAT CCCGGAAGAT CTCGCGTTTG CCGAGTCGCG CATTCGTAAG
GAGACCATAG CTGCGGAGGA TATACTGCAG GATATGGGTG CAATCTCCAT GATGTCCTCT
GATTCGCAGG CGATGGGGCG GATTGGAGAG GTGGTTCTGC GTACCTGGCA AACCGCGCAC
AAAATGAAAA TACAACGCGG GACTTTGCAG GAAGATACAT CGAAGAATGA TAATTTCCGC
GTCAAACGCT ACATCGCCAA ATACACCATC AATCCGGCGA TCACACATGG CATCTCGCAC
GCCCTCGGTT CGGTGGAGGT GGGCAAATAT GCAGATCTGG TGTTGTGGCG GCCCGCGTTT
TTTGGCGTAA AGCCTTCCGT GATTCTGAAA GGTGGAATGA TTGCGGCATC TTTAATGGGT
GATCCGAACG CCTCGATTCC CACCCCGCAA CCTGTCCATT ACCGTTACAT GTTCGGGGGA
TATGGCGGGG GTATCAAGAC CTCGTGCTTT ACCTTTGTCT CACAGGCAGC GCTGGCTGCA
GGTCTGGTCG ATCAATTGAA GCTGGATAAG AATCTGATCG AGGTCAAGAA TACGCGCAAC
CTGCGCAAGA AAGATATGAT CCATAACTCG GCGACACCTA AAATGGAAGT CGATCCGGAA
ACATACGAAG TTCGGGCGGA TGGACAGTTG CTGACCTGCG GGGCGGAGGA CGTTCTGCCG
ATGGCGCAAA GGTATTTCCT TTTTTAG
 
Protein sequence
MSFKVSRQTY AELMGPTTGD RIRLADTELM IEIEKDFTTY GEEVKFGGGK VIRDGMGQSQ 
HNHDQVMDTV ITNAVIIDHW GIVKADVGLK NGRIAEIGKA GNPDIQPNVT MSIGAATEII
AGENMILTAG GIDSHIHFIS PQQAEDAMMN GITTMLGGGT GPAAGTAATT CTPGPWHIHS
MLRASDGMVM NTGFYGKGNV SLPTPLEEQI LAGACGLKLH EDWGSTYAAI DNCLAVADKY
DVQVAVHTDT INEGGYLENT IAAMKDRTIH TFHTEGAGGG HAPDIIAVVG QENVLPSSTN
PTRPYTINTL DEHLDMLMVC HHLHANIPED LAFAESRIRK ETIAAEDILQ DMGAISMMSS
DSQAMGRIGE VVLRTWQTAH KMKIQRGTLQ EDTSKNDNFR VKRYIAKYTI NPAITHGISH
ALGSVEVGKY ADLVLWRPAF FGVKPSVILK GGMIAASLMG DPNASIPTPQ PVHYRYMFGG
YGGGIKTSCF TFVSQAALAA GLVDQLKLDK NLIEVKNTRN LRKKDMIHNS ATPKMEVDPE
TYEVRADGQL LTCGAEDVLP MAQRYFLF
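
The relationship between the gene and protein sequences above can be illustrated with a short translation sketch. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in which codons may act as start codons), and it uses only the first and last lines of the gene sequence as input. The names `CODON_TABLE`, `translate`, `head`, and `tail` are illustrative and do not come from the record.

```python
from itertools import product

# Standard codon-to-amino-acid assignments in TCAG order.
# Assumption: table 11 uses these same assignments for elongation.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1; '*' marks a stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    usable = len(seq) - len(seq) % 3    # ignore any trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First and last lines of the gene sequence, copied from the record.
head = "ATGAGTTTCA AGGTTTCCCG CCAGACTTAC GCGGAATTGA TGGGCCCTAC GACCGGCGAC"
tail = "ATGGCGCAAA GGTATTTCCT TTTTTAG"

print(translate(head))  # MSFKVSRQTYAELMGPTTGD
print(translate(tail))  # MAQRYFLF*
```

The two outputs correspond to the first 20 residues and the last 8 residues of the listed protein sequence, with the trailing `*` marking the TAG stop codon.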