Gene Information |
Locus tag | Nmul_A1241 |
Symbol | ureC |
ID | 3785580 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | + |
Start bp | 1425586 |
End bp | 1427292 |
Gene Length | 1707 bp |
Protein Length | 568 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 637811326 |
Product | urease subunit alpha |
Protein accession | YP_411936 |
Protein GI | 82702370 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0804] Urea amidohydrolase (urease) alpha subunit |
TIGRFAM ID | [TIGR01792] urease, alpha subunit |
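
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and are not part of any database API.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Amino-acid count for an unspliced CDS that ends in one stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return cds_length_bp // 3 - 1


gene_len = feature_length(1425586, 1427292)
print(gene_len)                           # 1707
print(expected_protein_length(gene_len))  # 568
```

Both results match the Gene Length (1707 bp) and Protein Length (568 aa) rows of the record.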
| |
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.630268 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTTTCA AGGTTTCCCG CCAGACTTAC GCGGAATTGA TGGGCCCTAC GACCGGCGAC CGGATTCGCC TGGCTGATAC CGAGCTGATG ATCGAAATCG AGAAGGATTT CACCACCTAT GGTGAAGAAG TGAAATTCGG GGGCGGCAAG GTAATACGCG ACGGCATGGG GCAATCACAG CACAATCACG ATCAAGTGAT GGATACGGTC ATCACCAACG CGGTAATCAT CGACCACTGG GGCATCGTCA AGGCTGATGT CGGTCTGAAG AACGGGAGGA TCGCCGAGAT CGGCAAAGCG GGCAATCCCG ATATCCAGCC GAATGTTACG ATGTCGATCG GCGCGGCGAC GGAAATCATA GCCGGCGAGA ACATGATTCT GACCGCGGGT GGAATCGATT CACATATCCA CTTCATTTCC CCGCAGCAGG CAGAAGATGC GATGATGAAT GGCATTACCA CGATGCTGGG GGGAGGCACT GGCCCGGCAG CGGGCACGGC AGCCACGACC TGTACGCCGG GCCCCTGGCA TATTCATTCC ATGTTGCGGG CCTCGGATGG CATGGTGATG AACACCGGTT TTTACGGCAA GGGAAATGTA AGTCTTCCGA CCCCGCTGGA AGAGCAGATT CTTGCCGGGG CATGCGGACT GAAGCTGCAC GAAGACTGGG GTTCGACCTA CGCAGCTATC GACAATTGCC TGGCGGTGGC GGACAAGTAT GATGTCCAGG TTGCCGTTCA CACAGATACC ATCAATGAAG GCGGATATCT GGAAAACACG ATTGCAGCCA TGAAGGACCG TACCATCCAT ACTTTCCACA CCGAGGGGGC CGGTGGAGGG CATGCGCCGG ACATCATCGC CGTCGTAGGT CAGGAAAACG TGCTGCCCTC ATCAACCAAT CCAACCCGGC CTTATACCAT CAACACGCTG GATGAACATC TCGACATGCT GATGGTATGC CATCACCTTC ATGCCAACAT CCCGGAAGAT CTCGCGTTTG CCGAGTCGCG CATTCGTAAG GAGACCATAG CTGCGGAGGA TATACTGCAG GATATGGGTG CAATCTCCAT GATGTCCTCT GATTCGCAGG CGATGGGGCG GATTGGAGAG GTGGTTCTGC GTACCTGGCA AACCGCGCAC AAAATGAAAA TACAACGCGG GACTTTGCAG GAAGATACAT CGAAGAATGA TAATTTCCGC GTCAAACGCT ACATCGCCAA ATACACCATC AATCCGGCGA TCACACATGG CATCTCGCAC GCCCTCGGTT CGGTGGAGGT GGGCAAATAT GCAGATCTGG TGTTGTGGCG GCCCGCGTTT TTTGGCGTAA AGCCTTCCGT GATTCTGAAA GGTGGAATGA TTGCGGCATC TTTAATGGGT GATCCGAACG CCTCGATTCC CACCCCGCAA CCTGTCCATT ACCGTTACAT GTTCGGGGGA TATGGCGGGG GTATCAAGAC CTCGTGCTTT ACCTTTGTCT CACAGGCAGC GCTGGCTGCA GGTCTGGTCG ATCAATTGAA GCTGGATAAG AATCTGATCG AGGTCAAGAA TACGCGCAAC CTGCGCAAGA AAGATATGAT CCATAACTCG GCGACACCTA AAATGGAAGT CGATCCGGAA ACATACGAAG TTCGGGCGGA TGGACAGTTG CTGACCTGCG GGGCGGAGGA CGTTCTGCCG ATGGCGCAAA GGTATTTCCT TTTTTAG
|
Protein sequence | MSFKVSRQTY AELMGPTTGD RIRLADTELM IEIEKDFTTY GEEVKFGGGK VIRDGMGQSQ HNHDQVMDTV ITNAVIIDHW GIVKADVGLK NGRIAEIGKA GNPDIQPNVT MSIGAATEII AGENMILTAG GIDSHIHFIS PQQAEDAMMN GITTMLGGGT GPAAGTAATT CTPGPWHIHS MLRASDGMVM NTGFYGKGNV SLPTPLEEQI LAGACGLKLH EDWGSTYAAI DNCLAVADKY DVQVAVHTDT INEGGYLENT IAAMKDRTIH TFHTEGAGGG HAPDIIAVVG QENVLPSSTN PTRPYTINTL DEHLDMLMVC HHLHANIPED LAFAESRIRK ETIAAEDILQ DMGAISMMSS DSQAMGRIGE VVLRTWQTAH KMKIQRGTLQ EDTSKNDNFR VKRYIAKYTI NPAITHGISH ALGSVEVGKY ADLVLWRPAF FGVKPSVILK GGMIAASLMG DPNASIPTPQ PVHYRYMFGG YGGGIKTSCF TFVSQAALAA GLVDQLKLDK NLIEVKNTRN LRKKDMIHNS ATPKMEVDPE TYEVRADGQL LTCGAEDVLP MAQRYFLF
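
The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any analysis. The following is a minimal sketch of how one might strip the spacing, compute GC content, and run a basic sanity check on a coding sequence. The helper names are hypothetical, the stop codons are those of translation table 11, and the start-codon check is a simplification that accepts only ATG even though bacteria can use other start codons.

```python
# Stop codons used by translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(raw: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """Simplified check: whole codons, ATG start, and a table 11 stop codon at the end."""
    return len(seq) % 3 == 0 and seq.startswith("ATG") and seq[-3:] in STOP_CODONS


# Toy input in the same block layout as the record (not the real gene sequence).
toy = clean_sequence("ATGGCC TAA")
print(toy)                           # ATGGCCTAA
print(looks_like_complete_cds(toy))  # True
print(gc_fraction("ATGC"))           # 0.5
```

Pasting the full Gene sequence value into `clean_sequence` and passing the result to `gc_fraction` gives a figure to compare against the GC content row of the record.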