Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_0887 |
Symbol | ureC |
ID | 5103533 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 820729 |
End bp | 822396 |
Gene Length | 1668 bp |
Protein Length | 555 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 640506790 |
Product | urease subunit alpha |
Protein accession | YP_001190983 |
Protein GI | 146303667 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0804] Urea amidohydrolase (urease) alpha subunit |
TIGRFAM ID | [TIGR01792] urease, alpha subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGAAAATTT CAAGGGAGAG ATACGCAGAA CTATACGGAC CAACAGAGGG GGATAAGATC AGACTGGGTG ACACAAACCT AGTTATCACG GTCGAGAAGG ACATGATTAG AAAGGGTGAT GAACTTGTGT TTGGTGCAGG CAAATCCGCC CGTGACGGAT TGGGTCTTCT TCCGACGGTG AAGGAAGAGG AGTCCATGGA TCTCGTTATC ACAAATGTGG TGATAATGGA CCCTTTACTT GGAATAGTTA AAGCCGACAT AGGAATAAAG GACGGAGTCA TCGTGGGGAT AGGTCATGGT GGTAACCCAT TTACCATGGA TGGAGTTGAC TTCGTGCTGG GACCGTCGAC CGAGGTAATT TCTGGAGAGG GGTTAATAGC CACTCCAGGT TTCATAGACA CTCACGTTCA CTGGGTTGCC CCACAGCAGG TATACGATGC GATCTCCGCA GGCTTCACGA CCTTAATTGG CGGAGGTACC GGTCCGGCCG AGGGGACCAA GGCAACCACG GTCACCCCAG GATCTTGGAA CTTGAGAGTG ATATTTTCTG CCCTGGACCA GTATCCCGTA AACTTCGGTC TAACTGCGAA GGCGTCATCA ACGTCAGTTA GCATGGAGCA AGTGCTGAAC CAGGGCGCGT GTGGATTCAA GATTCATGAG GACTGGGGAG CCATGCCGAG GGTAATTGAT GAAACCTTAA CCTTGGCTGA CCAGAGGGAC GTGCAGGTCA CTATTCACAC AGATACATCT AATGAGAGCG GATTCCTCGA GGACACCTTA AGCGCGATTG GCGGTAGGAC TATTCACGCC TATCACGTGG AAGGTGCGGG AGGAGGTCAC GCTCCAGACA TCATTAAAAT TGCAGGAGAA CCCAACATAC TTCCGTCCTC AACTAATCCC ACTAAACCCT TCACAGTCCA CACATATGAG GAACACCTGG AGATGCTCAT GGCAGTTCAT CACCTGAACC CAAAGGTACC TGAGGACGTG TCCTATGCGG AATCCAGAAT CAGGGCTGAA ACCATGGCTG CCGAGGATTA CCTCCACGAT CTTGGGGCAA TAAGCATGAT GTCTTCGGAC TCGCAAGCCA TGGGAAGGAT TGGGGAGACA GGAATTAGGA CATTCCAGCT TGCTCATAAG ATGAAGGAAC TTAACCTGAT CCCCATGCCT GACAATCAAA GGGTCCTGAG ATATCTCGCA AAAATAACCA TAAATCCTGC CATAACCCAC GGTATATCAG AGTACGTAGG ATCCCTGTCC CCTGGAAAGC TGGCAGATAT TGTTCTGTGG GACCCTAGGT TTTTCCCCGC GAAGCCCTAC ATGGTTATCA AGGGCGGAGC CATCTCATGG GCCCTAATGG GTGAAACCAA CGCCTCTATT GCATATGCTC AGCCTGTGCT TTACAAACCC ATGTTCGGAT TTACAGCGCC GGTATCCCTG CTATTTTCCT CACTGGATGG GGTAAACGAA GCGGGGAAAA ATGTCAAGAG GAGAGTGGTA CCAGTAAGAA ATACTAGGAC CATCTCGAAA TCTCACATGA AACTTAACGA TGCTACGCCT GAGATAGAGG TGGACCCTGA CAAATATGAG GTTAAGGTCG ATGGGGTAGT CCCGAAGATC CCGCCTTCTA AGGAATTGCC TCTAACCAGA TTATACTTCC TGTTTTAG
|
Protein sequence | MKISRERYAE LYGPTEGDKI RLGDTNLVIT VEKDMIRKGD ELVFGAGKSA RDGLGLLPTV KEEESMDLVI TNVVIMDPLL GIVKADIGIK DGVIVGIGHG GNPFTMDGVD FVLGPSTEVI SGEGLIATPG FIDTHVHWVA PQQVYDAISA GFTTLIGGGT GPAEGTKATT VTPGSWNLRV IFSALDQYPV NFGLTAKASS TSVSMEQVLN QGACGFKIHE DWGAMPRVID ETLTLADQRD VQVTIHTDTS NESGFLEDTL SAIGGRTIHA YHVEGAGGGH APDIIKIAGE PNILPSSTNP TKPFTVHTYE EHLEMLMAVH HLNPKVPEDV SYAESRIRAE TMAAEDYLHD LGAISMMSSD SQAMGRIGET GIRTFQLAHK MKELNLIPMP DNQRVLRYLA KITINPAITH GISEYVGSLS PGKLADIVLW DPRFFPAKPY MVIKGGAISW ALMGETNASI AYAQPVLYKP MFGFTAPVSL LFSSLDGVNE AGKNVKRRVV PVRNTRTISK SHMKLNDATP EIEVDPDKYE VKVDGVVPKI PPSKELPLTR LYFLF
|
| |