Gene Msed_0887 details


Gene Information

Locus tag: Msed_0887
Symbol: ureC
ID: 5103533
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Metallosphaera sedula DSM 5348
Kingdom: Archaea
Replicon accession: NC_009440
Strand:
Start bp: 820729
End bp: 822396
Gene Length: 1668 bp
Protein Length: 555 aa
Translation table: 11
GC content: 50%
IMG OID: 640506790
Product: urease subunit alpha
Protein accession: YP_001190983
Protein GI: 146303667
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0804] Urea amidohydrolase (urease) alpha subunit
TIGRFAM ID: [TIGR01792] urease, alpha subunit
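
The start and end coordinates, the gene length, and the protein length listed above are mutually consistent. The following is a minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the coding sequence ends in a single untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 820729
end_bp = 822396

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values agree with the listed gene length (1668 bp) and protein length (555 aa).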


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGAAAATTT CAAGGGAGAG ATACGCAGAA CTATACGGAC CAACAGAGGG GGATAAGATC 
AGACTGGGTG ACACAAACCT AGTTATCACG GTCGAGAAGG ACATGATTAG AAAGGGTGAT
GAACTTGTGT TTGGTGCAGG CAAATCCGCC CGTGACGGAT TGGGTCTTCT TCCGACGGTG
AAGGAAGAGG AGTCCATGGA TCTCGTTATC ACAAATGTGG TGATAATGGA CCCTTTACTT
GGAATAGTTA AAGCCGACAT AGGAATAAAG GACGGAGTCA TCGTGGGGAT AGGTCATGGT
GGTAACCCAT TTACCATGGA TGGAGTTGAC TTCGTGCTGG GACCGTCGAC CGAGGTAATT
TCTGGAGAGG GGTTAATAGC CACTCCAGGT TTCATAGACA CTCACGTTCA CTGGGTTGCC
CCACAGCAGG TATACGATGC GATCTCCGCA GGCTTCACGA CCTTAATTGG CGGAGGTACC
GGTCCGGCCG AGGGGACCAA GGCAACCACG GTCACCCCAG GATCTTGGAA CTTGAGAGTG
ATATTTTCTG CCCTGGACCA GTATCCCGTA AACTTCGGTC TAACTGCGAA GGCGTCATCA
ACGTCAGTTA GCATGGAGCA AGTGCTGAAC CAGGGCGCGT GTGGATTCAA GATTCATGAG
GACTGGGGAG CCATGCCGAG GGTAATTGAT GAAACCTTAA CCTTGGCTGA CCAGAGGGAC
GTGCAGGTCA CTATTCACAC AGATACATCT AATGAGAGCG GATTCCTCGA GGACACCTTA
AGCGCGATTG GCGGTAGGAC TATTCACGCC TATCACGTGG AAGGTGCGGG AGGAGGTCAC
GCTCCAGACA TCATTAAAAT TGCAGGAGAA CCCAACATAC TTCCGTCCTC AACTAATCCC
ACTAAACCCT TCACAGTCCA CACATATGAG GAACACCTGG AGATGCTCAT GGCAGTTCAT
CACCTGAACC CAAAGGTACC TGAGGACGTG TCCTATGCGG AATCCAGAAT CAGGGCTGAA
ACCATGGCTG CCGAGGATTA CCTCCACGAT CTTGGGGCAA TAAGCATGAT GTCTTCGGAC
TCGCAAGCCA TGGGAAGGAT TGGGGAGACA GGAATTAGGA CATTCCAGCT TGCTCATAAG
ATGAAGGAAC TTAACCTGAT CCCCATGCCT GACAATCAAA GGGTCCTGAG ATATCTCGCA
AAAATAACCA TAAATCCTGC CATAACCCAC GGTATATCAG AGTACGTAGG ATCCCTGTCC
CCTGGAAAGC TGGCAGATAT TGTTCTGTGG GACCCTAGGT TTTTCCCCGC GAAGCCCTAC
ATGGTTATCA AGGGCGGAGC CATCTCATGG GCCCTAATGG GTGAAACCAA CGCCTCTATT
GCATATGCTC AGCCTGTGCT TTACAAACCC ATGTTCGGAT TTACAGCGCC GGTATCCCTG
CTATTTTCCT CACTGGATGG GGTAAACGAA GCGGGGAAAA ATGTCAAGAG GAGAGTGGTA
CCAGTAAGAA ATACTAGGAC CATCTCGAAA TCTCACATGA AACTTAACGA TGCTACGCCT
GAGATAGAGG TGGACCCTGA CAAATATGAG GTTAAGGTCG ATGGGGTAGT CCCGAAGATC
CCGCCTTCTA AGGAATTGCC TCTAACCAGA TTATACTTCC TGTTTTAG
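
The gene sequence is printed in blocks of ten bases, so it needs to be joined into one string before any analysis. Below is a minimal sketch of that cleanup together with a GC-fraction calculation of the kind summarized in the GC content field. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first listed line is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence listed above (60 bases in six blocks).
first_line = clean_sequence(
    "TTGAAAATTT CAAGGGAGAG ATACGCAGAA CTATACGGAC CAACAGAGGG GGATAAGATC"
)
print(len(first_line), round(gc_fraction(first_line), 2))
```

Passing the full listing through `clean_sequence` and then `gc_fraction` would give the whole-gene value to compare against the rounded figure in the record.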
 
Protein sequence
MKISRERYAE LYGPTEGDKI RLGDTNLVIT VEKDMIRKGD ELVFGAGKSA RDGLGLLPTV 
KEEESMDLVI TNVVIMDPLL GIVKADIGIK DGVIVGIGHG GNPFTMDGVD FVLGPSTEVI
SGEGLIATPG FIDTHVHWVA PQQVYDAISA GFTTLIGGGT GPAEGTKATT VTPGSWNLRV
IFSALDQYPV NFGLTAKASS TSVSMEQVLN QGACGFKIHE DWGAMPRVID ETLTLADQRD
VQVTIHTDTS NESGFLEDTL SAIGGRTIHA YHVEGAGGGH APDIIKIAGE PNILPSSTNP
TKPFTVHTYE EHLEMLMAVH HLNPKVPEDV SYAESRIRAE TMAAEDYLHD LGAISMMSSD
SQAMGRIGET GIRTFQLAHK MKELNLIPMP DNQRVLRYLA KITINPAITH GISEYVGSLS
PGKLADIVLW DPRFFPAKPY MVIKGGAISW ALMGETNASI AYAQPVLYKP MFGFTAPVSL
LFSSLDGVNE AGKNVKRRVV PVRNTRTISK SHMKLNDATP EIEVDPDKYE VKVDGVVPKI
PPSKELPLTR LYFLF
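
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this under two assumptions: the amino acid assignments of translation table 11 match the standard genetic code, and the first codon (here TTG) is an initiator that is read as methionine. The function name `translate_cds` is hypothetical, and the example translates only the first 30 bases.

```python
from itertools import product

# Standard codon assignments in TCAG order; translation table 11 uses the
# same amino acid assignments and differs in its set of start codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(seq: str) -> str:
    """Translate a coding sequence, reading the first codon as Met."""
    seq = "".join(seq.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = [CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3)]
    if residues:
        # Assumption: the first codon is a valid initiator (e.g. TTG).
        residues[0] = "M"
    protein = "".join(residues)
    # Drop a single trailing stop codon, if present.
    return protein[:-1] if protein.endswith("*") else protein


# First 30 bases of the gene sequence listed above.
print(translate_cds("TTGAAAATTT CAAGGGAGAG ATACGCAGAA"))
```

The result for these 30 bases can be compared with the first ten residues of the protein sequence, MKISRERYAE.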