Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1816 |
Symbol | ureC |
ID | 4809800 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2150837 |
End bp | 2152555 |
Gene Length | 1719 bp |
Protein Length | 572 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 640107230 |
Product | urease subunit alpha |
Protein accession | YP_001038230 |
Protein GI | 125974320 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0804] Urea amidohydrolase (urease) alpha subunit |
TIGRFAM ID | [TIGR01792] urease, alpha subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 38 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTGTAA AAATAAGCGG CAAAGATTAT GCCGGTATGT ATGGCCCGAC AAAAGGCGAC AGGGTGAGGC TGGCAGACAC GGATCTCATT ATTGAGATTG AGGAAGATTA CACGGTTTAT GGAGATGAGT GCAAATTCGG AGGAGGTAAA TCCATAAGGG ACGGAATGGG CCAGTCTCCT TCGGCTGCAA GAGATGACAA GGTTTTGGAT TTGGTAATTA CCAATGCCAT AATCTTTGAC ACATGGGGGA TTGTAAAGGG AGATATAGGT ATAAAAGACG GAAAAATAGC CGGAATCGGG AAGGCGGGAA ATCCGAAAGT AATGAGCGGC GTGTCGGAGG ATTTAATAAT CGGGGCCTCT ACCGAAGTTA TTACCGGAGA AGGACTTATT GTGACTCCGG GAGGAATTGA TACACATATA CATTTTATAT GCCCCCAGCA GATTGAGACC GCATTGTTCA GCGGTATCAC AACAATGATT GGTGGCGGAA CGGGACCGGC AGACGGAACC AATGCCACCA CTTGCACACC GGGAGCCTTT AACATCCGGA AAATGTTAGA GGCGGCAGAG GACTTTCCGG TAAATTTAGG TTTTTTGGGG AAAGGGAATG CTTCTTTTGA GACTCCTCTG ATAGAACAGA TTGAAGCAGG GGCGATTGGC TTAAAGCTCC ATGAGGATTG GGGAACCACA CCCAAGGCTA TAGATACATG CCTGAAAGTT GCGGATCTTT TTGATGTACA GGTGGCTATA CATACCGATA CACTGAACGA GGCAGGATTT GTAGAGAATA CTATAGCGGC TATAGCCGGA AGGACAATTC ACACTTACCA TACCGAGGGA GCGGGCGGCG GGCACGCACC GGACATAATT AAAATTGCAT CACGCATGAA TGTACTGCCC TCGTCTACCA ATCCCACCAT GCCTTTTACC GTCAATACAT TGGATGAACA TCTCGATATG CTTATGGTAT GCCATCATCT TGACAGCAAG GTAAAAGAGG ACGTTGCTTT TGCCGATTCG AGGATCCGGC CTGAGACAAT AGCCGCAGAA GACATACTGC ACGATATGGG AGTATTCAGC ATGATGAGTT CCGATTCCCA GGCCATGGGA CGCGTGGGAG AGGTTATTAT AAGGACCTGG CAGACTGCAC ATAAAATGAA GCTTCAAAGA GGTGCCCTGC CGGGGGAAAA GAGCGGCTGT GACAATATAA GGGCTAAAAG ATACCTTGCC AAGTATACCA TAAACCCTGC TATAACCCAT GGAATTTCAC AGTATGTGGG CTCCCTGGAG AAAGGGAAAA TAGCCGACTT GGTCCTCTGG AAGCCTGCAA TGTTTGGTGT AAAGCCTGAA ATGATTATTA AGGGCGGCTT TATAATAGCC GGCAGGATGG GCGATGCAAA TGCGTCCATA CCCACACCTC AGCCTGTAAT ATATAAAAAC ATGTTCGGTG CCTTCGGAAA GGCAAAGTAC GGAACCTGTG TGACTTTTGT TTCAAAGGCT TCGCTGGAAA ATGGCGTTGT GGAAAAGATG GGGCTTCAAA GAAAAGTGCT TCCGGTCCAG GGATGCAGGA ATATCTCAAA AAAATATATG GTACACAACA ATGCAACGCC TGAAATTGAA GTTGATCCTG AAACCTATGA GGTAAAGGTG GACGGTGAGA TTATCACCTG CGAACCATTA AAGGTCTTAC CCATGGCGCA GAGATATTTC TTGTTTTAA
|
Protein sequence | MSVKISGKDY AGMYGPTKGD RVRLADTDLI IEIEEDYTVY GDECKFGGGK SIRDGMGQSP SAARDDKVLD LVITNAIIFD TWGIVKGDIG IKDGKIAGIG KAGNPKVMSG VSEDLIIGAS TEVITGEGLI VTPGGIDTHI HFICPQQIET ALFSGITTMI GGGTGPADGT NATTCTPGAF NIRKMLEAAE DFPVNLGFLG KGNASFETPL IEQIEAGAIG LKLHEDWGTT PKAIDTCLKV ADLFDVQVAI HTDTLNEAGF VENTIAAIAG RTIHTYHTEG AGGGHAPDII KIASRMNVLP SSTNPTMPFT VNTLDEHLDM LMVCHHLDSK VKEDVAFADS RIRPETIAAE DILHDMGVFS MMSSDSQAMG RVGEVIIRTW QTAHKMKLQR GALPGEKSGC DNIRAKRYLA KYTINPAITH GISQYVGSLE KGKIADLVLW KPAMFGVKPE MIIKGGFIIA GRMGDANASI PTPQPVIYKN MFGAFGKAKY GTCVTFVSKA SLENGVVEKM GLQRKVLPVQ GCRNISKKYM VHNNATPEIE VDPETYEVKV DGEIITCEPL KVLPMAQRYF LF
|
| |