Gene Cthe_1816 details


Gene Information

Locus tag: Cthe_1816
Symbol: ureC
ID: 4809800
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 2150837
End bp: 2152555
Gene Length: 1719 bp
Protein Length: 572 aa
Translation table: 11
GC content: 47%
IMG OID: 640107230
Product: urease subunit alpha
Protein accession: YP_001038230
Protein GI: 125974320
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0804] Urea amidohydrolase (urease) alpha subunit
TIGRFAM ID: [TIGR01792] urease, alpha subunit
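
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only and not part of the record: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, includes_stop: bool = True) -> int:
    """Number of residues encoded by an unspliced CDS of the given length."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    # The stop codon does not contribute a residue.
    return codons - 1 if includes_stop else codons


length_bp = gene_length(2150837, 2152555)
print(length_bp, protein_length(length_bp))
```

Under these assumptions the computed values agree with the Gene Length (1719 bp) and Protein Length (572 aa) fields.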


Plasmid Coverage information

Num covering plasmid clones: 38
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTGTAA AAATAAGCGG CAAAGATTAT GCCGGTATGT ATGGCCCGAC AAAAGGCGAC 
AGGGTGAGGC TGGCAGACAC GGATCTCATT ATTGAGATTG AGGAAGATTA CACGGTTTAT
GGAGATGAGT GCAAATTCGG AGGAGGTAAA TCCATAAGGG ACGGAATGGG CCAGTCTCCT
TCGGCTGCAA GAGATGACAA GGTTTTGGAT TTGGTAATTA CCAATGCCAT AATCTTTGAC
ACATGGGGGA TTGTAAAGGG AGATATAGGT ATAAAAGACG GAAAAATAGC CGGAATCGGG
AAGGCGGGAA ATCCGAAAGT AATGAGCGGC GTGTCGGAGG ATTTAATAAT CGGGGCCTCT
ACCGAAGTTA TTACCGGAGA AGGACTTATT GTGACTCCGG GAGGAATTGA TACACATATA
CATTTTATAT GCCCCCAGCA GATTGAGACC GCATTGTTCA GCGGTATCAC AACAATGATT
GGTGGCGGAA CGGGACCGGC AGACGGAACC AATGCCACCA CTTGCACACC GGGAGCCTTT
AACATCCGGA AAATGTTAGA GGCGGCAGAG GACTTTCCGG TAAATTTAGG TTTTTTGGGG
AAAGGGAATG CTTCTTTTGA GACTCCTCTG ATAGAACAGA TTGAAGCAGG GGCGATTGGC
TTAAAGCTCC ATGAGGATTG GGGAACCACA CCCAAGGCTA TAGATACATG CCTGAAAGTT
GCGGATCTTT TTGATGTACA GGTGGCTATA CATACCGATA CACTGAACGA GGCAGGATTT
GTAGAGAATA CTATAGCGGC TATAGCCGGA AGGACAATTC ACACTTACCA TACCGAGGGA
GCGGGCGGCG GGCACGCACC GGACATAATT AAAATTGCAT CACGCATGAA TGTACTGCCC
TCGTCTACCA ATCCCACCAT GCCTTTTACC GTCAATACAT TGGATGAACA TCTCGATATG
CTTATGGTAT GCCATCATCT TGACAGCAAG GTAAAAGAGG ACGTTGCTTT TGCCGATTCG
AGGATCCGGC CTGAGACAAT AGCCGCAGAA GACATACTGC ACGATATGGG AGTATTCAGC
ATGATGAGTT CCGATTCCCA GGCCATGGGA CGCGTGGGAG AGGTTATTAT AAGGACCTGG
CAGACTGCAC ATAAAATGAA GCTTCAAAGA GGTGCCCTGC CGGGGGAAAA GAGCGGCTGT
GACAATATAA GGGCTAAAAG ATACCTTGCC AAGTATACCA TAAACCCTGC TATAACCCAT
GGAATTTCAC AGTATGTGGG CTCCCTGGAG AAAGGGAAAA TAGCCGACTT GGTCCTCTGG
AAGCCTGCAA TGTTTGGTGT AAAGCCTGAA ATGATTATTA AGGGCGGCTT TATAATAGCC
GGCAGGATGG GCGATGCAAA TGCGTCCATA CCCACACCTC AGCCTGTAAT ATATAAAAAC
ATGTTCGGTG CCTTCGGAAA GGCAAAGTAC GGAACCTGTG TGACTTTTGT TTCAAAGGCT
TCGCTGGAAA ATGGCGTTGT GGAAAAGATG GGGCTTCAAA GAAAAGTGCT TCCGGTCCAG
GGATGCAGGA ATATCTCAAA AAAATATATG GTACACAACA ATGCAACGCC TGAAATTGAA
GTTGATCCTG AAACCTATGA GGTAAAGGTG GACGGTGAGA TTATCACCTG CGAACCATTA
AAGGTCTTAC CCATGGCGCA GAGATATTTC TTGTTTTAA
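
The gene sequence is listed in blocks of 10 bases, 60 per row, so the spaces and line breaks have to be removed before any computation. The following is a minimal sketch of that cleanup, together with a GC-fraction helper of the kind that could be used to check the GC content field. The function names are hypothetical, and the example runs on the first row only, so its result is not the whole-gene value.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked listing and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First row of the listing above, used purely as an illustration.
first_row = "ATGAGTGTAA AAATAAGCGG CAAAGATTAT GCCGGTATGT ATGGCCCGAC AAAAGGCGAC"
seq = clean_sequence(first_row)
print(len(seq), gc_fraction(seq))
```

Passing the full listing to `clean_sequence` and then to `gc_fraction` would give the figure to compare against the GC content field.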
 
Protein sequence
MSVKISGKDY AGMYGPTKGD RVRLADTDLI IEIEEDYTVY GDECKFGGGK SIRDGMGQSP 
SAARDDKVLD LVITNAIIFD TWGIVKGDIG IKDGKIAGIG KAGNPKVMSG VSEDLIIGAS
TEVITGEGLI VTPGGIDTHI HFICPQQIET ALFSGITTMI GGGTGPADGT NATTCTPGAF
NIRKMLEAAE DFPVNLGFLG KGNASFETPL IEQIEAGAIG LKLHEDWGTT PKAIDTCLKV
ADLFDVQVAI HTDTLNEAGF VENTIAAIAG RTIHTYHTEG AGGGHAPDII KIASRMNVLP
SSTNPTMPFT VNTLDEHLDM LMVCHHLDSK VKEDVAFADS RIRPETIAAE DILHDMGVFS
MMSSDSQAMG RVGEVIIRTW QTAHKMKLQR GALPGEKSGC DNIRAKRYLA KYTINPAITH
GISQYVGSLE KGKIADLVLW KPAMFGVKPE MIIKGGFIIA GRMGDANASI PTPQPVIYKN
MFGAFGKAKY GTCVTFVSKA SLENGVVEKM GLQRKVLPVQ GCRNISKKYM VHNNATPEIE
VDPETYEVKV DGEIITCEPL KVLPMAQRYF LF
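
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is an illustration rather than the database's own method. It builds the codon table from the standard NCBI base ordering (T, C, A, G), which gives the same amino acid assignments as translation table 11. Table 11 also allows alternative start codons, which this sketch ignores; that should not matter here because the gene starts with ATG. The names are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order for codons TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First row and final codons of the gene sequence above.
print(translate("ATGAGTGTAA AAATAAGCGG CAAAGATTAT GCCGGTATGT ATGGCCCGAC AAAAGGCGAC"))
print(translate("TTGTTTTAA"))
```

The first call reproduces the opening 20 residues of the protein listing and the second reproduces its final two residues, LF, followed by the TAA stop.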