Gene Cthe_1777 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1777 
Symbol 
ID4810022 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2098540 
End bp2099742 
Gene Length1203 bp 
Protein Length400 aa 
Translation table11 
GC content46% 
IMG OID640107191 
Productamidohydrolase 
Protein accessionYP_001038191 
Protein GI125974281 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1228] Imidazolonepropionase and related amidohydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000349054 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTGCTGA TAAGAAACGG TAAAATTTTA ACCATGGCCG GAGTAAATTA TGAAAACGGA 
TATATTTTGA TTGATGCGGG GAAAATAGTT GAAGTGGGAG AATATCCTGC CGCATTTAAT
CAGGAAGTTT TGAATTCCGG TGATTTGGAA GTTATTGATG CAAAGAATAA ATACATACTT
CCTGGGCTTA TTGATGCGCA CTGTCATGTG GGAATGTGGG AAGATTCCGT TGGCTTTGAA
GGGGATGACG GCAATGAAGC AACAGATCCT GTTACTCCTC ATCTTAGGGC TATAGATGCG
GTGTATTATT TGGACCGAGC ATTTGAGGAG GCGCGGGAAA ACGGAGTTAC CACCGTGGTT
ACAGGGCCGG GGAGCGCCAA TGTGATAGGT GGACAGTTTG TTGCCTTGAA AACTTACGGA
AGACGAATAG AGGAAATGGT GGTAAAAGAC CCTGTAGCCA TGAAAGTGGC CTTTGGAGAA
AACCCAAAGA CAGTGTACAA TGAAAGAAAA ACGGCGCCTA CAACCCGTAT GGCCACTGCG
GCCATTCTCA GGGAAAACCT GATGAAAGCC AAAGAGTACA AGGAATTGAT GGATGAGTAC
AACAAAAATC CGGAAGAAAA CGACAAACCG GAATATGATA TGAAAATGGA AGCTCTGCTG
AAGGTTTTAA ACAGGGAAAT TCCGATAAAA GCACATGCGC ACAGGGCGGA TGACATCCTT
ACCGCCATAA GGATAGCAAA GGAATTTGGG CTAAGGCTTA CAATAGAGCA TTGCACCGAA
GGCCATCTTA TAAAGGACAT TCTTGCAGAG GAAGGAGTTT CGGCAATTGT GGGGTCGTCA
CTTACCGACA GGTCAAAAGT GGAGCTTCGG AACCTCAGTT TGAAAACACC TGGAATTTTG
GCGAAGGCGG GAGTCAAGGT GGCCATAATG ACGGACCATC CATGTACTCC GATACAGTAT
TTGATACTGT GTGCGGCTAT GGCGGTAAGA GAGGGTATGG ACGAAATGGA GGCCCTCAGG
GCAGTTACCA TAAATGCCGC CGAACTTACA GGAATAGCCG ACCGGGTGGG AAGCATAGAA
GTGGGGAAGG ATGCGGACAT TGCCATCTAT GACGGTCATC CCTTTGACAT AAGGTCTAAA
GTTTCCACAA CCATTATTAA CGGAAAGGTT GTTTACGAGA GGAAGAAACA TGAAAGAGAT
TAG
 
Protein sequence
MLLIRNGKIL TMAGVNYENG YILIDAGKIV EVGEYPAAFN QEVLNSGDLE VIDAKNKYIL 
PGLIDAHCHV GMWEDSVGFE GDDGNEATDP VTPHLRAIDA VYYLDRAFEE ARENGVTTVV
TGPGSANVIG GQFVALKTYG RRIEEMVVKD PVAMKVAFGE NPKTVYNERK TAPTTRMATA
AILRENLMKA KEYKELMDEY NKNPEENDKP EYDMKMEALL KVLNREIPIK AHAHRADDIL
TAIRIAKEFG LRLTIEHCTE GHLIKDILAE EGVSAIVGSS LTDRSKVELR NLSLKTPGIL
AKAGVKVAIM TDHPCTPIQY LILCAAMAVR EGMDEMEALR AVTINAAELT GIADRVGSIE
VGKDADIAIY DGHPFDIRSK VSTTIINGKV VYERKKHERD