Gene Cthe_1199 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1199 
Symbol 
ID4810152 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1429136 
End bp1430431 
Gene Length1296 bp 
Protein Length431 aa 
Translation table11 
GC content44% 
IMG OID640106622 
Productamidohydrolase 
Protein accessionYP_001037624 
Protein GI125973714 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.242168 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAACATAC TGATAAAGAA TGCCGACATA ATTACCTGCA ATGCGTCAGA TGACGTGTTG 
CAGGGTGCGT TTTTGGGCAT AAAGGATGGA TATATTGATT TTATAGATAC AAAAGAAGAT
GCTTTAAAAG ACTTTAAGGC CGACCGGATT ATTGACGCAA AGGGAAAACT GGTTATGCCG
GGTTTGGTGA ATGCCCACAC CCACAGCGGG ATGACAATAC TCAGGAACTT TGCAAACGAC
CTTGCATTGG AAGACTGGCT TTTCGGCAAT GTACTTCCCG TGGAAGAGAA ACTTACACCG
GAAGACATAT ACTGGGGTAC ATTGCTGGGA ATAGCCGAGA TGATAAAATC AGGCACTACG
ACTTTTGCCG ACATGTATCT TCATATGGAA GAAGTGGCAA GGGCTGTTTC GGAAACGGGC
ATAAGGGCAA ATCTTTGCAG AAGTCCGCTT AAAGACAGCG ATAAAAGTGT GGAAGATGCC
GTTCGGTGTT TTGAATATTT TAAGAAGTGG GACAACAGCT TTAACGGTAG AATAAAAGTG
TACATTGAAG TTCACTCGGT TTATCTTTTT GACGAACCGT CGCTGCGTAT GTCGGCGGAA
GTTGCAAAAG AGATCAACAC AGGAATTCAC ATACATGTGC AGGAGACTTT GAAAGAGTGT
GAGGACAGCA ACAAAAAGTA TGGTATGAGT CCTGCGGAAA TTTGCTGTAA GACCGGCATT
TTTGACGTTC CGGTAATCGC TGCCCACTGT GTGCATTTGT CCGACGGGGA TATGGGTATA
ATCAGGGATA AGGGCGTAAA TGTGATCCAC AACCCCACCA GCAATTTAAA GCTGGGCAGC
GGAATAGCCA AAGTGGATGA TATGCTCAAA AACGGTATCA ATGTGGCTTT GGGAACTGAT
GGTGCCGCAA GCAACAATAA TCTTAACATG TTTGAAGAAA TGCATTTGGC GGCGCTGATA
CACAAAGGGG TTCACATGGA TCCCACATTG ATTGGTGCTT CCTGTGCATT AAAGATGGCA
ACCGTAAACG GAGCAAAGGC ACTTGGGTTT GGAGGCGAGA TTGGAGAAAT TTCAAAGGGA
ATGAAGGCGG ACCTTATCCT TATAGATATG GACAAGACGC ATCTGTGCCC TGTTAACGAC
CCTGTTTCGG CCGTGGTATA CTCCGCGCAA AGCTCGGACG TTGACACGGT AATAATTGAC
GGCAATATTG TGATGGAAAA CAGAGAGCTT AAGACCATAG ATGAGGAAAA AGTAAAATTT
AATGTTAAGG AAATTGCCAA AAGAGTATTG AGATAA
 
Protein sequence
MNILIKNADI ITCNASDDVL QGAFLGIKDG YIDFIDTKED ALKDFKADRI IDAKGKLVMP 
GLVNAHTHSG MTILRNFAND LALEDWLFGN VLPVEEKLTP EDIYWGTLLG IAEMIKSGTT
TFADMYLHME EVARAVSETG IRANLCRSPL KDSDKSVEDA VRCFEYFKKW DNSFNGRIKV
YIEVHSVYLF DEPSLRMSAE VAKEINTGIH IHVQETLKEC EDSNKKYGMS PAEICCKTGI
FDVPVIAAHC VHLSDGDMGI IRDKGVNVIH NPTSNLKLGS GIAKVDDMLK NGINVALGTD
GAASNNNLNM FEEMHLAALI HKGVHMDPTL IGASCALKMA TVNGAKALGF GGEIGEISKG
MKADLILIDM DKTHLCPVND PVSAVVYSAQ SSDVDTVIID GNIVMENREL KTIDEEKVKF
NVKEIAKRVL R