Gene Cthe_0104 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0104 
Symbol 
ID4808728 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp136030 
End bp137127 
Gene Length1098 bp 
Protein Length365 aa 
Translation table11 
GC content44% 
IMG OID640105513 
Product5-amino-6-(5-phosphoribosylamino)uracil reductase / diaminohydroxyphosphoribosylaminopyrimidine deaminase 
Protein accessionYP_001036538 
Protein GI125972628 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0117] Pyrimidine deaminase
[COG1985] Pyrimidine reductase, riboflavin biosynthesis 
TIGRFAM ID[TIGR00227] riboflavin-specific deaminase C-terminal domain
[TIGR00326] riboflavin biosynthesis protein RibD
[TIGR01508] 2,5-diamino-6-hydroxy-4-(5-phosphoribosylamino)pyrimidine 1'-reductase, archaeal 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAGTGAAA AAGAGTTTTT CATGAGTAGA GCCATTAAAC TAGCAAAACT TGGATGGGGA 
AAAACCAATC CCAACCCTTT GGTGGGGGCG GTTGTTGTCA AGGATGGAAA AATAATTGCA
GAAGGTTACC ACAAGCAACT GGGAGGTCCT CATGCCGAAG TGGAGGCTTT TAACAACGCA
AAGGAGGACG TTGCCGGCGG CACTTTGTAT GTCAATTTGG AGCCATGTTC CCATTACGGC
AGGACGCCGC CCTGTGCTCA AAAAATTATT GATGTTGGGA TAAAAAAAGT GGTTGCAGCC
ATAAAGGACC CCAACCCGAA GGTTTCGGGC AGAGGTTTTG AAATGCTTAA AAACGCAGGA
ATTGAGGTTG AAGTCGGGGT ACTTGAGGAA GAGGCCATAA GACTCAACGA AATATTTATC
ACTTACATTG TAAAAAAGAA ACCTTTTGTA ATTTTAAAAA CGGCAATGAC ATTGGACGGC
AAGATAGCAA CAGCTTCGGG AGACTCCAAA TGGGTGACGG GGGAAAAAGC AAGAAATCAT
GTCCATGTGA TAAGAGACAG GGTTGCGGCA GTAATGGTGG GAATTAATAC TGTCCTGAAA
GATAATCCAT ACCTTACCAC AAGACTTCCC GACAAGGAAG GAAGCGATCC TGTGAGGATA
GTTGTGGACA GCAAAGGCTC AATTCCCCTG GACTCAAATG TTATAAATTC CAATTCAAAA
GCCGGAGTTA TACTTGCCAC TACCTCCCGT ATAGACAAGG AAAAGGAAAA AATGCTGATT
GACAAAGGAG TAAAAATCAT TAAGGCGGAC GGCAGTGACG GAAGAGTGGA TTTAAAACTT
CTGATGGATG AACTTTACAA ACTTGAAATT GACAGTGTGC TGCTGGAAGG AGGAGGTACG
CTGAACAGCT CTGCCATTTC CGCGGGAATT GTGGACAAAG TAATGTGTTT TATCTCTCCG
AAGATAGTGG GGGGAGAAAA TGCTCCCACC TCGGTTGAAG GAATCGGAGC ATTAAGAATG
TGTGAAGCCA TAGGAGTAAA AAATATCAGT GTTGTAAAGT TTGGGGAAGA CATACTTCTG
GAAGGATATA TAGAATGA
 
Protein sequence
MSEKEFFMSR AIKLAKLGWG KTNPNPLVGA VVVKDGKIIA EGYHKQLGGP HAEVEAFNNA 
KEDVAGGTLY VNLEPCSHYG RTPPCAQKII DVGIKKVVAA IKDPNPKVSG RGFEMLKNAG
IEVEVGVLEE EAIRLNEIFI TYIVKKKPFV ILKTAMTLDG KIATASGDSK WVTGEKARNH
VHVIRDRVAA VMVGINTVLK DNPYLTTRLP DKEGSDPVRI VVDSKGSIPL DSNVINSNSK
AGVILATTSR IDKEKEKMLI DKGVKIIKAD GSDGRVDLKL LMDELYKLEI DSVLLEGGGT
LNSSAISAGI VDKVMCFISP KIVGGENAPT SVEGIGALRM CEAIGVKNIS VVKFGEDILL
EGYIE