Gene Cthe_0328 details

Gene Information

Locus tag: Cthe_0328
Symbol:
ID: 4808477
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 416428
End bp: 418029
Gene Length: 1602 bp
Protein Length: 533 aa
Translation table: 11
GC content: 44%
IMG OID: 640105742
Product: peptide chain release factor 3
Protein accession: YP_001036759
Protein GI: 125972849
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG4108] Peptide chain release factor RF-3
TIGRFAM ID: [TIGR00231] small GTP-binding protein domain
            [TIGR00503] peptide chain release factor 3
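
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that contributes no residue. The function names are illustrative and are not part of any database API.

```python
# Values copied from the Gene Information record.
START_BP = 416428
END_BP = 418029
GENE_LENGTH_BP = 1602
PROTEIN_LENGTH_AA = 533


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_length_aa(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # drop the stop codon


print(span_length(START_BP, END_BP))     # 1602
print(coding_length_aa(GENE_LENGTH_BP))  # 533
```

Under these assumptions both results agree with the Gene Length and Protein Length fields.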


Plasmid Coverage information

Num covering plasmid clones: 36
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAAAATA AAATCAGTTA CGAAGTAGAA AGAAGAAGGA CGTTTGCAAT TATATCCCAC 
CCCGATGCCG GTAAAACAAC TTTAACGGAA AAGCTCCTTC TGTATGGAGG TGCCATAAGA
CTGGCCGGGT CGGTCAAAGC GAGAAAAGCA AACAAATATG CTACATCCGA CTGGATGGAG
ATTGAAAAAC AGAGAGGTAT CTCGGTTACC TCAAGTGTTT TGCAATTTGA ATATAACAAC
TACTGCATTA ATATACTGGA TACTCCGGGC CACCAGGACT TTAGTGAGGA TACCTACCGT
ACCCTTATGG CTGCCGACAG CGCGGTCATG TTAATCGACG GTGCCAAAGG AGTTGAGGAA
CAGACAATCA AGCTCTTTCA TGTTTGCAAA ATGAGGGGTA TTCCCATATT CACTTTTGTA
AACAAAATGG ACAGAGCCAG CAAGGACCCC TTTGAACTCA TGGAAGAGCT GGAGAATGTC
CTCGGGATCC GCTCCTATCC CATGAATTGG CCGATAGGAA CCGACGGAGA TTTTAAGGGA
GTATACAACA GAAAGCTTTC CCAGATAGAA TTGTTTGAAG GCGGCAATCA CGGCCAAACC
GTTGTCTCCT CAATTAAAGG CAGCGTTGAT GACAAGATAT TTGCAGACCT TCTCGGAGAT
CATTACCACA AAAAGCTCTG TGAAGATATA GAGCTTCTTG ATATGGCCGG AGATCCTTTT
GACAAAGAAA AAATATTGAA GGGTGAACTT ACTCCAATGT TTTTCGGAAG TGCCATGACC
AATTTCGGAG TTCAGCCTTT TCTTGAGGAG TTTTTACAGC TGGCACCCAA ACCCGGCATA
AAGATGTCTT CCGAAGGAGA AGTTGACCCT GAATCGGACA AGTTTACTGG ATTTGTGTTT
AAAATCCAGG CGAACATGAA TCCCGCCCAC AGGGACAGAA TAGCCTTCAT CAGAATCTGT
TCCGGACGAT TTACCCGCGG GATGACAGTT TACCACGTTC AACAGGGAAA AGAAGTGCGT
CTTTCCCAGC CACAACAGTT TATGGCGCAG GAACGCACCA TTGTTGAGGA AGCTTATGCC
GGAGATATAA TAGGTGTGTT CGATCCCGGT ATATTCCACA TCGGAGACAC TTTAAGCGAA
GGCAACAGCA ATCTTAGATT TGAAGGAATC CCAATATTCC CGGCCGAACA TTTCGCCAAA
GTTACTCCGG TAGACACCAT GAAACGTAAG CAATTTATAA AAGGTATCAA CCAGCTCTCT
GAGGAAGGTA CAATACAGGT TTTCAAACAA ATTGACATAG GGTTTGAGTC CATTATCGTC
GGAGTTGTCG GAATTCTGCA GCTTGAGGTT TTGGAGTACA GACTCAAGCA CGAGTACGGC
GTGGATTTAA GAATTCAGAA ACTTCCTCAC AGGCATGCCC GCTGGATTGT GTCCAGTGAC
ACGGAGCCCA GAAACTTAAC GTTAACGAGC ACAACAATGA TTGTTCATGA CCAGGATGAA
AGATATGTTC TGTTGTTTGA AAACGAATGG TCCATACGGT GGGCTGAAGA AAAAAATCCT
TCGCTGAAAC TTGAGGACAT TGCAAAAAGA GATTTGCTGT AA
 
Protein sequence
MQNKISYEVE RRRTFAIISH PDAGKTTLTE KLLLYGGAIR LAGSVKARKA NKYATSDWME 
IEKQRGISVT SSVLQFEYNN YCINILDTPG HQDFSEDTYR TLMAADSAVM LIDGAKGVEE
QTIKLFHVCK MRGIPIFTFV NKMDRASKDP FELMEELENV LGIRSYPMNW PIGTDGDFKG
VYNRKLSQIE LFEGGNHGQT VVSSIKGSVD DKIFADLLGD HYHKKLCEDI ELLDMAGDPF
DKEKILKGEL TPMFFGSAMT NFGVQPFLEE FLQLAPKPGI KMSSEGEVDP ESDKFTGFVF
KIQANMNPAH RDRIAFIRIC SGRFTRGMTV YHVQQGKEVR LSQPQQFMAQ ERTIVEEAYA
GDIIGVFDPG IFHIGDTLSE GNSNLRFEGI PIFPAEHFAK VTPVDTMKRK QFIKGINQLS
EEGTIQVFKQ IDIGFESIIV GVVGILQLEV LEYRLKHEYG VDLRIQKLPH RHARWIVSSD
TEPRNLTLTS TTMIVHDQDE RYVLLFENEW SIRWAEEKNP SLKLEDIAKR DLL
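
The gene and protein sequences are printed in space-separated blocks of ten characters. A minimal sketch of how the two relate: strip the whitespace, then translate codon by codon. The codon table is built by hand so the example needs no external library. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in which start codons it permits, which does not matter here because the gene begins with ATG. Only the first line of each sequence is used, and the helper names are illustrative.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, enumerated in TCAG order ("*" marks a stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence from its first base, stopping at a stop codon."""
    dna = clean(dna)
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First display line of the gene and protein sequences in this record.
first_gene_line = "ATGCAAAATA AAATCAGTTA CGAAGTAGAA AGAAGAAGGA CGTTTGCAAT TATATCCCAC"
first_protein_line = "MQNKISYEVE RRRTFAIISH PDAGKTTLTE KLLLYGGAIR LAGSVKARKA NKYATSDWME"

print(translate(first_gene_line))  # MQNKISYEVERRRTFAIISH
```

The 60 bases on the first gene line encode 20 residues, which match the first two blocks of the protein sequence.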