Gene Cthe_1274 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1274 
Symbol 
ID4809779 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1550424 
End bp1551731 
Gene Length1308 bp 
Protein Length435 aa 
Translation table11 
GC content40% 
IMG OID640106697 
Productnucleoside recognition 
Protein accessionYP_001037699 
Protein GI125973789 
COG category[S] Function unknown 
COG ID[COG3314] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR02871] sporulation integral membrane protein YlbJ 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.174951 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACCTTT ACAGGCTGAC AATAATCATA CTTATCGCAT TGGTATTGGC CATTAATATA 
AAATCCATAA AAACCATAAA GGTTATTTAC CTTAAATCCC TCGTGCTTCC TTTAGTGTGC
ATAACTTTCA TCCTTATGCT CATTATTTTT TCCGACACCG CGGTAAAATC CGCCGGCAGC
GGGCTTAACC TGTGGTTTAA TGTTGTATTT CCTTCCCTCT TCCCCTTTTT TGTTGCATCC
GAAATCCTTT ACAGGACAGG GTTTATTAAA GCCATAGGAA TACTTTTGGA ACCCATAATG
CGTCCTCTTT TCAATGTGCC CGGCTGCGGC TCCTTTGCTT TTGCCATGGG AATAACCAGC
GGTTATCCCG TCGGTGCCAA AATCACCGCA AGCATGAGGG AAGAAAAACT CCTTAGCAAA
ACAGAATCCG AAAGGCTTTT GTCTTTCACC AACAACTCAG GCCCCCTCTT TATTATCGGC
GCCGTTGCCG TAGGCATGTT CAAAATGCCT GAGCTTGGAC TTCTGCTTTT AGCCTGTCAC
ATCCTTGCAA GCATCACCGT GGGAATTCTT TTTCGCTTCT ATGGCAGAAA CAATAAGAAA
ATCAAGATGA AAGACGACAA AAATCTCTGG AGAAGATTTA AAAAAGAATT GATTTATACC
TGCAAACAAG AATTAAACCC CGGAACAATG CTGGGAGAAG CCATAAGAAA CTCCGTTAAC
GTGCTGCTTT CCATTGGAGG ATTCATTACT CTTTTTTCAG TTATTATTAA TATTCTGATT
GAAATCGGGT TTATATCCTG CCTGGCGTCT TTTATTTCGC CGTTTCTGTC ACCCTTTGGA
ATAAGCAGAG AAATAGTTTT GGCGGTATTA AGTGGTTTTT TTGAAATGAC AACAGGAACA
AACATGGCAA GCAAAGCGGC AAACGCAACC CTCCAGGGAC AACTTGCAGC GGTGAGCCTG
TTGCTCGGCT GGGCTGGCCT TTCTGTGCAT TTTCAGGTTT ACAGCATTAT AAGCCACACC
GATATAAGCA TAAAGCCTTA TTTATTTGGT AAAATGCTTC AGGGAGTGTT TGCAGCAATT
TATATATCAA TAGCAATGAA ATTACCGTTT ACGGCTTCTT TGACAGCAAA AAGCGTTCTT
AGTGTTATAA CACCTTTTTC AGACTTTACA TGGTACAATG CCTTCATATA TTCGGCTCAG
AATGTGTTTA TTTCATTTTT GATCCTTTTG ATTTTGACGG CAATATCACT TATATTTCAT
TTTATAAAAC ACGTATGCAA GACTCTTTTG AAACGTTCCG TATTTTAA
 
Protein sequence
MNLYRLTIII LIALVLAINI KSIKTIKVIY LKSLVLPLVC ITFILMLIIF SDTAVKSAGS 
GLNLWFNVVF PSLFPFFVAS EILYRTGFIK AIGILLEPIM RPLFNVPGCG SFAFAMGITS
GYPVGAKITA SMREEKLLSK TESERLLSFT NNSGPLFIIG AVAVGMFKMP ELGLLLLACH
ILASITVGIL FRFYGRNNKK IKMKDDKNLW RRFKKELIYT CKQELNPGTM LGEAIRNSVN
VLLSIGGFIT LFSVIINILI EIGFISCLAS FISPFLSPFG ISREIVLAVL SGFFEMTTGT
NMASKAANAT LQGQLAAVSL LLGWAGLSVH FQVYSIISHT DISIKPYLFG KMLQGVFAAI
YISIAMKLPF TASLTAKSVL SVITPFSDFT WYNAFIYSAQ NVFISFLILL ILTAISLIFH
FIKHVCKTLL KRSVF