Gene Cthe_1165 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1165 
Symbol 
ID4810833 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1389543 
End bp1390754 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content39% 
IMG OID640106587 
ProductYbbR-like protein 
Protein accessionYP_001037590 
Protein GI125973680 
COG category[S] Function unknown 
COG ID[COG4856] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0430232 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATGAGT TACTGAAGAA GGATTTAACT TTAAAAATAA TCTCTGTCTT TTTCGCCATA 
TTTCTCTGGT TTATTGTTTT GGACAGCTCT AATCCGGTAA CCTGGGTTGA ATTGAATGTG
CCTTTGAAAG TTGAAAATGA AAGTTCACTT AAAGAAAAGG GAATAATGCT TAAGAATGAG
AACTTTCCGA GAAATGTTTC CGTCAGTCTA AAGGGAAGAA AAAGCGCTTT TAACAATATA
GGTTTAAATG ACATTGAGGC AATTGTTGAC CTTTCAAAGG TGGAGGATGT TAATACTCAG
TTTTTATATG TCAATGTTTA TACAAATAAA AAAGGTGTGT CTTTTCAGGG AGTAACACCG
AGAGTTGTGG AAATAGAACT GGAAAAACTG GGTGAAAATC CTTTTCCTGT TAATGTAGTT
ATCACAGGAA AACCGAAGGA AGGCTACACA GTGGTAAAGG CAAATGCAAT ACCGACAACG
GTTTCAATTG AAGCGCCGGA CGAAATAATA AATTCCATCG GTGAAGTCAG GGCTTATGTT
GATGTTGACA ATCTCAGTAA CGATATTATT GTAAACAAGG AATGTGTGGT TTACAACAAA
GAAGGAGAAA AAATAGTTGA GCTGGATAAA AAAATAAGTG TTGACATCAA TATTGAAATC
GCGAAAGAAG TGCCTATAGT ACCGGCCGTA AGGGGGAGAC CGGCAAAAAA TTACACCGAC
GGCATACACA GGGTTGTGCC GGAAAAGGCG TGGATTTCGG GACCTTCTGA CGTCATTGAC
CTTATTGACA ACTTGAAAAC CGAACCTATT GATATTGAAA ATATGTCGCA GAGCATGACC
AAAATTGTAA ATCTCGTTCT GCCGGATGGG GTTCGCCTTG TTGACACTCC AAGAAGTGTT
TATGTGGATG TGGTTATTGA GGAACTGGCA GAAAGGGAAT TTGTCTTTAA CAAGGAAAGC
ATTGCGTTTG ACAATGCAGT AAAAAATAAT TCACTTAAGT ATGAAATTTT GGATGATGAG
ATAAAAATAA CTTTGACCGG TACCAGACAG GAGTTGAACA AGATTTCGCC TGAGAGTCTC
AAGCTTAGCG TTGATGTAGG CGGGCTTTCG GAAGGGGAGT ATAAGAGGCC CCTTAACGTG
GTTATCCCTG ATACTGTGAA TCTTTCCGGA AGCTATGATG TTAAAATCAG TGTGAAAAAA
ACCGGAAGTT AA
 
Protein sequence
MNELLKKDLT LKIISVFFAI FLWFIVLDSS NPVTWVELNV PLKVENESSL KEKGIMLKNE 
NFPRNVSVSL KGRKSAFNNI GLNDIEAIVD LSKVEDVNTQ FLYVNVYTNK KGVSFQGVTP
RVVEIELEKL GENPFPVNVV ITGKPKEGYT VVKANAIPTT VSIEAPDEII NSIGEVRAYV
DVDNLSNDII VNKECVVYNK EGEKIVELDK KISVDINIEI AKEVPIVPAV RGRPAKNYTD
GIHRVVPEKA WISGPSDVID LIDNLKTEPI DIENMSQSMT KIVNLVLPDG VRLVDTPRSV
YVDVVIEELA EREFVFNKES IAFDNAVKNN SLKYEILDDE IKITLTGTRQ ELNKISPESL
KLSVDVGGLS EGEYKRPLNV VIPDTVNLSG SYDVKISVKK TGS