Gene Cthe_1765 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1765 
Symbol 
ID4810009 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2087115 
End bp2088266 
Gene Length1152 bp 
Protein Length383 aa 
Translation table11 
GC content41% 
IMG OID640107178 
Producthypothetical protein 
Protein accessionYP_001038179 
Protein GI125974269 
COG category[M] Cell wall/membrane/envelope biogenesis
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG1538] Outer membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000291236 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAAAAAAG TCCTGTCCGC AATAATCATT ATTTTTATGT TAGCTTTCAT GCAGGCCTAT 
GCGGCTAATG ACAGCAGGCT GTCCTATGAC ACGGCAAAAG AAGTCATGCT GAAAAACAGC
AGGGCCGTGG CAAAGCAGAA GCTGTCGGAA AGGAAAGCCT TTTATCAGTA CAACGGTGTG
GTGCAGCGTA CCCGGGGAAT TGAGACGGAG ATGACTGTCA TAGATACTCC TATGGGAAAA
TACTATTATG TTTATCCTCC GAATATACAG GTGCTTCTGA CCAAACAGGC TGAACTTCTG
CCCCTTCAGA TGAGATATTA CTGGAGAATG GCCGACAACG GCAGGATTGT TACCGAAAAG
GCACTTTCAC TGGGGCTTCG GGATTTGTAC CTCGGATTTA TGAAATCCGA CATGGATTAT
CGGCTGAGTT TGGAGAAGCT CGAGCTTCAG GAAAAGAAAT ACAATGCCGC AAAACTTAAA
GCTGAAAAAG GGCTGATTTC AGGGATTGAA CTTGAAGAAG CGGAGTATGA TTACCTGAAA
GCAAAAAAAG ATGTTGAAAA ATATAAAAGA AGCCGGGAGA ACATGCAAAG GAGCATAAAC
TCCTACATAG GTGTGCCAAT TGACACGGCC TATGATAAAG TATTATTTTC CGAATATACA
AGAAATCTTG TGGTAAAACC TTTGGAATAT TATACCGAGG CGGCATTGGA GAACAGGCTT
GAGATAATTT CCGTAGCGGA GGAGATAAAG ACAAAAGAAA AGCATTTGGA GATCCTTGAA
ATAGGCAGGG CAAAGGATAT ATATCCTGAC ATACGCAAAG AGTATGAAGA TGTTTTGCTG
GAAATAGAGA CTTTGAAAGT CAAGCTTGAG AAAGCCAGGT ATGATGTTGA AAACAACATA
AAGTCTGCGT ACATAGATGT AATAAAAGAA AAAGACAACA TGGATAATTT GATGGCAACG
TTGAATATGC AAAAAAGGAA TTTTGAAAGA CTTAAAGCCC GGTATGAGCA GGGTTTCATA
CCTGAAACGG TAATTGAAGA AATGGAGCTT GCAATCGAGG AATTGCAAAA CGGAGTTAAT
CTTACGGTTT ACAACTATAA TACGAAAAAA ATGAAGCTTG AAGAGGCGGC GGGCTTGGGT
CCGGCGTATT AG
 
Protein sequence
MKKVLSAIII IFMLAFMQAY AANDSRLSYD TAKEVMLKNS RAVAKQKLSE RKAFYQYNGV 
VQRTRGIETE MTVIDTPMGK YYYVYPPNIQ VLLTKQAELL PLQMRYYWRM ADNGRIVTEK
ALSLGLRDLY LGFMKSDMDY RLSLEKLELQ EKKYNAAKLK AEKGLISGIE LEEAEYDYLK
AKKDVEKYKR SRENMQRSIN SYIGVPIDTA YDKVLFSEYT RNLVVKPLEY YTEAALENRL
EIISVAEEIK TKEKHLEILE IGRAKDIYPD IRKEYEDVLL EIETLKVKLE KARYDVENNI
KSAYIDVIKE KDNMDNLMAT LNMQKRNFER LKARYEQGFI PETVIEEMEL AIEELQNGVN
LTVYNYNTKK MKLEEAAGLG PAY