Gene Cthe_1764 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1764 
Symbol 
ID4810008 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2086012 
End bp2087100 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content38% 
IMG OID640107177 
ProductOuter membrane protein-like protein 
Protein accessionYP_001038178 
Protein GI125974268 
COG category[M] Cell wall/membrane/envelope biogenesis
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG1538] Outer membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.00913626 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGAAGA GGATTTTGGC TTTGTTTTTG ACTGTGCCTT TGCTGTTTAC CGTTACAACG 
GTTCTGGCAG CGGGAAAAAA TCTCATACTT AGTATAGAAG GTTTTCAAAA ATATGCTGTC
GAAAACAGCA AACAGGCTGT TCTTGATGAT TTGGAAATAA AGACGAAAGA AAGCGCCCTT
GAAGATGCAA AAGAAGATGC GGAGTTTCTG GCCCCGGCCG GCACGAAAAG TGAAAGGTAC
AATAATGATA TAAAGAAGGA AGTTTCCCCC CTTGAAGCGG AGGCAAATCT TGAGTATGCC
AAAAGAGTCA AAGAAAAGAA TATTGACGAT TTGAAACTGG ATGTCTACAA AGCGGCTCTT
GACAAGCTGC TTGTTGAAAA GGAGCTTGAT ACGGAAAAAG CAAAGCTGGA CATCCTGAGT
GAAAAATACT CCATGGCTGA GGCCAGATAC AAAGAGGGAA AGATTACCGA AAATGACCTC
AATGACGCAA AATATGCTTT GGATGCCAAA AAGATTGATG TGCAGAGAGT TGAAAAAAGT
CTTAAGACAG CGGAGCTTGA ACTTAAAAGA CTGTTGAGTC TTGAACTTGA CGACAAGGAT
TTAAAAGTTG ATGAAAAACT TACACTGGCA AAGTGGAAGG ATGTAAATCT TGAAAAAGTT
ATTAAAGACG CTATTGAAAA AAACATAGAT ATATATAAGA AAGCTGAAGA CCTTAAAGCA
AAAGAAAAGA TCATGGAGAT TACGGAAAGG TATTATACCG ACAGCAATTC AATATATAAT
GAAAACAAGA CAAATCTTGA GCTGGCGAAA GTGGAACTGG AAGATGCAAA AATAAATCTG
GAAGTGGAAA TCAGAAACAA ATACAATGAT TTGCTTACGG CGAAGGACAA TGTGGAACTT
GCTTCCAAGT GGGAAAATAT TCAGAAAAAG AAGCTTGAAA ATGTGGAGCT TAAGTTTAAA
AACGGCCTCG TAAGCAAGGA AGAGGTTCTA AGTCAAAAGG AAAAATATCT GGACGCACAG
TATCAGATGT TTTCAGCGAT TCGTGATTTT AATGTTTTAA AGGCTGAATT TGAAGCGTTG
TACAATTAA
 
Protein sequence
MKKRILALFL TVPLLFTVTT VLAAGKNLIL SIEGFQKYAV ENSKQAVLDD LEIKTKESAL 
EDAKEDAEFL APAGTKSERY NNDIKKEVSP LEAEANLEYA KRVKEKNIDD LKLDVYKAAL
DKLLVEKELD TEKAKLDILS EKYSMAEARY KEGKITENDL NDAKYALDAK KIDVQRVEKS
LKTAELELKR LLSLELDDKD LKVDEKLTLA KWKDVNLEKV IKDAIEKNID IYKKAEDLKA
KEKIMEITER YYTDSNSIYN ENKTNLELAK VELEDAKINL EVEIRNKYND LLTAKDNVEL
ASKWENIQKK KLENVELKFK NGLVSKEEVL SQKEKYLDAQ YQMFSAIRDF NVLKAEFEAL
YN