Gene Cthe_2923 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2923 
Symbol 
ID4810206 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp3442610 
End bp3443893 
Gene Length1284 bp 
Protein Length427 aa 
Translation table11 
GC content41% 
IMG OID640108346 
Productprotein translocase subunit secY/sec61 alpha 
Protein accessionYP_001039314 
Protein GI125975404 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG0201] Preprotein translocase subunit SecY 
TIGRFAM ID[TIGR00967] preprotein translocase, SecY subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000208113 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGGTTGT TTACAACTAT AAGAAATGCC TGGAAAATAG CTGATTTGAG AAAAAAGATG 
TTTTTTACAT TACTGATGAT ATTCATATTC AGACTGGGTT CTTTTATACC GGTTCCGGGT
TTGAATCCTG ATGCATTAAA AAGCATGGTG GATCAAGGAA CAATTTTTGG ATTCTTTAAC
ATATTATCCG GTGGAGCGTT TGAAAATGCA ACGATCTTTG CCATGAGTAT AACCCCTTAT
ATCAACGCTT CGATCATAAT ACAGCTTTTA ACAGTGGCAA TCCCCAAACT TGAAGCTCTT
GCAAAAGAAG GAGAAGAAGG TAGAAAAGCT ATTGCAGAAT ATACGAGATA TGGAGCAGTT
GTTCTTGGAT TCCTCCAGGC AACAGCATTT TACTTCGGAT TGGCCCAGGC GGTTAATGAA
AGAAATGTAT TGTCATTTAT TACAATAACT CTTACATTTA CAGCGGGTAC CGCCTTCCTC
ATGTGGCTGG GCGAACAAAT TACGGAATAT GGAATAGGAA ACGGAATATC CTTGCTTATC
TTTGCAGGTA TTGTATCAAG AGGACCCAGG GGAATACTTT ATCTGTGGGA TCTGTACAGG
TTGGAAAGAC TGGGTAAAGG TATCCTTGGA ATTTTTGGAG TACTGGGCGT ATTGCTTCTC
TTCGTAGTAA TTATTGCTTC AGTTGTATGG GTTGATCAGG CTGAGCGCCG TATACCCGTA
CAATATGCAA AACGTGTTGT CGGCAGAAAA ATGTATGGCG GGCAGAGCAC TCATATTCCG
ATTAAGGTTA ATATGGCCGG AGTTTTGCCT ATCATATTTG CCACATCATT TGTTGCACTG
CCTGCAACAA TAGTGGGATT CTTCTTCCCA AACTCAACTC ATCCTGTAGC CGAGTACTTT
AGAAGTTTTC AGAGCAGGAT TGAAGTAGCA ATATTGACCG GTCTTTTGAT TATCTTCTTT
ACGTTTTTCT ATACATTTAT CCAGTTCAAT CCTGTTGAGG TTGCAAACAA TCTTAAGAAA
AACGGCGGGT TCATACCTGG AATAAGACCG GGGAAACCAA CGTCTGACTA TATTTACAAG
GTGGTTAGCA GAATAAGTTG GTTTTCAGCC CTGTTCCTCG CCATAATCCA AATATTGCCT
TCATTATTGC AGGCAATAAC CGGAATCAGA GGAATATGGT TTGCAGGAAC CAGCGTGCTT
ATCCTTGTCG GCGTTGCCCT CGAAACAGTT AAGCAGATTG AGTCACAGAT GATTATGAGA
CACTACAGAG GATTTCTGGA GTAA
 
Protein sequence
MGLFTTIRNA WKIADLRKKM FFTLLMIFIF RLGSFIPVPG LNPDALKSMV DQGTIFGFFN 
ILSGGAFENA TIFAMSITPY INASIIIQLL TVAIPKLEAL AKEGEEGRKA IAEYTRYGAV
VLGFLQATAF YFGLAQAVNE RNVLSFITIT LTFTAGTAFL MWLGEQITEY GIGNGISLLI
FAGIVSRGPR GILYLWDLYR LERLGKGILG IFGVLGVLLL FVVIIASVVW VDQAERRIPV
QYAKRVVGRK MYGGQSTHIP IKVNMAGVLP IIFATSFVAL PATIVGFFFP NSTHPVAEYF
RSFQSRIEVA ILTGLLIIFF TFFYTFIQFN PVEVANNLKK NGGFIPGIRP GKPTSDYIYK
VVSRISWFSA LFLAIIQILP SLLQAITGIR GIWFAGTSVL ILVGVALETV KQIESQMIMR
HYRGFLE