Gene Cthe_0853 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0853 
Symbol 
ID4810471 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1027899 
End bp1029584 
Gene Length1686 bp 
Protein Length561 aa 
Translation table11 
GC content40% 
IMG OID640106269 
Producttype II secretion system protein E 
Protein accessionYP_001037280 
Protein GI125973370 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG2804] Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000472298 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAAAAAC AAAAAAGAAA AGGTCTTGGC GACATTTTAG TGGAAGCCGG GCTTATTTCA 
AAAGAGCAGC TGGATAAAGC TTTAAAACTT CAGAAAAAAA CAGGCCAAAA ACTTGGAGTT
TTGCTGGTTT CTGAAGGAAT TGTGACCCAG GAGGACATAA TGAGGGTCCT GGAGGAAAAA
ATAGGTGTTT TACGTGTGGC ATTGGAAGAA TGCAACATTG ATCCCGCTGT TTGCAGCTTA
ATCCCCGAAA AACTTGCCAG AAGGTATGAA TTGATTCCTA TAGCACAAAA AGACGGAGTT
CTTAGGGTTG CCATGAGCGA TCCTTTAAAT GTTTTTGCCA TTGATGATAT TGAGGATTAT
ACAGGTATGA GAGTTGAGCC TGTAGTTGAT TTTGCGTCGT CAATAAAAAA TGCCATTGAC
AAATATTACA GAACACAGCA TGTTTTGGTG GAGCCTGTAA AGGAAAAAGG AATTTTATTT
AAAATTGATG AGGAAACAAT AGAGCTTGAA AGCGTTGAGG CGGAAAATGA ATCTGCCTCA
ATGCTTTTAA ATTCCATAAT AGAGCAGGCG ATAAGAAACG GGTCCGGAGA TATACATATT
GAACCTTTGC AAAATGCATT AAAAATAAGG TTTAGAACCG ACGGACAAAT GCATGAGGTC
ATGAGAACGG AAATTGGCAT GCTAAATGGT GTTTTGGCAA AGATAAAGGC AATTTGCGGT
ATGAATATGA ACGAAAAGGC AGTTCCGCAG GACGGCAGGG TGAAGGTAAG TCTGGACGGA
AGAGATTACA ATCTTAAGGT GTCGATTCTT CCGACCGTTT TTGGAGAGAA AATTGCAATC
CGTATTGTTC ATAAAAAGAC TTCCGTCATT CCAAAAGAGC AGCTGGGAAT TTGTCAGGAG
GACCTCGTAA AATTTGAGAG AATGATAAAA AGTCCTAAAG GATTGGTTTT GATAACAGGT
CCTGAAGGAA GCGGCAAAAC CACAACTTTG TATTCCGCCG TAAGTGAAAT CAACAGTCCG
AATGTACATA TAATTACCAT TGAAGACCCT GTTGAATACG TTATTGAAGG AGTAAACCAG
GTACAGGTCA ACATGAAGAC AGGCCTGACT TATGAAAAAG GTCTAAGTTC AATTTTAGAA
CAGGGACCGG ATGTAATTGT CATTGGGGAC ATAAAGGATG CGAAAACGGC TGAAATAGCT
GTAAAGGCGG CAATGGGAGG GCATCTTGTA CTTGGAGCTT TTTGTGCCAA TGATACTTTG
GACGCAGTGT TAACTCTTGT GGAAATGGGA ATAGATCCGT TTTTTATTGC ATCGTCCCTG
ATAGGGGTAA TTTCTCAAAG GCTTGTGAGA AAAATTTGTC CCAACTGCAT AAAGAAGTAT
GTTGCAACAG ATGAGGAACT TTCACTTCTT GAACTGGACA GACCCGTCGA ACTGTATTCG
GGAAATGGGT GCGCAGAATG TTCCGGTACC GGATACAAAG GGAAATTGGG TGTTTTTGAG
GTGCTGAATG TGGACAAGAG CTTCAGGGAT ATGATGAAGG AAAACTTTGC AAAGGAGAAA
TTGAGAAAAT TTTGTGTTTT AAGGGGAATG AAAACTTTAA AAGAAAATGC AAAACAGCTT
GTTCTTGAGG GAAAAACCAC TGCTTTTGAG ATGTCAAGAA TGCTGTCTTT TGAAGAAGAA
TTATAA
 
Protein sequence
MQKQKRKGLG DILVEAGLIS KEQLDKALKL QKKTGQKLGV LLVSEGIVTQ EDIMRVLEEK 
IGVLRVALEE CNIDPAVCSL IPEKLARRYE LIPIAQKDGV LRVAMSDPLN VFAIDDIEDY
TGMRVEPVVD FASSIKNAID KYYRTQHVLV EPVKEKGILF KIDEETIELE SVEAENESAS
MLLNSIIEQA IRNGSGDIHI EPLQNALKIR FRTDGQMHEV MRTEIGMLNG VLAKIKAICG
MNMNEKAVPQ DGRVKVSLDG RDYNLKVSIL PTVFGEKIAI RIVHKKTSVI PKEQLGICQE
DLVKFERMIK SPKGLVLITG PEGSGKTTTL YSAVSEINSP NVHIITIEDP VEYVIEGVNQ
VQVNMKTGLT YEKGLSSILE QGPDVIVIGD IKDAKTAEIA VKAAMGGHLV LGAFCANDTL
DAVLTLVEMG IDPFFIASSL IGVISQRLVR KICPNCIKKY VATDEELSLL ELDRPVELYS
GNGCAECSGT GYKGKLGVFE VLNVDKSFRD MMKENFAKEK LRKFCVLRGM KTLKENAKQL
VLEGKTTAFE MSRMLSFEEE L