Gene Cthe_1339 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1339 
Symbol 
ID4809479 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1629675 
End bp1630913 
Gene Length1239 bp 
Protein Length412 aa 
Translation table11 
GC content43% 
IMG OID640106763 
Producttype II secretion system protein E 
Protein accessionYP_001037764 
Protein GI125973854 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG4962] Flp pilus assembly protein, ATPase CpaF 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAGCTGG AAGAAAAAGA GAAGCTTATT GCTCAGATAA GAAAGCATAT AAGCGAAAAC 
CTGGATTTGA GAAAGGACTT TTCGGATGAA GAAATAAAGG ACATTATTAC AAATGTTGTT
TTTGAAAGGT CAAGGGATTA TTACCTCAGC GTGGGGGAGA AGAAGGAAAT TGCCGATGCG
ATTTTTAATT CCATGAGAAG GCTGGATGTT CTTCAACCCC TCATTGATGA CAAAAGTATT
ACTGAAATAA TGATAAACGG CCCGGACTCA ATATTTATTG AAAGGGACGG AAGAGTCTCA
AAATTGAACG TAAAATTTGA AAGTCGACGC AAGCTGGAGG ACGTAATTCA GACTATTGTA
TCAAGGGTGA ACAGGACGGT AAATGAGGCG TCTCCGATTG TTGATGCCAG GTTGCCGGAC
GGTTCCCGTG TAAACGTGGT TTTACCGCCG ATAGCTTTAA ACGGGCCTGT GGTTACCATA
AGAAAGTTTC CGGAAAAACC GATGACGATA GAGCAGCTTA TAAAATACGG TTCAATCACC
GAGGAAGTTG CTGAAGTGCT GGAGAGGCTG GTTAAAGCAA AATATAATAT ATTTATCTGC
GGAGGTACGG GCTCGGGAAA AACCACATTT TTGAATGCTC TTAGCAATTT TATTCCGAAG
GACGAGAGAA TTGTTACAAT AGAAGACTCG GCAGAGCTTC AGATTACCGG GGTGGAGAAC
ATTGTCAGGC TGGAAACAAG GAATGCCAAT ACGGAGGGAA AGGGAGAGAT TACAATCAGG
GATCTCATAA GAACTTCACT TCGTATGAGG CCGGAGAGAA TTATTGTGGG TGAGGTGCGT
GGAAAAGAGG CACTCGACAT GCTTCAGGCG ATGAATACCG GACATGACGG TTCCCTTTCC
ACCGGACACG CAAACTCCAC AAAGGACATG CTTTCAAGGC TTGAAACCAT GGTGCTGAGC
GGTGCCGAAA TGCCTTTGGA GGCTATCAGA CAACAAATAG CTTCTGCGAT AGATATAATT
ATTCATTTGG GAAGGCTCAG GGATAAATCG AGAAGAACCC TTGAGATTAC AGAGGTTGTG
GAATACAAGA ATGGGCAGAT TGTTCTAAAT CCGCTTTATG AGTTTGTCGA AGAGGGGGAA
ACTCCGGAAA AACAGGTGAT TGGCACCTTG AGAAGAACAA AGAATGAAAT GGTGAACAAA
CTCAAATTCA AGATGGCGGG TATATCCGAC AAGTTTTAA
 
Protein sequence
MELEEKEKLI AQIRKHISEN LDLRKDFSDE EIKDIITNVV FERSRDYYLS VGEKKEIADA 
IFNSMRRLDV LQPLIDDKSI TEIMINGPDS IFIERDGRVS KLNVKFESRR KLEDVIQTIV
SRVNRTVNEA SPIVDARLPD GSRVNVVLPP IALNGPVVTI RKFPEKPMTI EQLIKYGSIT
EEVAEVLERL VKAKYNIFIC GGTGSGKTTF LNALSNFIPK DERIVTIEDS AELQITGVEN
IVRLETRNAN TEGKGEITIR DLIRTSLRMR PERIIVGEVR GKEALDMLQA MNTGHDGSLS
TGHANSTKDM LSRLETMVLS GAEMPLEAIR QQIASAIDII IHLGRLRDKS RRTLEITEVV
EYKNGQIVLN PLYEFVEEGE TPEKQVIGTL RRTKNEMVNK LKFKMAGISD KF