Gene Cthe_1107 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1107 
Symbol 
ID4811405 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1316762 
End bp1319125 
Gene Length2364 bp 
Protein Length787 aa 
Translation table11 
GC content40% 
IMG OID640106529 
Producttype II secretion system protein E 
Protein accessionYP_001037532 
Protein GI125973622 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG2804] Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB 
TIGRFAM ID[TIGR02533] general secretory pathway protein E
[TIGR02538] type IV-A pilus assembly ATPase PilB 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTAGGGG TAGATACAAG AGGTATTGAT GAAATACTTT TGGAAATGGG AGTTTTAAAA 
ATAGTTGACC TTAAAAAAGC CTGGGATATT CAGAGGGAAA GCAATAAGAA CATTGAGGAC
GTGCTATTGG AACTCGGTCT TGTATCCCAG AAAGACATAA TGCATGCCAA TGCCGTGAAG
ATGGGTATTC CTTTCGTCGA TCTGTCAACT TATCAAATCA GCGATTCCAG TGTTCCTTTA
TTAATTACAC GAAACATAGC AAACCGGTAC AAGGTTATAC CCATAGAAAA GGAAAATGGT
GTTTTGACAG TTGCAATGAG TGATCCGACG GATATTTTCT GTATAGATGA CATACGTCTT
GCAACTGCTT TGGAAATAAA GCCTGTCCTT GCGGATGTCA AGGAGATAGA AAGACTTATT
GTTGAATATT TCGGAGAAGA AAAGAAGCCG CAGGAATCGA AATTAAAAGC GGAAAATGAG
GAGCAGAACA AGAAAGAAGA ACTTTTAAAA ATGGAGGAAG AACTTCTTGG AAGAGAGATA
TATAATAACA TAAAAGCCGA TGTTGAGACG AGAGAGCCGG AATTTGATTT GGCAAGCAAG
GGATTTGATC AGAATACTTA CAACGAAAGC GGTATTTTTA AAGATAAAAT AGGAAACATC
CTTGTACGTG CGGGAGTTAT TACACAGGAT CAACTGGAAA ATGCCCTCAG TATACAGAAA
AAATCTGGCG GACTTATCGG TCAGATTTTA GTAAAGCAAG GCTATATTGA CAGAAGGTCT
CTTTATGAAT TTCTCCAAAA ACAGATGGGA GTCGAATACG TTGATATTGA GGGAATTGAA
ATTGATGAGG ATATCATTGG TTTGGTATCC CCGAATTTGG CAAAAACCCA CAAGGTAATC
CCTATTGAAA AAGTGGACGG GAATTTAAAG GTTGCGATGA GCGACCCGAT GAACATATTT
TCAATTGACG ATTTGAGGCT TACAACCGGC CTTGAAATTA TTCCTTGCCT TGCCGATGAA
GAGCAGATTT CGGCACAGTT GGAAAAATAC TACGGAAAAG CTTCCAGGAA GACCAGTGCG
AAAGAAATAG AGCAGAAGGT TGCGGATCTT GACGAGGAAA TTAAGAAAGT AAATGAAAAA
ATTGCGGTTG AAATAACTCA AACTGAAGAT GAAGATACAA CAATTGACAT TAGCGATCTT
GAAAATGCCC CCATTGTCAA GATGGTTAAT ATTATCTTCC AGAAAGCCGT GGCTACAAGA
GCAAGTGATA TTCACATAGA ACCCCAGGAA GATTGTGTTT TAATAAGATT CAGAATAGAC
GGACAACTGG TAGAGATAAT GAGATATGAC AGAAAGATTC TTTCTTCAAT TGTTGCCAGA
ATAAAAATCA TCAGCGGTCT GAATATTGCG GAAAAGAGAA TTCCTCAGGA CGGAAGAATA
GGAATAAAAA TTGACGACAG GGAGTATGAC ATGCGTGTTT CCGTTCTGCC TACAATGTTC
GGAGAAAAAG TTGTTATAAG AATTGCCGAC AAGGAAGGCT TTAATGTTTC GAAGAAAGAG
TTGGGATTCT TTGAAGATGA TTTGGAGAAA TTTGACCAGA TAATATCAAG TCCTTACGGT
ATTATACTGG TTACCGGACC TACCGGAAGC GGTAAGTCAA CAACGCTTTA TACCGCGCTG
AGGGAATTGT GCAAGCCTAA TGTCAACATA CTTACTGTTG AGGATCCTGT TGAAAGTACT
ATAAAGGGTA TAAATCAGGT ACAGGTGAAT GTAAAAGCTG GTTTGACATT TGCTACGGCA
CTGAGAGCGT TCTTAAGACA GGACCCCGAC ATAATAATGG TGGGAGAGAT ACGTGACTCG
GAAACAGCTG AAATAGCAAT CCGTGCAGCT ATCACGGGAC ACCTTGTTTT CAGTACATTG
CACACTAACG ATGCTGCAAG TTCTGTTACA AGAATGATTG ACATGGGAAT AGAGCCGTTC
CTGTTGTCTT CGGCTCTGGT TGGATTGATT GCGCAAAGAC TTGTAAGAAG ACTCTGTCCT
CATTGCAAGG AAGCTTTCCA GCCGGATAAG AATGAGAGAG AGATTCTTGG CTTGAAAGAT
GATGAAGAAG TTACAATATA TCGTGCAAAA GGGTGCGACG AATGTAACAA TACCGGTTAC
AAAGGAAGAA TAGCGGTTTA TGAGATTCTG ACGGTCAATA GGGAAATAAA GGAACTGATA
TCCAAGAACG TAAGTTCCGA TGTAATAAAA GATGCGGCTA TTAAAATGGG CATGAAAACA
TTGAGAATGA ACTGTACGAG GTTGGTTAAA GAAGGTATTA CTACGATAGA TGAGATGCTT
CGTATTGCAT ATTCCATAGA CTAG
 
Protein sequence
MLGVDTRGID EILLEMGVLK IVDLKKAWDI QRESNKNIED VLLELGLVSQ KDIMHANAVK 
MGIPFVDLST YQISDSSVPL LITRNIANRY KVIPIEKENG VLTVAMSDPT DIFCIDDIRL
ATALEIKPVL ADVKEIERLI VEYFGEEKKP QESKLKAENE EQNKKEELLK MEEELLGREI
YNNIKADVET REPEFDLASK GFDQNTYNES GIFKDKIGNI LVRAGVITQD QLENALSIQK
KSGGLIGQIL VKQGYIDRRS LYEFLQKQMG VEYVDIEGIE IDEDIIGLVS PNLAKTHKVI
PIEKVDGNLK VAMSDPMNIF SIDDLRLTTG LEIIPCLADE EQISAQLEKY YGKASRKTSA
KEIEQKVADL DEEIKKVNEK IAVEITQTED EDTTIDISDL ENAPIVKMVN IIFQKAVATR
ASDIHIEPQE DCVLIRFRID GQLVEIMRYD RKILSSIVAR IKIISGLNIA EKRIPQDGRI
GIKIDDREYD MRVSVLPTMF GEKVVIRIAD KEGFNVSKKE LGFFEDDLEK FDQIISSPYG
IILVTGPTGS GKSTTLYTAL RELCKPNVNI LTVEDPVEST IKGINQVQVN VKAGLTFATA
LRAFLRQDPD IIMVGEIRDS ETAEIAIRAA ITGHLVFSTL HTNDAASSVT RMIDMGIEPF
LLSSALVGLI AQRLVRRLCP HCKEAFQPDK NEREILGLKD DEEVTIYRAK GCDECNNTGY
KGRIAVYEIL TVNREIKELI SKNVSSDVIK DAAIKMGMKT LRMNCTRLVK EGITTIDEML
RIAYSID