Gene Cthe_1973 details


Gene Information

Locus tag: Cthe_1973
Symbol:
ID: 4810756
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 2353780
End bp: 2355303
Gene Length: 1524 bp
Protein Length: 507 aa
Translation table: 11
GC content: 40%
IMG OID: 640107389
Product: tetratricopeptide TPR_2
Protein accession: YP_001038384
Protein GI: 125974474
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG5010] Flp pilus assembly protein TadD, contains TPR repeats
TIGRFAM ID:
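
The coordinate and length fields above are related by simple arithmetic, and the following minimal sketch checks that they agree. It assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS is unspliced and ends in a single stop codon; the record does not state these conventions explicitly. The function names are hypothetical, chosen only for this illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count, assuming an unspliced CDS with one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values copied from the record above.
start_bp, end_bp = 2353780, 2355303

length = gene_length_bp(start_bp, end_bp)
print(length)                     # 1524
print(protein_length_aa(length))  # 507
```

Both results match the Gene Length and Protein Length fields listed in the record.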


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATATAA ATGTAAAAGA TATAATAGCT GATTCGCATC TGGCTTTTCT AAACCAGGAA 
AACGAAACGG CTTTAAGGCT GGCCAGACAG GCAATATCCC TGGCGCCGGA CAATCCTGAC
GCATACAGAT GCGCCGGAAA CGCTTGCATG TCATTGGACC GTTATGATGA GGCCATTGAG
TATTACAGGA CAGCTGTCAA ATATGATCCT GACAACGGCA ACAGGTATTA TGACCTTGGA
TTTGCGTTAG CTTCCGCTGA AAGATTTTCC GATGCCCTGA AAAATCTTGC GAAGGCAGAA
GAGTTAGGAT GTACTCCGGA AAATCTTGTT CAGCTTTACA ACTTGCTTGG AATCCTTTGC
TTTGATATTG GAAGGTACGA TGATGCTCTT GTAAATTTAA ACAAAGCGGA AAAGCTTATC
GGTATCGATG TTGACATTCT TAAGCGCAAA GCTGTTATAT ACGGTATCAA AAACGATATT
AAGAATGGAC TGATGACGGC AAATCAAATC AAACTGGTTA CTCCATCTGA ATATACCGGC
TATCAGATTG CATTTAAGTT GTTGGTTCAG GCAAAGCGTC TGGATGAAGC TGAGAAAGAG
TTGGAAAGAG CCAGAAAATA TGCATCACCG ACGATGGATT TCTATTTTGA CTACATGACT
CTTGAGCTTG AAAAGTATAA GACAGACAAT GACAAAGAAC ATTTTAAAAC TGCCCTTGGA
ATGATTGAAA AGGCTTTAAA AACGGTGAAA CCTACGGTGA AAGAAGTGGT TGAAAGTTAC
ATAAACGCAG CTGAGATATA TCTTCAGCTT GAAGACGCCG ACAGAGCCAT TGAGTGCCTT
AATGCTGCCC AAAATCCCAT CTGGGCTTAT AACAACGGGT TTGACGTGGT TGTAAGGAAC
TATGAGCCTG TTACTTTGAC GGAATATGAC ATCGAAGATA TGATTGAGGC TGACAGGAAA
AAGATAGAGG AACAGTTTGG AGATTATGGA TTTGAGGAAA TGGTTGAGAG TATCGAACCT
GATGAGGAAG GCAACAGGGA TTATTTAACT GTAATCGAAG ATGAAGTGAA AAGTGATGCA
AAGGAAGACT CAGAAGTATA TAAACTTGAT GAGTCCGAAA AGGTTGAATA TTCTTCTGAT
AATATAGATC AGATAAACAG GCTTTATGTA GGAGCGTACA CAGTAAAGAA AGATTTTGAA
AAAGTTATTG AATATTCCAG AAAATTACAA GCAAGTGAAA ACACATATAG TGTTTACCTG
GGCAAATATA CTGAAGCGAA TGCTATGAAG GAACTTGGTT TACCGGACTT TGCGGCAAAA
TATGAAGAAA TAATCAAATT TTTCAGAAAT GCTATGATTA AAGACCCTAC GGATCTGACT
GCTGTTACTT TCAGAGTCCA GTGCTACATT GACATTGGGC AGTATGATGA GGCCGAACAG
CTTTGCAGCC TTTTGACGAA GGAAGTGAGA AATTCATTTA TGGAAAAGAT TAAAAAAGCG
AAGTCCGGAG GTGAGCAGCA TTGA
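
The gene sequence above is printed in space-separated blocks of ten bases, so it has to be joined before any computation. The sketch below shows one way to strip that layout and compute a GC fraction of the kind reported in the GC content field. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the exact method the source database used to derive its GC figure is not stated, so this is an illustration only.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied as printed (six blocks of ten).
first_line = "ATGAATATAA ATGTAAAAGA TATAATAGCT GATTCGCATC TGGCTTTTCT AAACCAGGAA"

seq = clean_sequence(first_line)
print(len(seq))               # 60
print(seq.startswith("ATG"))  # True
print(f"{gc_fraction(seq):.0%}")
```

Passing the full sequence text to `clean_sequence` in the same way gives a string whose length can be compared with the Gene Length field.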
 
Protein sequence
MNINVKDIIA DSHLAFLNQE NETALRLARQ AISLAPDNPD AYRCAGNACM SLDRYDEAIE 
YYRTAVKYDP DNGNRYYDLG FALASAERFS DALKNLAKAE ELGCTPENLV QLYNLLGILC
FDIGRYDDAL VNLNKAEKLI GIDVDILKRK AVIYGIKNDI KNGLMTANQI KLVTPSEYTG
YQIAFKLLVQ AKRLDEAEKE LERARKYASP TMDFYFDYMT LELEKYKTDN DKEHFKTALG
MIEKALKTVK PTVKEVVESY INAAEIYLQL EDADRAIECL NAAQNPIWAY NNGFDVVVRN
YEPVTLTEYD IEDMIEADRK KIEEQFGDYG FEEMVESIEP DEEGNRDYLT VIEDEVKSDA
KEDSEVYKLD ESEKVEYSSD NIDQINRLYV GAYTVKKDFE KVIEYSRKLQ ASENTYSVYL
GKYTEANAMK ELGLPDFAAK YEEIIKFFRN AMIKDPTDLT AVTFRVQCYI DIGQYDEAEQ
LCSLLTKEVR NSFMEKIKKA KSGGEQH