Gene Cthe_1383 details

Gene Information

Locus tag: Cthe_1383
Symbol:
ID: 4809378
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 1688143
End bp: 1689300
Gene Length: 1158 bp
Protein Length: 385 aa
Translation table: 11
GC content: 37%
IMG OID: 640106807
Product: tetratricopeptide TPR_2
Protein accession: YP_001037808
Protein GI: 125973898
COG category: [S] Function unknown
COG ID: [COG1729] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
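
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are inclusive 1-based coordinates and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are illustrative and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1688143, 1689300)
print(length)                  # 1158
print(protein_length(length))  # 385
```

Under these assumptions both values agree with the Gene Length and Protein Length fields.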


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATAGATT TTAAAAATGA GCTTCAAAAT TATCCAATGA TTGATTTGGA TAAGTTGTCC 
CAGGCGAACC CGAACATACC TGACAATATA AAAAATTCAA TTGTGCTTTA CAATAAAGCT
TTGGAGGATT TTCGCGCAAA AAGCGAGGAT ATAGCCATAA TTGAGCTTAA AAAAGCCATT
TCCCTGAATC CGGATTTTCA TGAGGCAATG AATTTGTTAG GTATCTTCTA TATGTATATA
GGAGAGAACG ATAAAGCTGC AGAAGTTTTC CAAAAAGTTG TGGATGCGGA AAAAAACAGC
GTAATGGCGA TGAGATATTT AAAAGAAATT GATTCCGGGT ATGATCCTGT CGGAAATAAA
CAGGAAAAGG ATAAAAAATC CAGGAAGAAG AAAGAAAGAA ACAGAGGGGC AGCTCAGCTT
TCAAACCAGG TGACGGTAAA AAGCAGTGCC TCTTTTTCCT TTAAGAAGCT GATAAAAATA
TGGGAATACA AGCCCATGGA CACGGCAAGA CTGTTTTTGG GATTTGTAAT TGGTGCTCTT
CTGGTTTTCC TCTTAAGTTA TAATTATTAT TTCAGAGAAG AGAATAATGA GCAATTGGAG
CAGTTAACAG AGGAAAATAA CACTCTTATT GGAGAAAAAA ATGAGATTCA GAAAAAGTAT
GATGAACTGA ACGAGAAATA TCAGGGATTA AACGACACGT TTGAAGAAGT GAAAAAGCAG
GTTGACTATT ATTTGAATGC TTCAAAACTT CTTCAAATTG AGAAATATGC TTCCCAGAAC
CAGTATCGTG AAGCGGCCGA TTTATTATTG TTATTGAAAA ACACCGCATT TACCGGAGTG
GAAAAAGAAA AGTTTGACAA ATTATCCCAG GATGTCATGC CTAAAGCTGC GCAGGAAGAA
TATAATAAAG GAAGAGAATT GTACAACAGA AAAAATTACC AGGAAGCCGT GGAGAGATTT
GAAAGATCCC GCTCTTACAG TGACAATTGG AGGTATGCGG TAAATAATCT CTATTATCTG
GGAGTATGCT ATCAGGAACT CAACAACACC ACCAAGGCTT TGGAGATATT TGAAGAGGTT
GTAAATAAAT ATCCGAACAC TTCCTATGCC GGATACTCAA GGGAACGTAT AAACTATATA
CGAGGCAGCC AGCAATGA
 
Protein sequence
MIDFKNELQN YPMIDLDKLS QANPNIPDNI KNSIVLYNKA LEDFRAKSED IAIIELKKAI 
SLNPDFHEAM NLLGIFYMYI GENDKAAEVF QKVVDAEKNS VMAMRYLKEI DSGYDPVGNK
QEKDKKSRKK KERNRGAAQL SNQVTVKSSA SFSFKKLIKI WEYKPMDTAR LFLGFVIGAL
LVFLLSYNYY FREENNEQLE QLTEENNTLI GEKNEIQKKY DELNEKYQGL NDTFEEVKKQ
VDYYLNASKL LQIEKYASQN QYREAADLLL LLKNTAFTGV EKEKFDKLSQ DVMPKAAQEE
YNKGRELYNR KNYQEAVERF ERSRSYSDNW RYAVNNLYYL GVCYQELNNT TKALEIFEEV
VNKYPNTSYA GYSRERINYI RGSQQ
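
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It relies on the fact that translation table 11 assigns codons to amino acids in the same way as the standard genetic code (the two differ in permitted start codons). The helper names `clean`, `translate`, and `gc_content` are hypothetical, and only the first 30 and last 18 nucleotides of the gene are used here as spot checks.

```python
from itertools import product

# Codon table in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


print(translate("ATGATAGATT TTAAAAATGA GCTTCAAAAT"))  # MIDFKNELQN
print(translate("CGAGGCAGCC AGCAATGA"))               # RGSQQ
```

The two outputs match the first ten and last five residues of the protein sequence. Passing the full gene sequence to `gc_content` gives a value to compare with the GC content field.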