Gene Cthe_2475 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2475 
Symbol 
ID4809855 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2946915 
End bp2948348 
Gene Length1434 bp 
Protein Length477 aa 
Translation table11 
GC content37% 
IMG OID640107890 
ProductSPP1 family phage portal protein 
Protein accessionYP_001038870 
Protein GI125974960 
COG category 
COG ID 
TIGRFAM ID[TIGR01538] phage portal protein, SPP1 family 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGCTGCTTA ATCTTTTTAA TTTCAGGAAC TTTAAAGACT TATTCCGCAA TGATATAAAC 
ATGATGACTG TAGAAGAAAT TTTATATAAC GAAATCAAAG AGTTCCAGGC ATCCGATAGA
AGGGCCTGGA TGGTTATTGG CGATAGATAT TACCGGTGCG AAAATGACAT CCTTAACAGG
CGTATAGTAC GCCATACAGA GAGCGGAGAC ATTGAAGATA GGTCAAAAGC AAACAATAGG
TTGGCCCATG GTTTTGTTAA AAACCTTGTG GATGAAAAAA TAGGATATCT GCTTACAAAG
GATTATTCGC TGAAGTGCGA CAATAAAGAA TATATTGAGA AAGTTAAAAA CGTCTTGGGT
AAATATTTTC AATACACCCT TACCAGGCTC GGATATGAAG CGTCGAATAA AGGCATAGCA
TGGTTACAAG TTTACATAAA TGAGCAGGGC AAATTTGGAA TGATGATAAT TCCTGCTGAA
CAGTGCGTTC CACTCTGGAA AGATAACACT CACACTGAAC TTTATGGCAT GATTAGATAT
TATGTGCAGA CAGTTTATGA AGGCAAGGAA AAGAAGCAGA TCACTCGCGT GGAATATTAC
ACGGATAAAG AGGTTTATTT TTATGTTCTC GATAATGACC ATCTTATCCC GGATATAGAG
CAATATGAAG GAGGGCCCAT ACTACACTAT AAAAAAGGGG AAGAAGGCCG AAGTTGGGGG
AAAGTGCCTT TTATTGCCTG GAAGAATAAC CATCTTGAAT ATCCGGATGT TAAATTCATT
AAATCGCTTG TGGACGCTTA CGATAAGTCA CGGAGTGAAA TAGATAATTT CATTGAAGAA
ACAAAAAATC TTATCTATGT TTTAAAAGGC TATGGCGGAG AAAATTTATC TGATTTCATG
AAAGACCTTA ATTACTACCG GGCTATAAAA ATAGATGATC CAGAGCATGG TGGAGTTGAT
ACACTAACAC CGAAAATAGA TATTCAGGCA GCAAAGGAAC ATTTCGAACA ATTAAAGCGG
GATATAAATG AGTTTGGCCA AGGTGTGCCC AAGGACCTTG ACAAATATGG CAATTCTCCC
AGTGGGACAG CATTGAAGTT TTTATATAGT GGGCTGGATT TAAAATGCAA CCACTTGGAA
GTAGAATTTA GACAGTCATT TAATCAGCTT TTGTATTTTG TAAACAGATA TCTCGCAGAA
AACGGTCAGG GAAATTATGA GAATGAAAAT GTAGAGCTAA TTTTCAATAG AGATATACAG
ATTAATGAAA CTGAAACTAT CAATAATTGT GTTAACAGTA AAGGCATTAT TAGCGATGAG
ACTATCCTTG CAAATCATCC ATGGGTGTCT GATGTAGAAG AAGAATTAAA GCAGATTGAG
AAAGAAAGAA AATCAGAGGA ACCGCCAATG TTTGGTGAGG GGGATGAAGA GTGA
 
Protein sequence
MLLNLFNFRN FKDLFRNDIN MMTVEEILYN EIKEFQASDR RAWMVIGDRY YRCENDILNR 
RIVRHTESGD IEDRSKANNR LAHGFVKNLV DEKIGYLLTK DYSLKCDNKE YIEKVKNVLG
KYFQYTLTRL GYEASNKGIA WLQVYINEQG KFGMMIIPAE QCVPLWKDNT HTELYGMIRY
YVQTVYEGKE KKQITRVEYY TDKEVYFYVL DNDHLIPDIE QYEGGPILHY KKGEEGRSWG
KVPFIAWKNN HLEYPDVKFI KSLVDAYDKS RSEIDNFIEE TKNLIYVLKG YGGENLSDFM
KDLNYYRAIK IDDPEHGGVD TLTPKIDIQA AKEHFEQLKR DINEFGQGVP KDLDKYGNSP
SGTALKFLYS GLDLKCNHLE VEFRQSFNQL LYFVNRYLAE NGQGNYENEN VELIFNRDIQ
INETETINNC VNSKGIISDE TILANHPWVS DVEEELKQIE KERKSEEPPM FGEGDEE