Gene Cthe_1618 details


Gene Information

Locus tag: Cthe_1618
Symbol:
ID: 4809313
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 1948913
End bp: 1951201
Gene Length: 2289 bp
Protein Length: 762 aa
Translation table: 11
GC content: 51%
IMG OID: 640107034
Product: hypothetical protein
Protein accession: YP_001038035
Protein GI: 125974125
COG category: [S] Function unknown
COG ID: [COG5412] Phage-related protein
TIGRFAM ID: [TIGR01760] phage tail tape measure protein, TP901 family, core region
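
The coordinate and length fields above are related by simple arithmetic, which can be checked with a short sketch. The function names below are illustrative and not part of any database API. The sketch assumes inclusive, 1-based coordinates and that the CDS ends in a single stop codon that is not counted in the protein length.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive, 1-based start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int, has_stop_codon: bool = True) -> int:
    """Number of residues encoded by a CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default,
    that the final codon is a stop codon that encodes no residue.
    """
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    return codons - 1 if has_stop_codon else codons


if __name__ == "__main__":
    length = gene_length_bp(1948913, 1951201)
    print(length)                     # 2289
    print(protein_length_aa(length))  # 762
```

With the values from this record, 1951201 - 1948913 + 1 gives 2289 bp, and 2289 / 3 - 1 gives 762 aa, matching the Gene Length and Protein Length fields.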


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGGCAGACA ATTTTGGCTT GAAGATTGGG ATTGAAGGCG AAAAGGAATT TAAGAACGCC 
ATTCGTGAGA TCAACCAAAG TTTTAAGGTA CTGGGCAGCG AGATGAACCT GGTTGCATCT
CAGTTTGATA AGCAAGATAA ATCAGTTGAA GCTGTTACTG CAAGAAACAA GGTGCTGAAT
AAAGAAATCG AATTGCAGAA AGAAAAAATT GCTACTTTGG AGAAAGCGCT TGCCAACGCC
GCCTCCTCTT TCGGGGAGAC CGACAAGCGG ACGCAGTCCT GGCAGATACA GCTTAACAAC
GCCAAAGCAG AGCTGAACAA AATGGAGCGC GAGCTGGAAG CGAACAACAA AGCGCTGGAC
AATGCGGGAA AAGAGTTTGA CGAAGCGGAA AAACAGGCGG GCGAATTTGG CAGAGAAATT
AAAAAGGCCG CGGATCAGGC GGATGACGCA GGCGGGCGCT TTGAAAAACT GGGAGGTGTA
TTGAAAGGTA TCGGTGTGGC CATGGGAGCA GCCCTGGCCG CTATTGGTAC AGCAGCGGTC
GGTGCGGGAA AAGCTCTTGT GGATATGTCG GTAAATTCGG CGGCCTATGC CGATGAAATC
CTTACCGCCT CGACCGTAAC CGGCATGTCC ACCGACAGCC TGCAGGCGTA TAAATACGCC
GCGGAGCTTG TGGATGTCTC CTTAGATACT TTAACCGGCA GCATGGCAAG GAACGTCAGA
TCCATGTCTT CAGCACGGAA AGGCACCGGT GAGATCGCGG ACGCTTACCG GAAGCTCGGC
GTTTCGGTCA CGGACGCCAA CGGCAACCTG CGCGACAGCG AAGCCGTATA CTGGGAAACC
ATAGACGCAC TTGGCAAGGT GTCCAACGAA ACTGAGCGTG ACGCGCTGGC CATGCAGATT
TTCGGAAAGT CCGCACAGGA ACTCAATCCC CTGATTGCGC AGGGTTCGGC AGGGATAGCG
GAGCTGACCG AGGAAGCAAA GCGCATGGGC GCAGTGATGA GCGAGGATTC ATTGAACGCT
CTCGGGAAAT TTGACGACAG CATCCAGCGG CTCAAGGCGG GCGGCGCGGC GGCCAAAAAC
ATGCTGGGCA CCGTGCTGCT TCCCCAGCTT CAGATATTGG CCGACGACGG GGTTGTGCTT
CTCGGGGAAT TTACTCGTGG ATTATCTGAA GCAAACGGAG ACTGGACGAA GATCAGCGGG
GTCATCGGCA ATACGGTGGG AAGCCTTGTA AACATGCTGA TGGAAAACCT GCCGAAGCTT
ATCCAGGTAG GATTGGATAT CGTCACCTCC ATCGGCGGGG CTATTGTGGA CAATCTGCCG
GTTATTATCG ACGCGGCGGT GCGGATTGTC ATGACGCTGC TGCAGGCTTT AATCGATGCA
CTGCCGCAGA TAACCGACGG TGCTTTGCAG CTTGTTATGG CGCTGGTGCA GGGAATTATT
GACAACCTTC CCGCTTTGGT GGAAGCCGCG GTGCAAATGA TTGCCACACT GGCGTCCGGT
ATCGGGGAGG CGCTTCCGGA GCTGATACCC GCTGTTGTCG AAGCCATTAT CCTCATTGCC
GAGGTACTTC TTGACAATAT GGATAAAATT CTTGACGCAG CGTTTCAGAT CATACAGGGG
TTGGCGCAGG GACTTTTAAA TGCATTGCCA GAACTAATTG AAGCACTGCC GAGGATAATT
ACAACAATCA TTGACTTTGT GACGAACAAT ATGCCGAAGA TCATAGAATT GGGAATTACG
CTTATCGTAC ATCTTGCTGC CGGGCTTGTG AAAGCCATTC CAGAACTAGT AAAGTCTTTA
CCTCAGATTG TTGCGGCAAT TATTGAAGGT TTGGGCAAGG CGGTTGTTTC AGTAGTTGAG
ATTGGTAAGA ACATTGTAAA AGGCATCTGG GAAGGTATTA AAAGCCTTGG TAGCTGGATT
AAGGATAAGG TTTCCGGTTT CTTTTCCGGT ATTGTTGATG GAGTAAAGAA TTTTCTTGGA
ATCAGATCTC CGTCCACTGT TTTTGAAGGC ATTGGCGGCA ATATGGCACT GGGTATTGGT
GAGGGATTTG ACAAGGCTAT GGCCAGAGTG GCAGACGATA TGCAAAATGC AGTGCCGACA
GATTTTAATA TATCTCCTGA TATTAATGTA AGTGGAAGAG GTGAATTTAG CGGTTTAGCT
TCTGGGCCGC TTGTTGTGGT GCAGCAGATG ATTGTTCGTG GTGAAGAAGA CATACGTAGG
ATTTCACAGG AGTTATATAA CCTGATGCAG ACAGGTTCAA GGGCGCAGGG ACGTTTTATA
ACAGCGTAA
 
Protein sequence
MADNFGLKIG IEGEKEFKNA IREINQSFKV LGSEMNLVAS QFDKQDKSVE AVTARNKVLN 
KEIELQKEKI ATLEKALANA ASSFGETDKR TQSWQIQLNN AKAELNKMER ELEANNKALD
NAGKEFDEAE KQAGEFGREI KKAADQADDA GGRFEKLGGV LKGIGVAMGA ALAAIGTAAV
GAGKALVDMS VNSAAYADEI LTASTVTGMS TDSLQAYKYA AELVDVSLDT LTGSMARNVR
SMSSARKGTG EIADAYRKLG VSVTDANGNL RDSEAVYWET IDALGKVSNE TERDALAMQI
FGKSAQELNP LIAQGSAGIA ELTEEAKRMG AVMSEDSLNA LGKFDDSIQR LKAGGAAAKN
MLGTVLLPQL QILADDGVVL LGEFTRGLSE ANGDWTKISG VIGNTVGSLV NMLMENLPKL
IQVGLDIVTS IGGAIVDNLP VIIDAAVRIV MTLLQALIDA LPQITDGALQ LVMALVQGII
DNLPALVEAA VQMIATLASG IGEALPELIP AVVEAIILIA EVLLDNMDKI LDAAFQIIQG
LAQGLLNALP ELIEALPRII TTIIDFVTNN MPKIIELGIT LIVHLAAGLV KAIPELVKSL
PQIVAAIIEG LGKAVVSVVE IGKNIVKGIW EGIKSLGSWI KDKVSGFFSG IVDGVKNFLG
IRSPSTVFEG IGGNMALGIG EGFDKAMARV ADDMQNAVPT DFNISPDINV SGRGEFSGLA
SGPLVVVQQM IVRGEEDIRR ISQELYNLMQ TGSRAQGRFI TA
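
The protein sequence above corresponds to the gene sequence translated with translation table 11 (the bacterial, archaeal and plant plastid code). A minimal sketch of that translation follows. The function name `translate_table11` is hypothetical. The sketch assumes the input is the coding strand in frame, and that an alternative start codon such as GTG at the first position is read as methionine, which is why the gene begins with GTG while the protein begins with M.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G at each position, last position varying fastest).
# Table 11 assigns the same amino acids as the standard code; it differs in
# which codons may act as translation start sites.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Initiation codons listed for table 11; all are read as Met at position 1.
START_CODONS_TABLE_11 = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_table11(dna: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used on this page) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS_TABLE_11:
            aa = "M"  # alternative start codons initiate with methionine
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    first_row = ("GTGGCAGACA ATTTTGGCTT GAAGATTGGG "
                 "ATTGAAGGCG AAAAGGAATT TAAGAACGCC")
    print(translate_table11(first_row))  # MADNFGLKIGIEGEKEFKNA
```

Applied to the first row of the gene sequence, this gives MADNFGLKIGIEGEKEFKNA, the first 20 residues of the protein sequence. The final codons ACA GCG TAA translate to T, A and a stop, matching the C-terminal "TA".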