Gene Cthe_1709 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1709 
Symbol 
ID4808884 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2031877 
End bp2034156 
Gene Length2280 bp 
Protein Length759 aa 
Translation table11 
GC content36% 
IMG OID640107122 
ProductPhage-related protein-like protein 
Protein accessionYP_001038123 
Protein GI125974213 
COG category[S] Function unknown 
COG ID[COG5412] Phage-related protein 
TIGRFAM ID[TIGR01760] phage tail tape measure protein, TP901 family, core region 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAAGAG ATGCAAATAC CGTAGTTGCA AGGGTAGGAC TTGATGATAG AGGTTTTCAA 
GAAGGTGTAG CAAAAATTCA AAGAAGTCTA AAGGTTGTTC AAAGTGAATT TGCAGCAGCT
TCTTCTAAGC TTGGTGATTT TGGCAAATCT GAAGAAGGAC TAAGACTTAA ATCAGATACC
TTAAATAAAC AGATAGAACT TCAGAAGGAT AAAGTTGCGG CATTAGAAAA AGCATATCAA
AAGAGTGTAG AAACAAAGGG TGAAGATGCA AAGGCTACTG AAAATCTTAA AATTAAGCTT
AATTATGCTA CAGCAGAACT AAATAAAATG GAGAATGAGC TGAAAGAGGC AACAAGAGAA
CTTAAGGAAA AAAGCTCGGC TTGGTATAAG CTGTCTGAAA GCATGAATAG TGCAGGAGAA
AAGATGAAAT CTGTAGGAGA TAAGATGTCT TCTATAGGAA GTAAGCTTTC TACTGCTGTA
ACACTTCCTT TAGTTGGAAT AGGAACTGCT GCAACAAAAA TGGCTATGGA TGCAGTGGAA
TCTGAAAATC TCTTTGAAGT AGCTATGGGT TCAATGGCAG GCGATGCAAG AAAGTGGTCA
GAAGAAACCT CAAAAGCTCT AGGACTCAAT GCTTTCAATG TAAGAAAAAA TGTAGCAACT
TATAATGCCA TGCTTACCTC TATGGGGTTA ACTTCACAAG AGTCATTAAA GATGTCAGAA
GGATTAACTC AGCTTTCCTA TGATATGGCT TCTTTCTATA ACTTAAAACC AGAAGAGGCA
TTTGAGAAAT TAAAATCTGG TATTAGTGGA GAGGCAGAAC CACTTAAAGC TTTAGGTATA
TTAGTTAATG ATAATACAAT TAAAACCTAT GCTTATTCTC ATGGAATTGC AAAGCAGGGT
GAACAGCTTA CTGAAGCACA AAAGGTTCAA GCAAGGTATG GTGCTATAAT GGAAGCTACA
AAAAATGCTC AAGGTGACCT TGCAAGAACT ATGGATTCAC CAACCAATAA GCTTAGAGTT
ATGAAAGAGC AAACACAGCA GCTTGGCATT CAGTTTGGAC AACTTTTAAT TCCTATACTT
GAAAAACTAA TGAACACTAT AAAACCTCTT TTAGATAAGT TCCAAGGGCT ATCAAAGGAA
CAGCAAGAAA CAATTATTAA AATCGGATTA GTAGTTGCAG CAATAGGTCC AGTAATCATG
ATTATAGGTA AGGTAATAAG TATTGCAGGA ACTCTTTCTA CTGTAATTGG AACAGTGAGT
GGAGCAATGG CAGCAGCAGG TGGTGCATCT GGAGCCTTAG GAGCTGCTTT TGCAGCAATA
ACTGGTCCAG TTGGCATTGC AGTAGCGGCT ATTACAGGTC TTATTGCTAT TTTTGTAGCC
TTATACAAAA ATAATGAGGA CTTTAGGAAT TCAGTAAATA CAGTATGGAA TGGAGTTAAA
GCTTTAATAA GTGGTGTCAT TGAAAGCTTA AAGGCTATGT TTCAAGCCTT TATTACCTTA
GCAAATCAAA TATGGAAAAA GTATGGTGAT GATTTTGTAA AGATAATAAC AACTGCTTTT
AATTTAGTAG CAACTATTGT AAATACCACA CTTAAAGCCA TTCAAGATGT TATAAAAATA
GTTACCAGTG CAATAAAAGG TGATTGGAAG GGTGTATGGG AAGGAATAAA AAATCTTACC
TCTGACTTAT GGAATGGAAT AAAGAATGTG ATAAAATCAG CCATTGATTT AGTTAAAGGA
ACTATAAAAA CAGAATTTGA ATTTATCAAA GGCATAATCT TAGGAATATG GAATGGCATT
AAAGGAATAA CTTCAGCAGT TTGGAATGAG ATAAAATCAG CTATTGAAAA TCCAATAAAT
GCAGCAAAGA ATGCTGTAGG TAATGCTATA AATGCAATTA AAGGATTTTT CAGCAATCTA
CATTTACCAG AAATAAAAAT ACCTAAAATA AAACTTCCTC ATTTTAGTAT TGAGGGAGAG
TTTAGTTTGA AACCTCCAAG TGTACCTTAC CTAGGTGTAG ATTGGTATGC GAAGGGTGGT
ATATTTAATA GACCTAGTAT AATCGGTGTC GGTGAAGCAG GAACTGAAGC TGTACTTCCT
ATAGATAGGT TAGATGAGCT TATGGCAAGG GCAATTGAAA AAGCAAAAGG AGGAAGTGGA
AGCGGATTAA CACTTCATAT AGAAAATTTC ATTAATAATT CAGATAAGGA TATAGAGCAG
CTTGCCTATG AGCTTGAATT TTACAGGCAG AGAGTTTCAA TGGGAAGGGG TGGTGCTTAA
 
Protein sequence
MARDANTVVA RVGLDDRGFQ EGVAKIQRSL KVVQSEFAAA SSKLGDFGKS EEGLRLKSDT 
LNKQIELQKD KVAALEKAYQ KSVETKGEDA KATENLKIKL NYATAELNKM ENELKEATRE
LKEKSSAWYK LSESMNSAGE KMKSVGDKMS SIGSKLSTAV TLPLVGIGTA ATKMAMDAVE
SENLFEVAMG SMAGDARKWS EETSKALGLN AFNVRKNVAT YNAMLTSMGL TSQESLKMSE
GLTQLSYDMA SFYNLKPEEA FEKLKSGISG EAEPLKALGI LVNDNTIKTY AYSHGIAKQG
EQLTEAQKVQ ARYGAIMEAT KNAQGDLART MDSPTNKLRV MKEQTQQLGI QFGQLLIPIL
EKLMNTIKPL LDKFQGLSKE QQETIIKIGL VVAAIGPVIM IIGKVISIAG TLSTVIGTVS
GAMAAAGGAS GALGAAFAAI TGPVGIAVAA ITGLIAIFVA LYKNNEDFRN SVNTVWNGVK
ALISGVIESL KAMFQAFITL ANQIWKKYGD DFVKIITTAF NLVATIVNTT LKAIQDVIKI
VTSAIKGDWK GVWEGIKNLT SDLWNGIKNV IKSAIDLVKG TIKTEFEFIK GIILGIWNGI
KGITSAVWNE IKSAIENPIN AAKNAVGNAI NAIKGFFSNL HLPEIKIPKI KLPHFSIEGE
FSLKPPSVPY LGVDWYAKGG IFNRPSIIGV GEAGTEAVLP IDRLDELMAR AIEKAKGGSG
SGLTLHIENF INNSDKDIEQ LAYELEFYRQ RVSMGRGGA