Gene Cthe_1707 details

Gene Information

Locus tag: Cthe_1707
Symbol:
ID: 4808882
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 2029269
End bp: 2031173
Gene Length: 1905 bp
Protein Length: 634 aa
Translation table: 11
GC content: 33%
IMG OID: 640107120
Product: phage minor structural protein
Protein accession: YP_001038121
Protein GI: 125974211
COG category: [S] Function unknown
COG ID: [COG4926] Phage-related protein
TIGRFAM ID: [TIGR01665] phage minor structural protein, N-terminal region
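
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the function names are hypothetical and not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length_bp(2029269, 2031173)
    print(length)                     # 1905
    print(protein_length_aa(length))  # 634
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of this record.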


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATCTATG TATATGATAA GAAGACAACT AGAGGAAACT TTGACAATAA TGGGCTTGCT 
GTTTTAGATG AATGCCTTAT GGCTGAGATA AATGAAGAGT TAAATGGAGA TTATAGTTTA
GAAATTGAAT ACCCTGCTCA ATCTAAAAAA GCACAGTATT TAGAGGAACT CAATATTATT
AAAGCTGATG GACAGCTTTT TAGAATATAT AAAGTAGAGA GAACACAGGA TAAAATAAGT
AAAGTTAAGG TATGGGCAAG ACATATATTC TATGATCTTG CCTTTTATTT TATAGAATCA
GCAAAGGTGC TTAATGCAAA TATGAAAGAA GCTCTTGAAG CAAGCATACC GCCCGAACTT
CAAGGGTTGT TTTTATTCAA GGCATTAGAA GAGAATTTAG CACCTTTTGC TGTTAAAGAG
GTTAATGCAG TAGATGCTAT TTTTAGACTT ATTGAAATTT ATGGTGGAGA ACTTTTTAGA
GATAACTTTA ATATAGAGAT AAAAGAGTCC ATTGGTGAAA ACAATGGAAT TTTAATAAAA
TACGGTAAAA ACATAAATGG AATGAAGGTT ATTGAGGATA CCAGTGAACT TGCTACAAGG
ATATATGCAG TAGGAGCAAA TAATTTATTG TTGCCAGAAA GATATATAGA AGTAGAGGGA
GAAAGAGCAA AATTACTTCC CTATCCCATA ACCAAAAGGG TTGAGTTTAA GGAATGCAAG
GATGTAGAAA GCCTTAGAGC AAGGGCAGAA GAATATGCTG AAAAGGCTGC AAGTCCAAAG
GTATTTATAA CAATAGATTT TATGGAGCTA AGTAAGACAG AAGAGTATAA AAATTATAGT
CATCTTACTA AAGTAAGTGT TGGTGATTTT GTAAAGGTAA GAAATGAAAA GATAGCTGTA
ACTACTGATT TAAGAGTTAT AAAGAAAAAG ACAGACCTTA TAAATCCTAT AAATACAAAG
ATAGAGCTTG GTGATCCTTT AAACACAATT ATTGAAAAGC TGGATACCAG TAAGCTTTTA
GAGGAAATAA ACAGTGCAAT AAGCGGTACT TTAAGCAGTG TAATAATCAA GAAAAACAGC
GATACTATAA CAATCAGCAC TAGCAGCTAT CCAGCAATGA TTATAGGAAT AACAACAAAA
GCAGATACAA ACTTAAATTG TAATATCACC ATGACAGGGA AAGCTAGTGC AGACTGCACA
TTAACAATTC TGTTTTCCTT AGATGGAAAA TACTACGATT TTAAGCCAAT TCAAAAGTTA
GCATCAGGAG ATAATGTTAT AGGATTGCCC CTTCCAATGC CACAGGTAAC GGCAGGAGAC
CATACTTTTA TGGTAGAGAT GAAGGTTTCA AATGGAACAT TTACAATTGA AAAGAATAAT
CTGCAGGTAA GTATTGAGGG AAGAGACTTA GAGGGTGGAC TAAGTGCAAG TATACCAAGG
GCAGAGATTG TTTATACTTT CCTTTATAAT TTGTTCAATA TCAAGATTGG AACATACAGT
TTTAATACAG GACACAGTTT TAAGTATTAT TTAGACAACA ATGTGAGTGG AGTAGATAGC
TATAGTCTTG AGAATTTCAA TACAAGATTT AATATTGCTG CTATATCTCA TGGAAACCCA
AAGTTAACCT TAAATGTAAT GGGAATTATA GAAGAGTTTA ATAACAGTAA ATCAAGCAGC
TATGCTTTTG ACGCCGATTG GGTTGAGTTT AGTTCTGACT ACGATAAAAA CAAAGATGGA
ACCTATGACT TTTATAATAG GGTAACAATA AAGGAGCCCA TTTTAAAAAG AGAAGGCGGA
TATATTGAGG ACTTAGGAAA TGGCGTAATT TATGCTGCAA ATGTGCGGGA TACAACACTT
TATAAGGATT TGATTGCAAT AAATGCATAT CTTAAGCAAG GTTAG
 
Protein sequence
MIYVYDKKTT RGNFDNNGLA VLDECLMAEI NEELNGDYSL EIEYPAQSKK AQYLEELNII 
KADGQLFRIY KVERTQDKIS KVKVWARHIF YDLAFYFIES AKVLNANMKE ALEASIPPEL
QGLFLFKALE ENLAPFAVKE VNAVDAIFRL IEIYGGELFR DNFNIEIKES IGENNGILIK
YGKNINGMKV IEDTSELATR IYAVGANNLL LPERYIEVEG ERAKLLPYPI TKRVEFKECK
DVESLRARAE EYAEKAASPK VFITIDFMEL SKTEEYKNYS HLTKVSVGDF VKVRNEKIAV
TTDLRVIKKK TDLINPINTK IELGDPLNTI IEKLDTSKLL EEINSAISGT LSSVIIKKNS
DTITISTSSY PAMIIGITTK ADTNLNCNIT MTGKASADCT LTILFSLDGK YYDFKPIQKL
ASGDNVIGLP LPMPQVTAGD HTFMVEMKVS NGTFTIEKNN LQVSIEGRDL EGGLSASIPR
AEIVYTFLYN LFNIKIGTYS FNTGHSFKYY LDNNVSGVDS YSLENFNTRF NIAAISHGNP
KLTLNVMGII EEFNNSKSSS YAFDADWVEF SSDYDKNKDG TYDFYNRVTI KEPILKREGG
YIEDLGNGVI YAANVRDTTL YKDLIAINAY LKQG
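
The relationship between the gene sequence and the protein sequence above can be sketched in code. This is an illustrative example only, not the database's own pipeline: it assumes the sequence is pasted in as text with the space-separated 10-character blocks shown above, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ only in which codons may act as start codons, which does not matter here because this gene begins with ATG). The helper names `clean_sequence`, `translate`, and `gc_content` are hypothetical.

```python
# Codon table in the conventional TCAG ordering; table 11 shares these
# amino-acid assignments with the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used to group the sequence into blocks."""
    return "".join(raw.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # First 30 bases of the gene sequence listed above.
    first_blocks = clean_sequence("ATGATCTATG TATATGATAA GAAGACAACT")
    print(translate(first_blocks))  # MIYVYDKKTT
```

Applying `translate` to the full cleaned gene sequence allows a comparison against the listed protein sequence, and `gc_content` gives a fraction that can be compared with the GC content field.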