Gene Cthe_1737 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1737 
Symbol 
ID4810167 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2058779 
End bp2060371 
Gene Length1593 bp 
Protein Length530 aa 
Translation table11 
GC content32% 
IMG OID640107150 
Productphage NTP-binding protein 
Protein accessionYP_001038151 
Protein GI125974241 
COG category 
COG ID 
TIGRFAM ID[TIGR01618] phage nucleotide-binding protein 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCAATTTT CCCACAGCAG AGTTGAGTGC TTTGAAAAGT GCAAGTACCA ATTCAAACTG 
CGATATAAAG ACAAAGTAAG GACAATACCT TCACCGGCAG CAGATAATGC TTTAATTGCA
GGTTCAGCAT TACATCTTGG GATTGAGAAA GGTATAGAGG CAATGGAGCA ATATTATTAT
AATCAGTATC CAGTTATTAC TGATGCTCAT GTAAATGAAG TTATTAAGCT TACATCATTA
GTGAAAAAGG CTCAGATTGT AATAAATACA ATGCTTCATA ATAAAGAGCC GACAGAAAAA
TATGAGTTTA AAATTGATTT TCCAGAATTT ATAGGATTTG TAGACTTTAT TATCCAAACA
CAGGATGGAA GTCTTAGCAT TTATGATTTT AAATACAGCA ATAATATAGA GCATTATCTG
GAGTCAAAAC AGCTGCACTT ATATAGGTAT TACTTAGAAA AACTTGGATT TAAGGTATCA
GAAATAGGAT TTATTTTTAT TCCTAAAACA GCTATAAGGC AAAGGAAAAC TGAGGATTTA
TATCAGTTTA GAAAAAGGCT TCATAAAACT TTAGAAGCTA TGGAAGTTAA GGTGATTCAA
ATTCCATATG ATGAGACTAA GGTTCAAGAA TTTAAGCTAA GATGCCGGGA AATTATTAAT
GAAAAAGAAT ATGAAAAAAC ACCATCAAGA CTTTGTGATT GGTGTGAATA TCAAAATTAT
TGTGAAGGAG GACAAACAGA TATGTTATTA CCAGAAAATG TTAGGAGAGA ACTTCAAATA
GACAGGTATC CGGACATGTG GATATATGCC GACAGTTACG TTGGAAAATC AACTTTTGTT
GATCAGTTCG ATGATTTATT ATTCTTAAAT ACTGATGGAA ACACAGATAA CACAACAAGT
CCAGTTATAA AAATAGCTGA TGAAGTAACT TTTGAAGGAA GACTTAAAAA AGTCAAAATG
GCTTGGGAAG TATTTTTAGA TGTTATTACA GAGCTTGAAA AGAAAGATAA CACTTTCAAA
AGAGTGTGCA TTGATTTAGT TGAGGACTTA TATGAACACT GCAGGCTTTA TATGTATAAC
AAGTTAGGAA TAGACCATGA GCAGGATGCA GGTTTTGGTA AAGGATGGGA TATGGTTAGA
ACTGAATATT TATCAGCCAT AAAAAGACTT AAGAATTTAG GATATCAAAT AATTTATATT
TCTAAGGAAG TAACTACAGA AATAACACTA AAAAATGGAG CTAAGCTTAC AACCATAAAA
CCTAATATTA ATGAAAAAAT AGCTAATGTT TTAGCAGGAA CAGTAGATTT AACTGTAAGA
GCCTTTATGG ATGGAGAAGA AAGATACTTG CAGCTTGAAA AGAAAGAAAA TATCTTCGGC
GGTGGCAGAT TCAATTTTAA AGTTCCAAAA GTAGAGCTTG ATAAGGGTGA GTTTATGAAA
GCTTTAGAAG ATGCTCAGGA AGGTGTAAAA ACTTATTCTA AATCAGAAAC AGATACTTCT
GATAATACAG CAGCAGTAGA TAATACAACT GTATTAGAAA CATCTGAAGT TAAAGAAGAA
TCAGTAAAAA AGAGCAGACG CTCTAGAAAA TAA
 
Protein sequence
MQFSHSRVEC FEKCKYQFKL RYKDKVRTIP SPAADNALIA GSALHLGIEK GIEAMEQYYY 
NQYPVITDAH VNEVIKLTSL VKKAQIVINT MLHNKEPTEK YEFKIDFPEF IGFVDFIIQT
QDGSLSIYDF KYSNNIEHYL ESKQLHLYRY YLEKLGFKVS EIGFIFIPKT AIRQRKTEDL
YQFRKRLHKT LEAMEVKVIQ IPYDETKVQE FKLRCREIIN EKEYEKTPSR LCDWCEYQNY
CEGGQTDMLL PENVRRELQI DRYPDMWIYA DSYVGKSTFV DQFDDLLFLN TDGNTDNTTS
PVIKIADEVT FEGRLKKVKM AWEVFLDVIT ELEKKDNTFK RVCIDLVEDL YEHCRLYMYN
KLGIDHEQDA GFGKGWDMVR TEYLSAIKRL KNLGYQIIYI SKEVTTEITL KNGAKLTTIK
PNINEKIANV LAGTVDLTVR AFMDGEERYL QLEKKENIFG GGRFNFKVPK VELDKGEFMK
ALEDAQEGVK TYSKSETDTS DNTAAVDNTT VLETSEVKEE SVKKSRRSRK