Gene Cthe_2423 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2423 
Symbol 
ID4808139 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2893768 
End bp2896764 
Gene Length2997 bp 
Protein Length998 aa 
Translation table11 
GC content37% 
IMG OID640107837 
Producthypothetical protein 
Protein accessionYP_001038818 
Protein GI125974908 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAGAA AGTTTTATCT TCGAACGTTT GCGCTTCTGA TAGTTCTCTG CGTTTTTGTC 
AACCTTTCAT TAATAGCAGG TTTTAATAGG CTTTTTGTAG AAAATGTTTA TGCAGATGAA
GGTGTTGAAT TTCCGGTTTC AAGCAACAAG GTTTATGTAA CATTGAAAAA CATCAAAACA
GGTGTGCCAT CAGATACTAT AGCCTTGAAG ATTGGTATTA TCAATTTAAA TAAAGCAATA
AACATAAATT TGAATGATAT TAAACTAAGA TATTACTTTA CTAATGACGG CTGCTCCCCT
ATACAGGTTA ATATAAAATT ATTTGGCACA GAAACGGAGA GTTTCAACCC TGAACTGGTT
AAAACTTCAG TAGTGACAGG TTTGTCCTAT CCGGGTGCTG ACAGCTATGT TGAAATAGGA
TTTACCGGTT CTGTAGAGTT AAATTGTGAT CGCAAACCTA TATACATTGA ACTTGATATC
AAAGAAAACA GCCCTGATCG TAACTTCGAT CAATCCAATG ATTTTTCCAA TAATAATTAC
TATACTCCCT TTTTGCCTGA AGAGTTTTTT GCATCGGGAA GAGTGCCGGT TTTCATGTAC
GATCCAAAAA AACGCGACTA TGTGCTTTTG ACCGGTGTAC TCCCCAGCGA AACATCCAAG
ATTCCGAATC CAACTCCGAC GCCTGTAGTA TCTCCTTCTC CTTCACCAAT AGTACCCGAA
GGTGAAAAAA TACTTGCTAC TGCTTCAGGA AATATCATTA TACCAAAACC AAGTCCTGCC
CCTGTTGGAT TATTTGAACC GGCTGATATT GGATTCCCGC ATGAAGGTTC CATAGATGTG
GGTTTCTCTC CACGCAAAAA AGAAGCCTTG ATTTTGCTTG ATTCATCTTA TGAATCCAAC
GATGTAGATG ATGGACCGAC AGGTATTTTT AAATATTGTC TGTTCTCCAG TGGTGATTCT
TTATACCAGG GAGACAATAT AACAATTGAA GGGGATGTGT TTACAAGAAA TACAATGAAT
GTTACGACTT CAGGTATTAA AATAACAGGA AAAGTTGAAT ACTCATTTAG AGATCGAAAC
TCATATGCTG GTCCTTTGGG TGGTAAAGAA ATAGAGATGG AACCGGCTGA TGCAGAGAGG
TATGATCGTT ATCTTGTACC TGATGAAAAT GACGACTATT CAGCATTATT TTCAATGATA
CAGACTAAGG TTACAAATCT TCCTGAAAAA GATGACGCTA AGTTTTTAAT TACAGAAGAC
ACAGTCGAAA AATACATTTC TCCCGCAAAT CCTAATTGGC GGAAAAATGC AGTACAGTTT
TACTATGATG ACAATGACAA ATCCGTATCT ATTGAATACC GTTCAAAAGA AGCCGGAGGA
TTTAGTCGCG AATACGGTAA CTCCTTACCG CAATATTCTA TAAAACAAGA CAAAGGCGAT
GCATTTGTAC TTAAATCCAA TATGTTTTTT GACGGCAATC TTATAATAAG CGTCAAAGGT
ATTAGGCAAG AATTAATTGA TGGTGCCACA TCTGCTTTTA TATACGCTTA TGGCGATATA
ATTCTACAAG GAAATGGAGC AACGTTTAAT GATGTGTATC TTATAACAAA ATATGGAAAC
ATTTATATTG AAACAGATAG CTGTAATGTC AACGGCATTG CATTTGCTCC AAACGGAAAA
ATTGTCATTA ACGGTCGAAG CAACAATCTA CAAGGTAGTT TCGTTGCAAG AAAAATCCAA
TGTGAGCCAG GTAATAGTGT TTTTAAAGGA CCCACTGATG ACCAATTAGA AGATATTGAA
GATGCCCTGA AAAGTACAGA AGGTTTTGAC ACCATTAGAA ACTCAATAGC CTTACTTCCC
TATATATTTG ATGAATATAC AAGAGCTGGT ATCATAACCT ATTCGGATTA TGCAAACATA
AATGACTCAC CGATTAACGA TAGTTGGAAA TTTTTTGATG CTGCTACTGA GCGGGAAGAA
TTTTTAAATT ATACTTTGAC ACTGTCAGTG GACGAGGACA GCAAAAGAAG CAATCTTGGT
GACGGACTGA GAAAAGCTTT GGATGTATTC AATAAATACT CAGATCCCGA AGCGGACAAA
TATATATACA TCTTTACAAG TCTTGATCCA AATGCATACA CCAGATCGCA TCTTTCTGAC
GGACTATTTG AAACTGACCC GAAAGTTGAT ACTAATGCGG CCTATATTTA TGATGAAACT
GTCAACGGGG AAGGAAACCA ATATGTTAGA GAAATAATGA AATTAATTGA AAAGTATAAT
AATAATCACG TCAATGGCAA AATAAAACTT ATACTTGTTG ATTTGACTAA TTATATTAAA
GAATTCAATA TAAAGAACGG TGCAAAAGAA TCTGAAATAG AAGTTGACGT TTTAACAAAT
CTTGCCTATG ACCTGGGAAT TGACATTTCT GACTCTGATG AAAAAGCTTA CTACTGTCCT
TCATTAGAGG ATATACAATC ACTTTCAATT ATAAACGAAT TGGCATATCG TTCAAACAGT
ATGCCGCCTA AACTTGCTGT GGAAAACTTG AAAATCAGTT CAGCACAATT TGAACTGTCA
CTGCCAAGTT ACATCAAACC GGTTGAATTG TTCTTCAAGA GGGCAAGTAA TACAAAAGAG
TCAATTGTTA ACTTGTCAGG ACTTGCAGCA TCTGGTGGCA AATACAATAT AACCTATACT
TTCAGCGGCG ATGAGCTGGC AACCCTTACA AGGATTAGCG ACGGCTTGAA ATATGACTTG
GAAAGCAACG GATTATACAT GACCCTGATT GTCAACAGTA GTGACGATTG GGACGATGGA
GATAATCCTC TGACCGTTAA AGGTACCGTT GACATTGCCG GACCTAAAAT AACATATAAA
TTGTTTGATG ACAAAAATAA TGACGGCGTA AGGTCCGCAG GTGAAGCTGA ATTTGAAGTT
GTAGTGCCGT TCGACAATAT TAAATTCAAT GTAGAGTACA AGAAGGATAT CAACTAA
 
Protein sequence
MKRKFYLRTF ALLIVLCVFV NLSLIAGFNR LFVENVYADE GVEFPVSSNK VYVTLKNIKT 
GVPSDTIALK IGIINLNKAI NINLNDIKLR YYFTNDGCSP IQVNIKLFGT ETESFNPELV
KTSVVTGLSY PGADSYVEIG FTGSVELNCD RKPIYIELDI KENSPDRNFD QSNDFSNNNY
YTPFLPEEFF ASGRVPVFMY DPKKRDYVLL TGVLPSETSK IPNPTPTPVV SPSPSPIVPE
GEKILATASG NIIIPKPSPA PVGLFEPADI GFPHEGSIDV GFSPRKKEAL ILLDSSYESN
DVDDGPTGIF KYCLFSSGDS LYQGDNITIE GDVFTRNTMN VTTSGIKITG KVEYSFRDRN
SYAGPLGGKE IEMEPADAER YDRYLVPDEN DDYSALFSMI QTKVTNLPEK DDAKFLITED
TVEKYISPAN PNWRKNAVQF YYDDNDKSVS IEYRSKEAGG FSREYGNSLP QYSIKQDKGD
AFVLKSNMFF DGNLIISVKG IRQELIDGAT SAFIYAYGDI ILQGNGATFN DVYLITKYGN
IYIETDSCNV NGIAFAPNGK IVINGRSNNL QGSFVARKIQ CEPGNSVFKG PTDDQLEDIE
DALKSTEGFD TIRNSIALLP YIFDEYTRAG IITYSDYANI NDSPINDSWK FFDAATEREE
FLNYTLTLSV DEDSKRSNLG DGLRKALDVF NKYSDPEADK YIYIFTSLDP NAYTRSHLSD
GLFETDPKVD TNAAYIYDET VNGEGNQYVR EIMKLIEKYN NNHVNGKIKL ILVDLTNYIK
EFNIKNGAKE SEIEVDVLTN LAYDLGIDIS DSDEKAYYCP SLEDIQSLSI INELAYRSNS
MPPKLAVENL KISSAQFELS LPSYIKPVEL FFKRASNTKE SIVNLSGLAA SGGKYNITYT
FSGDELATLT RISDGLKYDL ESNGLYMTLI VNSSDDWDDG DNPLTVKGTV DIAGPKITYK
LFDDKNNDGV RSAGEAEFEV VVPFDNIKFN VEYKKDIN