Gene Cthe_1185 details


Gene Information

Locus tag: Cthe_1185
Symbol:
ID: 4810137
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 1413880
End bp: 1415160
Gene Length: 1281 bp
Protein Length: 426 aa
Translation table: 11
GC content: 39%
IMG OID: 640106607
Product: hypothetical protein
Protein accession: YP_001037610
Protein GI: 125973700
COG category: [R] General function prediction only
COG ID: [COG4908] Uncharacterized protein containing a NRPS condensation (elongation) domain
TIGRFAM ID:
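
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 1413880
end_bp = 1415160

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, "bp")
print(protein_length_aa, "aa")
```

Under these assumptions the computed values agree with the listed Gene Length (1281 bp) and Protein Length (426 aa).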


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000105816
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATTATC CTAAAAAAGT GGAATGGAGC AGGCTGGACA ATGCTTCAAA ATATTTTGCG 
GCAACATACA GTGAGAGGGA TGAAAAGGTA TTCAGAATCT CATGTGAGTT GTTTGAGGAA
GTAGATCCGG AAATTTTGCA ACAAGCTCTT GATGAGACTA TTGAGAGATT TCCGTATTAT
AAATCCGTTT TAAGAAGAGG AATATTTTGG TATTACCTTG AGGACAGCGA TATCAGGCCT
TTGGTTGAAA AAGAGGATAA ACCGGTCTGC GCACCGATTT ACAGAAAATA CGAAAGAAAT
CTGTTGTTTA GAGTTCTTTA CTACAACAAA AGAATCAGTC TCGAAGTATT TCATGCCCTT
TCGGACGGAA CAGGGGCTCT TAGGTTTATG ATGACACTGG TTTACCATTA TTTGACGATC
AAACACAAAG ATGAGTTTTC CGGCAAAATA CCCGAATTAA ATTACAATGC ATCCATCGGC
GAAAAAAAGG ACGACAGTTT CGAACGGTAT TATCAAGGCA GGCGTTTTAA AAAGCAGGCA
AGGGAAAAGA AAGAAAAAAA GCCGTTTAAG AGAGTATATC GCATACGGGG AACCAGAATT
GAGGAAAACA GAATTAAGAT AATAGAAGGC ACAATGTCTG CAAAAGCCGT ATTAAATGAA
GCACATAAAT ATAACACGAC AATGACCGTG TTTTTATCGG CGCTGTTGCT TCGCTCAATT
TACATGGATA TGCCGGCCCG AAAAAAAGAC TATTCTTTGG TGTTAATAGT ACCTATTAAC
CTCAGACAGT TTTTCAAATC GGAAACGGCA AGCAATTTTT TCAGTACGAT GAGCATTGAG
TATAAGTTTA CCGAAGAAGG CATGGAGCTT GATAAAATAA TCGCAAGTCT GAATGAGAGT
TTTAAAAAAG AACTTACGGA AGAAAGGCTG AGCGAGAAGA TTAACTGGCA AATGTCCATT
GAAAAAAATC CTTTTGCCAG AATTATGCCC CTGCCGCTTA AAAATCTCTT TATTCGTATT
GCTGATGAAG TGGTGGAAAG CAGAACCACC GCATGCATAT CCAACTTGGG CAAAATACAA
ATGCCTCCTG AGTTTGAAAG GTATATCCGA CAGTTCAGTG TCGTACCCAA TGTCAGAAGA
CCTCAGATTG CGGTATGTAC ATACGGGGAC AAAATGGCGG TAGCTTTTGG TTCGCCGTTC
AAAGAAACCG AGATACAAAA AAATTTTTTC AAATCCCTGT CGGAAATGGG GATTAAAATT
GAAATAGTGT CAAACATGTA G
 
Protein sequence
MNYPKKVEWS RLDNASKYFA ATYSERDEKV FRISCELFEE VDPEILQQAL DETIERFPYY 
KSVLRRGIFW YYLEDSDIRP LVEKEDKPVC APIYRKYERN LLFRVLYYNK RISLEVFHAL
SDGTGALRFM MTLVYHYLTI KHKDEFSGKI PELNYNASIG EKKDDSFERY YQGRRFKKQA
REKKEKKPFK RVYRIRGTRI EENRIKIIEG TMSAKAVLNE AHKYNTTMTV FLSALLLRSI
YMDMPARKKD YSLVLIVPIN LRQFFKSETA SNFFSTMSIE YKFTEEGMEL DKIIASLNES
FKKELTEERL SEKINWQMSI EKNPFARIMP LPLKNLFIRI ADEVVESRTT ACISNLGKIQ
MPPEFERYIR QFSVVPNVRR PQIAVCTYGD KMAVAFGSPF KETEIQKNFF KSLSEMGIKI
EIVSNM