Gene Cthe_1232 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1232 
Symbol 
ID4809924 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1474841 
End bp1476571 
Gene Length1731 bp 
Protein Length576 aa 
Translation table11 
GC content39% 
IMG OID640106655 
ProductAMP-dependent synthetase and ligase 
Protein accessionYP_001037657 
Protein GI125973747 
COG category[I] Lipid transport and metabolism 
COG ID[COG1022] Long-chain acyl-CoA synthetases (AMP-forming) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAACTT CACCGGTTTT CGAAGTAAGA ACGATTAAAA ACTTAAGAGA CATGATTGAG 
CAAAGCAGCA AACTGTTTGC CAACAAAGAT GCTTTTCGCG TGAAAACAAA AGATAATTCA
TATAGAGGAA TTACTTTCGC CGAATTCAAG AATGATATTG ATGCTTTCGG AACAGCCCTG
CTTGATTTGT TGGGAACAGA AAAAGGATTT GTCGCTGTTA TAGGTGAAAA CAGGTATGAA
TGGTGTGTTA CCTATCTTGC AACTATAAAC GGCGTCGGAG TAGTTATACC ACTGGACAAA
GAACTACCCC TTCCCGAACT GGAAAACTTG TTAAAACAGT CCAATGCCAA TGCCATTGTC
TACTCGGGAA AATTTCATGA TGCAATTAAA GAAATGTCTT CCCGTTTAAG CAATATCAAA
TATTTCATTA ATATGAACAC CAATGAGCAT GAGGATGATA AATTTTTATC CTTTTGGGTT
CTCCTTGAAA AAGGAAAAAA ACTTTTGGAA TCAGGAAAAA AGGACTATCT TAATGCTCCC
ATAGATGAAA ACGCAATGAG TGCAATGATT TTTACTTCGG GTACAACGGG CCAGGCTAAA
GCCGTTATGT TGTCCCACAA AAATATTTGC TCAAATATGA TGGCCGTTTC AGCTTCTGTT
TATATGGACA GCACAGATTC CGTGCTTTCA ATCCTTCCCT TGCACCATAC CTATGAATGC
ACCGCAGGTT TCCTCACTAT GATATATAAC GGTGCAACAA TAACTTTCAA TGAGGGACTA
AAATACATCG GCAAAAATCT CAAAGAGGCA CAACCGACAA TCCTTATTCT CGTACCTCTT
ATTCTGGAAA GCATGTACAA TAAAATATGG GAACAGGCTT CAAAAGACAA AAGCCTTAAA
TTTAAGCTGA AAGCCGGACT TTTTATTAGT AATTTGCTAT ATAAGGTTTT TAAAATTGAC
ATACGAAGAA AGTTGTTTAA ATCCGTAATT GACAATGTTG GCGGTAAATT AAGGCTGGTC
ATTTCAGGTG CTGCGGCCCT TGACCCTGAA GTGGCAAAAG GATTTGAGGC CATGGGTATA
AAAGTCCTTC AGGGATATGG TCTTACCGAA GCTTCTCCAA TAGTTGCAGT GAATCGCGAC
AAGTCGTACA GACACGATTC AGTAGGACTT CCTCTTCCCG GGCTTGACGT CGAAATCATC
AACCCCGACA AAGAGGGATT TGGAGAAATA ATAGTCAAAG GTGATAGTGT AATGCTTGGC
TATTACAATA ATGATGACGC CACCAAAGCA GTTCTTAAAG ACGGATGGCT CTATACCGGA
GACCTTGGCC GCATGGATGA AAAGGGCTTT ATATACATTA CCGGACGCAA GAAAAACATT
ATAGTAACCA AAACAGGAAA GAATATTTTC CCTGAAGAAG TTGAAGCCTA TCTTAACAAA
AGCCCATATA TTAAAGAATC TCTGGTTTCG GGAAGAGAAA ACGATAAAAA CGATGAAACA
ATAGTAGTAG CTCAAATTGT ACCCGATATG GATGCAATCA AAGCCAAGCT TAAAACGGAC
ACAGTTCCGT CACCCGAAGA GGTTTACAAA TTGATTAAGG CAGAAATTAG GGCTATAAAC
AAAAACATGC CGGTCTATAA AAGAGTTGTT GATATAACCA TTCGTGAAAA CGAATTTGCC
AAAACATCTT CCAAGAAGAT TAAACGATAT CTTGAGAAAA CTAATGTATA A
 
Protein sequence
MKTSPVFEVR TIKNLRDMIE QSSKLFANKD AFRVKTKDNS YRGITFAEFK NDIDAFGTAL 
LDLLGTEKGF VAVIGENRYE WCVTYLATIN GVGVVIPLDK ELPLPELENL LKQSNANAIV
YSGKFHDAIK EMSSRLSNIK YFINMNTNEH EDDKFLSFWV LLEKGKKLLE SGKKDYLNAP
IDENAMSAMI FTSGTTGQAK AVMLSHKNIC SNMMAVSASV YMDSTDSVLS ILPLHHTYEC
TAGFLTMIYN GATITFNEGL KYIGKNLKEA QPTILILVPL ILESMYNKIW EQASKDKSLK
FKLKAGLFIS NLLYKVFKID IRRKLFKSVI DNVGGKLRLV ISGAAALDPE VAKGFEAMGI
KVLQGYGLTE ASPIVAVNRD KSYRHDSVGL PLPGLDVEII NPDKEGFGEI IVKGDSVMLG
YYNNDDATKA VLKDGWLYTG DLGRMDEKGF IYITGRKKNI IVTKTGKNIF PEEVEAYLNK
SPYIKESLVS GRENDKNDET IVVAQIVPDM DAIKAKLKTD TVPSPEEVYK LIKAEIRAIN
KNMPVYKRVV DITIRENEFA KTSSKKIKRY LEKTNV