Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1232 |
Symbol | |
ID | 4809924 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 1474841 |
End bp | 1476571 |
Gene Length | 1731 bp |
Protein Length | 576 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 640106655 |
Product | AMP-dependent synthetase and ligase |
Protein accession | YP_001037657 |
Protein GI | 125973747 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG1022] Long-chain acyl-CoA synthetases (AMP-forming) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAACTT CACCGGTTTT CGAAGTAAGA ACGATTAAAA ACTTAAGAGA CATGATTGAG CAAAGCAGCA AACTGTTTGC CAACAAAGAT GCTTTTCGCG TGAAAACAAA AGATAATTCA TATAGAGGAA TTACTTTCGC CGAATTCAAG AATGATATTG ATGCTTTCGG AACAGCCCTG CTTGATTTGT TGGGAACAGA AAAAGGATTT GTCGCTGTTA TAGGTGAAAA CAGGTATGAA TGGTGTGTTA CCTATCTTGC AACTATAAAC GGCGTCGGAG TAGTTATACC ACTGGACAAA GAACTACCCC TTCCCGAACT GGAAAACTTG TTAAAACAGT CCAATGCCAA TGCCATTGTC TACTCGGGAA AATTTCATGA TGCAATTAAA GAAATGTCTT CCCGTTTAAG CAATATCAAA TATTTCATTA ATATGAACAC CAATGAGCAT GAGGATGATA AATTTTTATC CTTTTGGGTT CTCCTTGAAA AAGGAAAAAA ACTTTTGGAA TCAGGAAAAA AGGACTATCT TAATGCTCCC ATAGATGAAA ACGCAATGAG TGCAATGATT TTTACTTCGG GTACAACGGG CCAGGCTAAA GCCGTTATGT TGTCCCACAA AAATATTTGC TCAAATATGA TGGCCGTTTC AGCTTCTGTT TATATGGACA GCACAGATTC CGTGCTTTCA ATCCTTCCCT TGCACCATAC CTATGAATGC ACCGCAGGTT TCCTCACTAT GATATATAAC GGTGCAACAA TAACTTTCAA TGAGGGACTA AAATACATCG GCAAAAATCT CAAAGAGGCA CAACCGACAA TCCTTATTCT CGTACCTCTT ATTCTGGAAA GCATGTACAA TAAAATATGG GAACAGGCTT CAAAAGACAA AAGCCTTAAA TTTAAGCTGA AAGCCGGACT TTTTATTAGT AATTTGCTAT ATAAGGTTTT TAAAATTGAC ATACGAAGAA AGTTGTTTAA ATCCGTAATT GACAATGTTG GCGGTAAATT AAGGCTGGTC ATTTCAGGTG CTGCGGCCCT TGACCCTGAA GTGGCAAAAG GATTTGAGGC CATGGGTATA AAAGTCCTTC AGGGATATGG TCTTACCGAA GCTTCTCCAA TAGTTGCAGT GAATCGCGAC AAGTCGTACA GACACGATTC AGTAGGACTT CCTCTTCCCG GGCTTGACGT CGAAATCATC AACCCCGACA AAGAGGGATT TGGAGAAATA ATAGTCAAAG GTGATAGTGT AATGCTTGGC TATTACAATA ATGATGACGC CACCAAAGCA GTTCTTAAAG ACGGATGGCT CTATACCGGA GACCTTGGCC GCATGGATGA AAAGGGCTTT ATATACATTA CCGGACGCAA GAAAAACATT ATAGTAACCA AAACAGGAAA GAATATTTTC CCTGAAGAAG TTGAAGCCTA TCTTAACAAA AGCCCATATA TTAAAGAATC TCTGGTTTCG GGAAGAGAAA ACGATAAAAA CGATGAAACA ATAGTAGTAG CTCAAATTGT ACCCGATATG GATGCAATCA AAGCCAAGCT TAAAACGGAC ACAGTTCCGT CACCCGAAGA GGTTTACAAA TTGATTAAGG CAGAAATTAG GGCTATAAAC AAAAACATGC CGGTCTATAA AAGAGTTGTT GATATAACCA TTCGTGAAAA CGAATTTGCC AAAACATCTT CCAAGAAGAT TAAACGATAT CTTGAGAAAA CTAATGTATA A
|
Protein sequence | MKTSPVFEVR TIKNLRDMIE QSSKLFANKD AFRVKTKDNS YRGITFAEFK NDIDAFGTAL LDLLGTEKGF VAVIGENRYE WCVTYLATIN GVGVVIPLDK ELPLPELENL LKQSNANAIV YSGKFHDAIK EMSSRLSNIK YFINMNTNEH EDDKFLSFWV LLEKGKKLLE SGKKDYLNAP IDENAMSAMI FTSGTTGQAK AVMLSHKNIC SNMMAVSASV YMDSTDSVLS ILPLHHTYEC TAGFLTMIYN GATITFNEGL KYIGKNLKEA QPTILILVPL ILESMYNKIW EQASKDKSLK FKLKAGLFIS NLLYKVFKID IRRKLFKSVI DNVGGKLRLV ISGAAALDPE VAKGFEAMGI KVLQGYGLTE ASPIVAVNRD KSYRHDSVGL PLPGLDVEII NPDKEGFGEI IVKGDSVMLG YYNNDDATKA VLKDGWLYTG DLGRMDEKGF IYITGRKKNI IVTKTGKNIF PEEVEAYLNK SPYIKESLVS GRENDKNDET IVVAQIVPDM DAIKAKLKTD TVPSPEEVYK LIKAEIRAIN KNMPVYKRVV DITIRENEFA KTSSKKIKRY LEKTNV
|
| |