Gene Cthe_3124 details


Gene Information

Locus tag: Cthe_3124
Symbol:
ID: 4809687
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 3690976
End bp: 3692613
Gene Length: 1638 bp
Protein Length: 545 aa
Translation table: 11
GC content: 45%
IMG OID: 640108557
Product: AMP-dependent synthetase and ligase
Protein accession: YP_001039512
Protein GI: 125975602
COG category: [I] Lipid transport and metabolism; [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II
TIGRFAM ID:
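
The start and end coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the gene length includes the stop codon; neither convention is stated on this page, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Number of amino acids for an unspliced CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(3690976, 3692613)
protein_aa = expected_protein_length(length_bp)
print(length_bp, protein_aa)  # 1638 545
```

Under those assumptions the computed values agree with the 1638 bp and 545 aa reported in the record.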


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCGATTA CTGAATTTTT GGAACGCAAT GCTGCGTTGT ACGGTAGTGA AAAGTGCCTG 
ACGGAGATAA ATCTTGACCT TCAGGAGAGT CATAATGTCA TATGGCGTGA GTATGAACTC
ATCGAAAACA ATCCGGCAGG GGAATATCGC AGAGATATGA CCTGGAAGGT TTTTGACGAA
AAAGCAAATC GGTTTGCAAA TCTCTTGATT AAAAGAGGAA TCAAAAAGGG CGACAAGGTG
GCTATACTTT TAATGAACTG TTTGGAATGG CTGCCTATTT ATTTTGGGAT ATTGAAGGCG
GGTGCTGTCG CAGTACCGTT GAATTTCCGC TATACAGCCG AGGAAATAAA ATACTGTCTT
GAGTTGTCCG ATTCCATAGC ATTGGTTTTT GGTCCTGAAT TTATCGGTCG CATAGAGAAT
ATTTACGACC AGATAATACC CAAAATAAAA TTATTGTTGT TTGCCGGAGA AAACCGTCCT
TCCTTTGCCG AAAGTTATGA CCGTCTGACG GCAAATTGTC CTTCCGAAGC ACCAAAGGTT
GAAATTACCG ATGATGACGA TGCGGCAATA TATTTTTCAT CAGGAACCAC AGGATTTCCA
AAGGCAATTT TGCATGCCCA CAGAAGCCTT GTGTCGGCTT GCTATACGGA GCAGAAACAT
CATGGCCAGA CACGGGAGGA TAATTTTTTG TGTATTCCGC CCCTTTATCA TACCGGTGCG
AAAATGCACT GGTTCGGAAG TTTGCTGTCA GGCAGCAAAG CAGTATTGCT AAGGGGTATT
AAGCCGGAAT GGATATTAAG GACGGTTTCA GAAGAGAAAA TTACAATTGT ATGGCTTCTT
GTCCCGTGGG CCCAGGATAT CCTGGATGCC ATTGAAAGAG GAGACGTAAA ACTGGAAGAT
TATGACCTTT CCCAGTGGAG ACTTATGCAT ATCGGTGCAC AGCCGGTGCC TCCCAGCTTG
ATTCGTCGCT GGAAAAAGTA CTTCCCGCAT CATCTTTACG ATACCAACTA CGGCTTGAGT
GAGTCTGCGG GACCGGGATG TGTGCATCTG GGTGTGGAGA ATATTCACAA AGTAGGTGCC
ATAGGCTTGC CGGGTTATAA TTGGGAGGCT AAAATTGTGG ATGAAAACGG ATGCCCTGTA
AAACAGGGAG AAGTAGGTGA ACTGGCTGTC AAAGGTCCCG GTGTGATGAA GTGCTATTAC
AAAGATCCGG AGGCTACTGC CGCAGTGCTG AAAGACGGCT GGCTTTTAAC CGGTGACATG
GCGAGAATGG ATGAAGATGG ATTCATTTAT CTGGTGGACC GCAAGAAAGA CGTAATTATC
AGCGGAGGAG AAAATATATA TCCTGTGCAG ATTGAGGATT TCCTAAGGTC GCATGAGGCA
ATCAAGGATG CGGCGGTAAT TGGACTTCCG GACAAGCGCC TTGGTGAAAT AGCGGCCGCC
ATTATAGAAT TGAAACCGGG CTTTGAGTGC ACTGAAGAGG AAATAAACAA ATTCTGTCTC
GTACTGCCGC GTTACAAACG ACCTCGTAAA ATTATTTTTG ACAAAGTACC GAGAAATCCA
ACAGGAAAGA TTGAAAAGCC GCGTTTGAGG GAAAAATATG GTGTGGTAGC TTTGGTGGAA
GCAGAAACAA TAAGCTGA
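
The gene sequence above is laid out in space-separated blocks of ten bases, so it needs to be cleaned before its length or GC content can be computed. The following is a minimal sketch of that cleanup; the helper names are hypothetical, and the example uses only the first line of the sequence rather than the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence only, copied from the record above.
first_line = "ATGCCGATTA CTGAATTTTT GGAACGCAAT GCTGCGTTGT ACGGTAGTGA AAAGTGCCTG"
seq = clean_sequence(first_line)
print(len(seq), gc_fraction(seq))
```

Pasting the complete sequence into `clean_sequence` should give the values to compare against the reported gene length and GC content.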
 
Protein sequence
MPITEFLERN AALYGSEKCL TEINLDLQES HNVIWREYEL IENNPAGEYR RDMTWKVFDE 
KANRFANLLI KRGIKKGDKV AILLMNCLEW LPIYFGILKA GAVAVPLNFR YTAEEIKYCL
ELSDSIALVF GPEFIGRIEN IYDQIIPKIK LLLFAGENRP SFAESYDRLT ANCPSEAPKV
EITDDDDAAI YFSSGTTGFP KAILHAHRSL VSACYTEQKH HGQTREDNFL CIPPLYHTGA
KMHWFGSLLS GSKAVLLRGI KPEWILRTVS EEKITIVWLL VPWAQDILDA IERGDVKLED
YDLSQWRLMH IGAQPVPPSL IRRWKKYFPH HLYDTNYGLS ESAGPGCVHL GVENIHKVGA
IGLPGYNWEA KIVDENGCPV KQGEVGELAV KGPGVMKCYY KDPEATAAVL KDGWLLTGDM
ARMDEDGFIY LVDRKKDVII SGGENIYPVQ IEDFLRSHEA IKDAAVIGLP DKRLGEIAAA
IIELKPGFEC TEEEINKFCL VLPRYKRPRK IIFDKVPRNP TGKIEKPRLR EKYGVVALVE
AETIS
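
The protein sequence can be checked against the gene sequence by translating the DNA codon by codon. The sketch below builds the codon table from the compact NCBI-style amino acid string; translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in its permitted start codons, which does not matter here because this gene begins with ATG. The `translate` helper is illustrative and written for this example, not taken from any library.

```python
from itertools import product

# Codons enumerated in TCAG order, with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    cleaned = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(cleaned) - 2, 3):
        aa = CODON_TABLE[cleaned[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First and last blocks of the gene sequence from the record above.
print(translate("ATGCCGATTA CTGAATTTTT GGAACGCAAT"))  # MPITEFLERN
print(translate("GCAGAAACAA TAAGCTGA"))               # AETIS
```

The two fragments reproduce the first ten and last five residues of the listed protein sequence. The second fragment starts on a codon boundary only because the preceding 1620 bases are a multiple of three.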