Gene Cthe_0133 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0133 
Symbol 
ID4808691 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp162114 
End bp163598 
Gene Length1485 bp 
Protein Length494 aa 
Translation table11 
GC content44% 
IMG OID640105544 
ProductAMP-dependent synthetase and ligase 
Protein accessionYP_001036567 
Protein GI125972657 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATTATT GCAATTGGTT GTTTGACACA AAAGACAAGG ACCTTGAAAG AGAAATCATC 
ATTGACTGGG AAACCGGAAA AAGGCTGACT TTTAAAGGAC TTCAGACGGA AGTGGTAAGG
CTTGCAAATT TCCTCAAGTC AAAAGGGTAT GTCCCCGGAA CGGTTATTGC CACACATCTT
TACAACGGTA TTGAAGCAGC CGTCGCTTTT TTGGCCGCCG AATATATCGG ATGTGTTGTT
TGCCTTGTGG ATCCGCTTTT TAAGGCGGAC GAAGTGCCGT ACTATGTTGA AGACTCCGGT
GCCAAATGTC TAATTACCCA CCTGGAAAAA GATGAGATAG CCGGAAAACT ACCATCGGAA
GTTGATGTGA TAAACGTAAG AGAGGTTCAG GAAGTCTGTG AAAGCGACGA GTTTGAAAAA
TCTCTTGAAA TATATGATTT TGAAGAAAAT GAACTTGCAC TGCTTTTATA TACCTCGGGT
TCCACTTCCA CTCCCAAGGG TGTGATGCTT ACAACGGGCT GTTGTCATAC GTTCCTTAGA
AAGAATCATC AGTCGATGTA CAGATATGAT CCGGATGACA GAATCTTATG TTTTGTGCCC
TTTTCCCATG GATTCGGTTC AATTTCCGTC CTGATTCCGG CATTGGCGTA CAAGGCGGGA
ATTGTGTTTC AAAAAACATT CCATCCTGCC AAAGTTGCCG AAGCGGTGAT AAAAGAGAAC
ATTACCCATA TGCTGGGCGT GCCGACCCAT TACCGTCAAT TGTTAAGATA TGAACCTTTC
ATTAACAATC TGGGCAAGCT TAAAGCGGCT TTTTGCTCGG CAGCGCCCAT TAGCTGTGAA
GTGGCACGGC AGTGGTACGA AAAAACCGGA ATATATTTGG ATGAGGGCTA CGGAATGAGT
GAAGCAACCA CTCTTATTAC CACAAGGATG TCACGGCTTC CTTCAACTTC AGGGGATGTG
GGACACCCCC CGGAAGGGAT TATATCCGTT GACATTGTTG ACGACAACGA CAGGGTGGTT
GAAAACGGAA CAATAGGAGA AATTCGTGTA ACCGGACAGG GACTCATGCT TGGATACCTG
AATCGGCCGA AAGAGACAGC GGAAAGGCTC AGAAACGGAT ATCTCTATAC CGGTGATTTG
GGATACAAAA ACCCTGACGG ATCACTGGTT GTTTGCGGCA GAAAAACAGA ATTCATAAAC
GTTGCAGGGC TTAAAATATC GCCTGTTGAA GTTGAGACTG CATTAAATTC CCATTCAGAT
GTGATTGATT CTGCAGTTGT CGGAGTTACG GATGAAGTCT ATGGAGAAGT GGTAAAGGCT
TTTGTTATCA AGAAACAGGA TTCAAACCTC ACGGAGCGGG AACTGATAAA ATATGTTTCC
GACAAAGTGG CAAACTTTAA AGTACCGAAA TATGTTGTGT TTGTTGATGA ATTTCCGCGA
AACAATGTTG GAAAAGTTGA TAAAAAGGCA TTAAAAAATA TGTAG
 
Protein sequence
MNYCNWLFDT KDKDLEREII IDWETGKRLT FKGLQTEVVR LANFLKSKGY VPGTVIATHL 
YNGIEAAVAF LAAEYIGCVV CLVDPLFKAD EVPYYVEDSG AKCLITHLEK DEIAGKLPSE
VDVINVREVQ EVCESDEFEK SLEIYDFEEN ELALLLYTSG STSTPKGVML TTGCCHTFLR
KNHQSMYRYD PDDRILCFVP FSHGFGSISV LIPALAYKAG IVFQKTFHPA KVAEAVIKEN
ITHMLGVPTH YRQLLRYEPF INNLGKLKAA FCSAAPISCE VARQWYEKTG IYLDEGYGMS
EATTLITTRM SRLPSTSGDV GHPPEGIISV DIVDDNDRVV ENGTIGEIRV TGQGLMLGYL
NRPKETAERL RNGYLYTGDL GYKNPDGSLV VCGRKTEFIN VAGLKISPVE VETALNSHSD
VIDSAVVGVT DEVYGEVVKA FVIKKQDSNL TERELIKYVS DKVANFKVPK YVVFVDEFPR
NNVGKVDKKA LKNM