Gene Athe_1507 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_1507 
Symbol 
ID7408166 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp1591790 
End bp1593091 
Gene Length1302 bp 
Protein Length433 aa 
Translation table11 
GC content37% 
IMG OID643715870 
ProductPhenylacetate--CoA ligase 
Protein accessionYP_002573378 
Protein GI222529496 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG1541] Coenzyme F390 synthetase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATATGGT CTGAATATGA AAAACTCAAT AGAAAACAGT ATGAAGAACT GCAGCTTGAA 
AGACTTAAAA GAACGGTAGA AAGGGTTTAT GAAAATGTTC CTTTTTACCG TAAAAAATTT
GATGAAATAG GAGTAAAGCC ACATCATATT AAGAATTTAA AAGATATTCG GCTTCTTCCC
TTCACAACTA AGGATGACCT GAGAGAAAAC TATCCGTATG GTCTTTTTAC TGTCCCTCTT
TCAAAAATTG TTAGAATTCA TGCCTCCTCA GGCACAACAG GTAAGCCAAC CGTTGTAGGA
TATACAAAAC ATGACATGGA AGTGTGGACA GAGGTTGTTG CAAGAATAGT CACAGCAGCA
GGTGTCAGAG AACATGATAT TGCTCAGATT GCTTTTGGTT ACGGACTCTT TACTGGTGCT
TTTGGACTTC ACCAGGGTTT AGAGAGAGTT GGTGCAACAG TAATTCCAAT TTCAAGTGGT
AATACTGAAA AGCAGCTTAT GGTTATGCAG GATTTTGGTG CTACAGTTTT GGTATGTACA
CCGTCTTATG CACTTTACAT AGACGAGGTT GCAAATGAAC TTGGCATTGA TAAGTCAAGG
ATAAAACTAA GACTGGGCCT TTTTGGTGCA GAAGCTTCAA CAGTTGAGAT GAGAAGAGAG
ATTGAAAAGA AGTGGGGACT TTTTGCAACA GAAAATTATG GACTTTCTGA AATAATTGGT
CCAGGGGTTT CTGGAGAGTG TGAATATAGA GAAGGGTTAC ATATAAATGA AGACCATTTC
TATCCTGAGA TAATAAATCC CGACACAGGA GAGGTTCTTG AAGAAGGAGA AACAGGAGAG
CTTGTATTAA CAACCATTAC AAAAGAAGGT ATGCCTCTTA TAAGATATAG AACAAGGGAT
ATCACCTCAC TTATATATGA GCCATGCAAG TGCGGAAGGA CAAATGTGAG AATGACATCT
GTTAAAGGAA GAACAGATGA TATGCTAATA ATCCGAGGTG TCAATGTATT TCCCTCTCAG
ATAGAAAGTG TTCTAATGGG AATTGAAGGT ATAGGTCCTC ACTATCAACT TGTTGTCACA
AAGAAAGGAT ATTTGGATGA TTTGGAAGTT CATGTAGAGC TTGTTGATGG AAAACTTTTG
GAAAGATATG CTGAACTCGA GAAATTAGAA AATAAGATAA AGCACAGGAT ATTTACTGTA
TTGGGATTAA ATGTTAAGGT AAAACTTGTT GAACCGAAAA CTTTAGAAAG AACTACTGGA
AAGGCAAAAA GAGTAATTGA TTTGAGAAAT AAAACCAATT AA
 
Protein sequence
MIWSEYEKLN RKQYEELQLE RLKRTVERVY ENVPFYRKKF DEIGVKPHHI KNLKDIRLLP 
FTTKDDLREN YPYGLFTVPL SKIVRIHASS GTTGKPTVVG YTKHDMEVWT EVVARIVTAA
GVREHDIAQI AFGYGLFTGA FGLHQGLERV GATVIPISSG NTEKQLMVMQ DFGATVLVCT
PSYALYIDEV ANELGIDKSR IKLRLGLFGA EASTVEMRRE IEKKWGLFAT ENYGLSEIIG
PGVSGECEYR EGLHINEDHF YPEIINPDTG EVLEEGETGE LVLTTITKEG MPLIRYRTRD
ITSLIYEPCK CGRTNVRMTS VKGRTDDMLI IRGVNVFPSQ IESVLMGIEG IGPHYQLVVT
KKGYLDDLEV HVELVDGKLL ERYAELEKLE NKIKHRIFTV LGLNVKVKLV EPKTLERTTG
KAKRVIDLRN KTN