Gene Athe_1504 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_1504 
Symbol 
ID7408163 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp1588129 
End bp1589430 
Gene Length1302 bp 
Protein Length433 aa 
Translation table11 
GC content39% 
IMG OID643715867 
ProductPhenylacetate--CoA ligase 
Protein accessionYP_002573375 
Protein GI222529493 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG1541] Coenzyme F390 synthetase 
TIGRFAM ID[TIGR02155] phenylacetate-CoA ligase 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGATATT GGGATGAACA TATGGAGTGC ATGGACAGAA GTACTTTGCA GGAGATTCAG 
CTCAAAAGAC TTGTTGAGAC TGTAAAAAGA GTATATACGA GCGTGCCGTA TTACAGGCGA
AAAATGCAAG AGCTTGGTAT TATCCCTGAA GACATAAAGA GCTTGGACGA CCTAAAGAAA
CTTCCATTTA CTACAAAACA GGACTTGCGT GATAACTACC CGTATGGACT CTTTGCTGTA
CCTTTGAGCG AAATTGTAAG AATTCATGCT TCTTCTGGAA CAACAGGCAA ACCCACTGTT
GTCGGATACA CAAAACATGA TATTGGTATT TGGTCTGAGG TTATGGCAAG GACACTTGTG
GCAGCCGGAG CAGACAAACA CTCGTTTGTC CAGATAGCAT ATGGCTATGG TCTTTTCACA
GGTGGGCTTG GTGTTCACTA CGGAGCAGAG CGGATTGGTG CATCAGTAAT ACCTATTTCA
TCTGGAAATA CGAGAAGACA GATACAGATT ATGGTTGATT TTGGAACAAC AGTTTTGGCT
TGCACACCTT CATATGCTCT CTATCTTGCT GAGACTATGG AAGAGATGGG AATTGACAAA
TCTCAGCTTA AGCTAAAGTC AGGGGTATTT GGTGCAGAGC CGTGGTCAGA AAACATGCGA
AAAGAGATAG AATCAAAATT GAATATCAAG GCATACGATA TATACGGTCT TTCAGAAATA
ATTGGACCTG GGGTTTCGTT TGAATGTGAA TATCAGTGTG GAATGCATAT AAATGAAGAC
CATTTCTTAC CAGAAATAAT CAATCCAGAA ACAGGTGAAG TTTTGGGCGA GGGAGAATAT
GGTGAGCTTG TATTTACCAC AATTACAAAA GAAGGACTTC CACTTATAAG ATACAGAACA
CGAGACATAA CTGCTCTTCA CTATGATAGG TGCAAGTGTG GTAGAACATT AGTAAGGATG
GAAAAGGTTA TTGGTAGAAC AGATGATATG ATAATTATAC GTGGTGTCAA TGTCTTCCCA
TCTCAGATAG AAAGCGTCCT ACTTGAGATG GGAGAAGTCG AGCCACATTA TCAGCTGATT
GTGGATAGGG TTAACAATCT TGATGTTCTT GAGGTTTTGG TAGAAGTTTC TGAAAGAATG
TTTTCTGATG AGGTTAAAAA ACTTGAACAG CTTGAGAAGA AAATAACAAA AGCTATTGAA
GAGACTCTTG GAATTTCTGT AAAGGTTCGA CTTGTTGAAC CAAAGACAAT TGAAAGAAGT
GAAGGAAAGG CGAAAAGAGT TATTGACAAG AGAAAAATAT AA
 
Protein sequence
MRYWDEHMEC MDRSTLQEIQ LKRLVETVKR VYTSVPYYRR KMQELGIIPE DIKSLDDLKK 
LPFTTKQDLR DNYPYGLFAV PLSEIVRIHA SSGTTGKPTV VGYTKHDIGI WSEVMARTLV
AAGADKHSFV QIAYGYGLFT GGLGVHYGAE RIGASVIPIS SGNTRRQIQI MVDFGTTVLA
CTPSYALYLA ETMEEMGIDK SQLKLKSGVF GAEPWSENMR KEIESKLNIK AYDIYGLSEI
IGPGVSFECE YQCGMHINED HFLPEIINPE TGEVLGEGEY GELVFTTITK EGLPLIRYRT
RDITALHYDR CKCGRTLVRM EKVIGRTDDM IIIRGVNVFP SQIESVLLEM GEVEPHYQLI
VDRVNNLDVL EVLVEVSERM FSDEVKKLEQ LEKKITKAIE ETLGISVKVR LVEPKTIERS
EGKAKRVIDK RKI