Gene Athe_1010 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_1010 
Symbol 
ID7407912 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp1107481 
End bp1108728 
Gene Length1248 bp 
Protein Length415 aa 
Translation table11 
GC content37% 
IMG OID643715375 
ProductPhenylacetate--CoA ligase 
Protein accessionYP_002572884 
Protein GI222529002 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG1541] Coenzyme F390 synthetase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.45626 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAAGTG TTGAGATGAG AGAAAATGTA AGTGAACTTT TTTTGAGCCA GCTCGAAAAT 
GTCTTTAAAA ACAGTCCTTT TTATCAAAAA AAATTCAAAG AAACAGGGGT AAAACTTAGT
AAGATACAGG ACTTAGATGA TATCAAAAAA CTTCCATTTA CAACAAAAGA AGAGTTAAGA
GATGCTTATC CCTTAGGGCT TATGGCTGTT GATGAGAAAA AGGTTGTAAG AATTCACTCT
TCGTCAGGTA CAACTGGAAT GCCAGTGATT ATTCCTTACA CTCAAAAAGA TGTTGATGAC
TGGAAAGAAA TGATGAAAAG GTGTTATCAG TTAGCAGGTG TAACAGAGCT GGACAGAGTC
CAGATAACTC CTGGATATGG CCTTTGGACA GCAGGTATTG GATTTCAGCT TGGTGCTGAG
TTTTTGGGTG CGATGACAAT TCCTATGGGG CCCGGAAATA CAGAAAAACA GCTTCAGATG
ATGGTGGATT TAAAGTCAAC AGTCATTATT GCGACTTCAT CTTACGGGCT TTTGCTTGCT
GAAGAGGTAG TTAAAAGAGG TTTAAAAGAC AAGATACATT TAAGAATTGG GATATTTGGT
TCTGAAAGAT GGGGAGAAAA ACAGCGAAAA ACTATTGAGG CGTATCTGGG CATAGAAAGT
TTTGATATTT ATGGGTTAAC AGAGATTTAT GGACCGGGAA TTGCAATAGA TTGCAAAAAA
CATACAGGCC TTCATTATTT TGATGATTTT CTGTATTTTG AAATAATTGA CCCTCAAACA
GGAGAGAATG TGCCTGATGG AGAGTTTGGT GAACTTGTTA TTACCACTTT GAGAAAAGAA
GGTGCTCCTC TTATAAGATA CAGAACAAGA GACATCACAC GAAAAATTCC AGGTGAGTGC
AGTTGTGGTT CTAAGTATCC ACGTATTGAC AGGATTGTTG GTAGAACTGA CGACATGATA
AAGGTCAAAG GTGTTAATAT CTTTCCTGCT CAGATAGACA CATTTTTGAA TGATGTAGAT
GGTGTTGGAA GTGAATATCA AGTGATTATA GAGAGGATTG ATTACAGGGA TAAACTTACA
TTAAAGGTTG AGGTTAAAGA TGAATATTTT ACTTCTGAGA TGAAAGAGTT AATCTCTCAT
GAATTTAAAA ATAAGATAGG AGTATCGCCT GAGGTCATTT TGTGCAGGGT AGGTGAACTT
CCTCGAAGCG AAAAGAAGAC AAAACGCATA TTTGATTTGA GAGGCTGA
 
Protein sequence
MESVEMRENV SELFLSQLEN VFKNSPFYQK KFKETGVKLS KIQDLDDIKK LPFTTKEELR 
DAYPLGLMAV DEKKVVRIHS SSGTTGMPVI IPYTQKDVDD WKEMMKRCYQ LAGVTELDRV
QITPGYGLWT AGIGFQLGAE FLGAMTIPMG PGNTEKQLQM MVDLKSTVII ATSSYGLLLA
EEVVKRGLKD KIHLRIGIFG SERWGEKQRK TIEAYLGIES FDIYGLTEIY GPGIAIDCKK
HTGLHYFDDF LYFEIIDPQT GENVPDGEFG ELVITTLRKE GAPLIRYRTR DITRKIPGEC
SCGSKYPRID RIVGRTDDMI KVKGVNIFPA QIDTFLNDVD GVGSEYQVII ERIDYRDKLT
LKVEVKDEYF TSEMKELISH EFKNKIGVSP EVILCRVGEL PRSEKKTKRI FDLRG