Gene Tpen_0336 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_0336 
Symbol 
ID4601703 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp305104 
End bp306630 
Gene Length1527 bp 
Protein Length508 aa 
Translation table11 
GC content53% 
IMG OID639773096 
ProductCoA-binding domain-containing protein 
Protein accessionYP_919748 
Protein GI119719253 
COG category[C] Energy production and conversion 
COG ID[COG1042] Acyl-CoA synthetase (NDP forming) 
TIGRFAM ID[TIGR02717] acetyl coenzyme A synthetase (ADP forming), alpha domain 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAAGCG AAAGCCTTAA GCCCTTCTTT GACCCTAGAG GAGTAGTAGT TGTAGGCGCC 
TCCCGCGAAG AAGAGAAGCC AGGACATGTA ATATTCAGAG TATTACTCGA AAACAGGAAG
AAAGGATTGT TAAAGGCGCC TGTCTACGGC GTTAACCCTA AAGCCGACTA CATCCTTGGA
GAAAAAGTAT TCCCCAGCGT CAAGGACATA CCCGGAGAGG TCGATCTAGC AGTTATAGTG
ATTCCCGCCG AACACGTTAA AAGGGCGATC GAGGACCTAG GAGTAAAGGG TATTAAAGCG
GCGATTATAA TCTCTGCAGG CTTCAGCGAG ATAGGTAGAA GTGACCTTGA GAGAGAGGTC
GCTGAGACAG CTAAAAAGCA CGGTATCAGG ATTATAGGTC CCAACTGTAT CGGCGTCTAC
TCTCCGTGGA GTGGTGTTGA CACGATCTTT CTACCTTACA CAAAGGTTCT CGAAGACGGA
AGGGAAGTTC TGAACGCCCC TAGACCCTCA AAGGGTTTTG TAGCGCTAAT ATCGCAGAGC
GGAGCCGTGG GTACAGCGGC ACTCGACTAT ATGTACGGAG AAGGCATAGG GCTTTCCCAC
TTCGTGAGCA TCGGGAACAA GGCAGACGTG GACGAGGTAG AGCTGGTAGA GTACCTAGGA
GGCGATGACA GGACGAGGGT TATACTCCTC TACCTTGAAA ACATAAAGAG GGGAAGAGAG
TTCATCGAGG TTGCCGGGAA AGTTACCTTG AAGAAACCCG TGGTAGTCCT TAAGGCCGGA
AGGACGTCGG CAGGACGCAG AGCAGCGGCA TCCCACACAG CAGCCCTAGC CGGCGTAGAC
GAGGTGTACG ATGCAGCGTT CAGGAGAAGC GGCGTTATCA GGGCTAACGA TATAGAAGAA
CTCTTCGACT ACGCCAAAGC GCTAGCTATG CAACCACCCG CACGTGGCGA AAGAATAGGG
ATAATCACGG ACGGCGGGGG CGCCGGCGTA ATGGCCACGG ACATCGCCGA AATGCTCGGA
CTCAAAGTCC CAGAGTTACA GGGCAACGCA AGGAGAGAGC TCGAGGAGCT TAGAGAGAGG
GGAATCTTCC CGAGGTACGC CCAGCTATCT AACCCCGTTG ACCTCACGGG TTCTGCTACG
AGCGAGATGT TCGTCGAGGC TACCCGGATA CTGCTTGAAT CGGACGAAGT TGACGCCGTT
GTAGTACTTG CGCTTCACCA AGTACCGGGT ATCCCCGACC CCGTTAAGCT TGCACGCGAG
ATTTCACGCC TTGCGGCAGG CTACGCTAAG CCCGTCGTCG CGGTGGACAC GGGGTGGAGC
GAGGCGGCTA TACTGGAAAG AAAGGAGTTC GACTCTATGG GTGTTCCGTC TTACCCGACG
CCGGAAAGAG CAGTCAAATC GCTAAGCGCG CTGGTCAGAT ACGGCAAGTA CTTGTCTTCT
CGCGGAGCGC TCGGGCGTTA TCTGCAGGAG TATTTGGACT TTAAACGAAA GAACATTTTG
CAAATGCATA AAAATAATTC TACATAA
 
Protein sequence
MTSESLKPFF DPRGVVVVGA SREEEKPGHV IFRVLLENRK KGLLKAPVYG VNPKADYILG 
EKVFPSVKDI PGEVDLAVIV IPAEHVKRAI EDLGVKGIKA AIIISAGFSE IGRSDLEREV
AETAKKHGIR IIGPNCIGVY SPWSGVDTIF LPYTKVLEDG REVLNAPRPS KGFVALISQS
GAVGTAALDY MYGEGIGLSH FVSIGNKADV DEVELVEYLG GDDRTRVILL YLENIKRGRE
FIEVAGKVTL KKPVVVLKAG RTSAGRRAAA SHTAALAGVD EVYDAAFRRS GVIRANDIEE
LFDYAKALAM QPPARGERIG IITDGGGAGV MATDIAEMLG LKVPELQGNA RRELEELRER
GIFPRYAQLS NPVDLTGSAT SEMFVEATRI LLESDEVDAV VVLALHQVPG IPDPVKLARE
ISRLAAGYAK PVVAVDTGWS EAAILERKEF DSMGVPSYPT PERAVKSLSA LVRYGKYLSS
RGALGRYLQE YLDFKRKNIL QMHKNNST