Gene Tpen_0737 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_0737 
Symbol 
ID4601144 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp684219 
End bp685439 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content59% 
IMG OID639773513 
ProductS-adenosylmethionine synthetase 
Protein accessionYP_920142 
Protein GI119719647 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1812] Archaeal S-adenosylmethionine synthetase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0136402 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGCGA AGAACATAGT TGTTGAGAAA TCGAGCTACA TCCCGCCGAG CAGGCTCCCC 
GTAGAGATAG TGGAGCGGAA GGGGACGGGC CACCCAGACT ACATTGCAGA CTCCATTTCG
GAGGCAGCGA GCAGGGAGCT ATCAAGGTAC TACCTAGAGC ACTACGGGGC AATACTACAC
CACAACCTCG ACAAGGTTCT AGTCGTCGGG GGACAGTCTA GCCCCAGGTT CGGGGGCGGG
GAGGTAGTCC AGCCCATATA CATACTCGTA TCCGGGAGAG CGACCACGGA GGTAGTCTCG
GAAGGAGGCA GGGAGAGCGT GCCCGTCGGC CCGATCATAC TCAAGGCGAC CCGCGACTGG
ATAAAGAGCA ACATCAGGTT CCTGGACCCC GATACCCACG TCATCGTAGA CTACAGGGTA
GGCAAGGGCT CCGCAGACCT CGTGGACATA TACAACCGCA GGGGGAGCTA TCCCGGGGCG
AACGACACCT CGATGGGTAT AGGCTACGCA CCCCTCTCGC CGACCGAGAG AGCCGTCCTG
GAGACCGAGA GGTTGCTGAA CTCCGAGAAG GTTAAGAAGG AGCTACCGGC CGTCGGGGAG
GACGTAAAGG TCATGGGGGT CAGGAAGGGC AACAAGCTAA CGCTCACGGT CGCCATGGCC
GTGATATCGA GGTTCGTCCA CAGCACCGAG GAGTACCTCT CCCTGAAGGA GGAGGTGAAA
AAGCTCGTAA AGGAGCACGC CGCTAGCATC ACGGATCTAG ACGTGGAAGT CTACGTAAAC
ACGGGCGACG ACCCCTCCAG GGGGGACAAG GGAGGGCTCT ACCTCACGGT CACGGGGACC
TCGGCCGAGC ACGGAGACGA CGGGGCCACG GGGAGGGGTA ACAGGGCGAA CGGGCTCATA
ACGCCCTTCA GACCTATGTC CCTGGAGGCG ACCGCAGGCA AGAACCCCGT CAGCCACATA
GGGAAGCTGT ACAACGTCGT AGCTTTCCAG GCGGCAAGCG AGATATGCGG GTTAGACCAC
GTCAACGAGG TCTACATCAA GCTGATAAGC CAGATAGGGA AGCCGATAAA CCAGCCTCTA
CTAGCCTACA TAGCCATAAA CGCCCCCGAC GACGTCCTCG CCCGCGTAAA GCACCAGGCA
GAGGAAGTTC TAGCAAAGCA CCTAGACAGG ATAAACGTGC TCTGGGAAAG CATACTGAAG
GGAAACGTGT CCCTGTTCTA G
 
Protein sequence
MSAKNIVVEK SSYIPPSRLP VEIVERKGTG HPDYIADSIS EAASRELSRY YLEHYGAILH 
HNLDKVLVVG GQSSPRFGGG EVVQPIYILV SGRATTEVVS EGGRESVPVG PIILKATRDW
IKSNIRFLDP DTHVIVDYRV GKGSADLVDI YNRRGSYPGA NDTSMGIGYA PLSPTERAVL
ETERLLNSEK VKKELPAVGE DVKVMGVRKG NKLTLTVAMA VISRFVHSTE EYLSLKEEVK
KLVKEHAASI TDLDVEVYVN TGDDPSRGDK GGLYLTVTGT SAEHGDDGAT GRGNRANGLI
TPFRPMSLEA TAGKNPVSHI GKLYNVVAFQ AASEICGLDH VNEVYIKLIS QIGKPINQPL
LAYIAINAPD DVLARVKHQA EEVLAKHLDR INVLWESILK GNVSLF