Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_0737 |
Symbol | |
ID | 4601144 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 684219 |
End bp | 685439 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 639773513 |
Product | S-adenosylmethionine synthetase |
Protein accession | YP_920142 |
Protein GI | 119719647 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1812] Archaeal S-adenosylmethionine synthetase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0136402 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGCGA AGAACATAGT TGTTGAGAAA TCGAGCTACA TCCCGCCGAG CAGGCTCCCC GTAGAGATAG TGGAGCGGAA GGGGACGGGC CACCCAGACT ACATTGCAGA CTCCATTTCG GAGGCAGCGA GCAGGGAGCT ATCAAGGTAC TACCTAGAGC ACTACGGGGC AATACTACAC CACAACCTCG ACAAGGTTCT AGTCGTCGGG GGACAGTCTA GCCCCAGGTT CGGGGGCGGG GAGGTAGTCC AGCCCATATA CATACTCGTA TCCGGGAGAG CGACCACGGA GGTAGTCTCG GAAGGAGGCA GGGAGAGCGT GCCCGTCGGC CCGATCATAC TCAAGGCGAC CCGCGACTGG ATAAAGAGCA ACATCAGGTT CCTGGACCCC GATACCCACG TCATCGTAGA CTACAGGGTA GGCAAGGGCT CCGCAGACCT CGTGGACATA TACAACCGCA GGGGGAGCTA TCCCGGGGCG AACGACACCT CGATGGGTAT AGGCTACGCA CCCCTCTCGC CGACCGAGAG AGCCGTCCTG GAGACCGAGA GGTTGCTGAA CTCCGAGAAG GTTAAGAAGG AGCTACCGGC CGTCGGGGAG GACGTAAAGG TCATGGGGGT CAGGAAGGGC AACAAGCTAA CGCTCACGGT CGCCATGGCC GTGATATCGA GGTTCGTCCA CAGCACCGAG GAGTACCTCT CCCTGAAGGA GGAGGTGAAA AAGCTCGTAA AGGAGCACGC CGCTAGCATC ACGGATCTAG ACGTGGAAGT CTACGTAAAC ACGGGCGACG ACCCCTCCAG GGGGGACAAG GGAGGGCTCT ACCTCACGGT CACGGGGACC TCGGCCGAGC ACGGAGACGA CGGGGCCACG GGGAGGGGTA ACAGGGCGAA CGGGCTCATA ACGCCCTTCA GACCTATGTC CCTGGAGGCG ACCGCAGGCA AGAACCCCGT CAGCCACATA GGGAAGCTGT ACAACGTCGT AGCTTTCCAG GCGGCAAGCG AGATATGCGG GTTAGACCAC GTCAACGAGG TCTACATCAA GCTGATAAGC CAGATAGGGA AGCCGATAAA CCAGCCTCTA CTAGCCTACA TAGCCATAAA CGCCCCCGAC GACGTCCTCG CCCGCGTAAA GCACCAGGCA GAGGAAGTTC TAGCAAAGCA CCTAGACAGG ATAAACGTGC TCTGGGAAAG CATACTGAAG GGAAACGTGT CCCTGTTCTA G
|
Protein sequence | MSAKNIVVEK SSYIPPSRLP VEIVERKGTG HPDYIADSIS EAASRELSRY YLEHYGAILH HNLDKVLVVG GQSSPRFGGG EVVQPIYILV SGRATTEVVS EGGRESVPVG PIILKATRDW IKSNIRFLDP DTHVIVDYRV GKGSADLVDI YNRRGSYPGA NDTSMGIGYA PLSPTERAVL ETERLLNSEK VKKELPAVGE DVKVMGVRKG NKLTLTVAMA VISRFVHSTE EYLSLKEEVK KLVKEHAASI TDLDVEVYVN TGDDPSRGDK GGLYLTVTGT SAEHGDDGAT GRGNRANGLI TPFRPMSLEA TAGKNPVSHI GKLYNVVAFQ AASEICGLDH VNEVYIKLIS QIGKPINQPL LAYIAINAPD DVLARVKHQA EEVLAKHLDR INVLWESILK GNVSLF
|
| |