Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_1417 |
Symbol | |
ID | 4600734 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 1369797 |
End bp | 1371017 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 639774192 |
Product | aminotransferase, class I and II |
Protein accession | YP_920817 |
Protein GI | 119720322 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0436] Aspartate/tyrosine/aromatic aminotransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.581012 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCGCGCGA CGCGGCTAGC CGTGTCGCCG CTTAGGCTAC CGAGGAGGCG TCGCGGGGCG GACTTCCTGG AGATGGACCC CTCCTTCGAG TTCCTCGAGA AAGCCGGGAA AGGCGCCGTG AGCTTCGGGA TAGGCCAGCC GGACTTCTCC CCGCCCGGCG AGGTTCTCGA AGCCCTTAGG ACGGTTGGGG CGGAGGCTTT GAAGTATACC CCGCCCCTGG GGCTCCCGGA GCTCCGCGAG GCGCTGGCGG GGTACCTCTC GGAGAAGTAT GGGGTGGATG TTAAGCCCAG CGAGGTCGCG GTGACTCCCG GCGCGACCGC CGCGGTCTTC GCCTCGCTCG TCCTGCTCGT GCGCGGGAGG GCTAGGGTCG TCGTGCAGGA CCCGGGCTTC CCCATGTACG ACGACGTGGC GAGGTTTGCC GGTGGTAGGG TCGTCTACGC GTACTCGGGG ATCGAGGAGT CCTTCGAGTG GTCTGCCGAG AGCATAGCCG GGAGGCTCGG CGAGGGAGGA GTTGCGGTGC TGAACTTCCC CAACAACCCG ACGGGCTCCC TGGCCCCCCG CGGGCTACTG GAAGAGCTGG GAGGACTCGC CGCCAGGAAG GGCTTCTACG TTGTGAGCGA CGAGGTTTAC GAGGACTTCG TCTACGAGGG TAGCCACGAG TCCGTCCTGC AGGTACCCGA GCTCCGCGAG AGGTCCGTCT ACGTCGGGAG CTTCTCGAAG ACCTGGGGGC TCGCTGGGCT CAGGCTTGGG TACGTCGTGG CCCCGCGCCG GCTAGTCGAG AGGCTGGAGG CAGTCGCCGT GAATGTCTAC GGCTCGCCGC CCTCTCCGGC CCAGCTCGCC GCCCTCAGGG CCCTCGACCA CGGTCTCGGC TGGTTCTCAG GGGTTCTCTC GGAGTACAGG CGGAGGAGGG ACGCGCTTCT CGAGGAGCTC TCCAAGGTGG AGGGGGTGGA GCTCTACAGA CCTCGCGGCG CGTTCTACGT GTACCCCAGG GTGAGGGGGC TCTTGAAGAG GCTGGGCGTG GGCTCCTCCA GGGAGCTTGC GGAGTCGCTA CTCCAGGCCG GCGTGGTGGT CCTCCCGGGT GACGCTTACT CCGGGAGGGC GGGGCGGGAG CACGTGAGGC TTTCCTACGC GTTGCCTGTG GAGTCCATAC GGGAGGGGGT TAGGCGCATA AGGGCCTTCG TCGAGGAGGC TGCCTGCGCG CGGAGAAAAC GCGGCGCATA A
|
Protein sequence | MRATRLAVSP LRLPRRRRGA DFLEMDPSFE FLEKAGKGAV SFGIGQPDFS PPGEVLEALR TVGAEALKYT PPLGLPELRE ALAGYLSEKY GVDVKPSEVA VTPGATAAVF ASLVLLVRGR ARVVVQDPGF PMYDDVARFA GGRVVYAYSG IEESFEWSAE SIAGRLGEGG VAVLNFPNNP TGSLAPRGLL EELGGLAARK GFYVVSDEVY EDFVYEGSHE SVLQVPELRE RSVYVGSFSK TWGLAGLRLG YVVAPRRLVE RLEAVAVNVY GSPPSPAQLA ALRALDHGLG WFSGVLSEYR RRRDALLEEL SKVEGVELYR PRGAFYVYPR VRGLLKRLGV GSSRELAESL LQAGVVVLPG DAYSGRAGRE HVRLSYALPV ESIREGVRRI RAFVEEAACA RRKRGA
|
| |