Gene Tpen_1417 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_1417 
Symbol 
ID4600734 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp1369797 
End bp1371017 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content68% 
IMG OID639774192 
Productaminotransferase, class I and II 
Protein accessionYP_920817 
Protein GI119720322 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0436] Aspartate/tyrosine/aromatic aminotransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.581012 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCGCGCGA CGCGGCTAGC CGTGTCGCCG CTTAGGCTAC CGAGGAGGCG TCGCGGGGCG 
GACTTCCTGG AGATGGACCC CTCCTTCGAG TTCCTCGAGA AAGCCGGGAA AGGCGCCGTG
AGCTTCGGGA TAGGCCAGCC GGACTTCTCC CCGCCCGGCG AGGTTCTCGA AGCCCTTAGG
ACGGTTGGGG CGGAGGCTTT GAAGTATACC CCGCCCCTGG GGCTCCCGGA GCTCCGCGAG
GCGCTGGCGG GGTACCTCTC GGAGAAGTAT GGGGTGGATG TTAAGCCCAG CGAGGTCGCG
GTGACTCCCG GCGCGACCGC CGCGGTCTTC GCCTCGCTCG TCCTGCTCGT GCGCGGGAGG
GCTAGGGTCG TCGTGCAGGA CCCGGGCTTC CCCATGTACG ACGACGTGGC GAGGTTTGCC
GGTGGTAGGG TCGTCTACGC GTACTCGGGG ATCGAGGAGT CCTTCGAGTG GTCTGCCGAG
AGCATAGCCG GGAGGCTCGG CGAGGGAGGA GTTGCGGTGC TGAACTTCCC CAACAACCCG
ACGGGCTCCC TGGCCCCCCG CGGGCTACTG GAAGAGCTGG GAGGACTCGC CGCCAGGAAG
GGCTTCTACG TTGTGAGCGA CGAGGTTTAC GAGGACTTCG TCTACGAGGG TAGCCACGAG
TCCGTCCTGC AGGTACCCGA GCTCCGCGAG AGGTCCGTCT ACGTCGGGAG CTTCTCGAAG
ACCTGGGGGC TCGCTGGGCT CAGGCTTGGG TACGTCGTGG CCCCGCGCCG GCTAGTCGAG
AGGCTGGAGG CAGTCGCCGT GAATGTCTAC GGCTCGCCGC CCTCTCCGGC CCAGCTCGCC
GCCCTCAGGG CCCTCGACCA CGGTCTCGGC TGGTTCTCAG GGGTTCTCTC GGAGTACAGG
CGGAGGAGGG ACGCGCTTCT CGAGGAGCTC TCCAAGGTGG AGGGGGTGGA GCTCTACAGA
CCTCGCGGCG CGTTCTACGT GTACCCCAGG GTGAGGGGGC TCTTGAAGAG GCTGGGCGTG
GGCTCCTCCA GGGAGCTTGC GGAGTCGCTA CTCCAGGCCG GCGTGGTGGT CCTCCCGGGT
GACGCTTACT CCGGGAGGGC GGGGCGGGAG CACGTGAGGC TTTCCTACGC GTTGCCTGTG
GAGTCCATAC GGGAGGGGGT TAGGCGCATA AGGGCCTTCG TCGAGGAGGC TGCCTGCGCG
CGGAGAAAAC GCGGCGCATA A
 
Protein sequence
MRATRLAVSP LRLPRRRRGA DFLEMDPSFE FLEKAGKGAV SFGIGQPDFS PPGEVLEALR 
TVGAEALKYT PPLGLPELRE ALAGYLSEKY GVDVKPSEVA VTPGATAAVF ASLVLLVRGR
ARVVVQDPGF PMYDDVARFA GGRVVYAYSG IEESFEWSAE SIAGRLGEGG VAVLNFPNNP
TGSLAPRGLL EELGGLAARK GFYVVSDEVY EDFVYEGSHE SVLQVPELRE RSVYVGSFSK
TWGLAGLRLG YVVAPRRLVE RLEAVAVNVY GSPPSPAQLA ALRALDHGLG WFSGVLSEYR
RRRDALLEEL SKVEGVELYR PRGAFYVYPR VRGLLKRLGV GSSRELAESL LQAGVVVLPG
DAYSGRAGRE HVRLSYALPV ESIREGVRRI RAFVEEAACA RRKRGA