Gene Tpen_1664 details

Gene Information

Locus tag: Tpen_1664
Symbol:
ID: 4601246
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermofilum pendens Hrk 5
Kingdom: Archaea
Replicon accession: NC_008698
Strand:
Start bp: 1611141
End bp: 1612352
Gene Length: 1212 bp
Protein Length: 403 aa
Translation table: 11
GC content: 60%
IMG OID: 639774437
Product: aminotransferase, class I and II
Protein accession: YP_921062
Protein GI: 119720567
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0436] Aspartate/tyrosine/aromatic aminotransferase
TIGRFAM ID:
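
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the start and end positions are 1-based and inclusive (an assumption that the record does not state, although the numbers are consistent with it). The variable names are illustrative only.

```python
# Values copied from the record above; variable names are illustrative.
start_bp = 1611141
end_bp = 1612352

# Assumption: coordinates are inclusive at both ends, hence the +1.
gene_length_bp = end_bp - start_bp + 1

# A complete CDS is a whole number of codons; the final codon is the stop,
# which does not contribute an amino acid.
codons, remainder = divmod(gene_length_bp, 3)
protein_length_aa = codons - 1

print(gene_length_bp, remainder, protein_length_aa)  # 1212 0 403
```

Under that assumption the result matches the listed gene length of 1212 bp and protein length of 403 aa.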


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGCCTAAAC CTACGCTGGT TTTTTCTCCC CTCGCGGAGC AACTAGAGCC TGAGGGCGCC 
TTCGTATACC TCGACTTGGC AGCCGAGGCT AGGCGTAAAG GTATAGACGT TATCAGCTTC
GGGATAGGGC AACCCGACTT CCAGCCCCCT AGAGAAGCAC TCGAAGCAAT CAAGGAGGCG
CTGGACAGGG GCTACACGAG GTACATCTCG CCGCTAGGCA TACCGGAGCT CCGCGAGGAG
ATAGCGCGCT ACGTCGGCGA AAAGTACGGA GTGGACGTTA AGCCCAGCGA AGTCGCGGTA
ACCGTGGGGG CTAAGGCAGC GCTCTTCATG ACGATCTCCC TCCTGACGAG GCCTGGAGAC
GAGGTCGTCG TGCAGGACCC TGCGTTTCCG ACGTACGAGT GTGTTATTAG GTACGCGGGC
GGCAGGCCTG TCTTCGTGAG GCTCGCGGAG GAGAGAGGGT TCAGGCTCTC GGCGGAGGAC
GTCGAGAGGA CTGTAGAGGG CCTCCACAGG GTGAGGGGTA TCGTCGTGAA CTCTCCGCAT
AACCCGACGG GCTCGGCGCT CGAGGAGAAA GACGTGGAGG CTCTACTGGA GCTGGCTAGG
AGGAAGGGCA TGTTCGTGAT TAGCGACGAG ATATACGAGG ACTACGTGTA CGAGGGGAAG
CACGCGAGCT TCCTCCAGGC ACCGGACTGG CGCGACTACG TCGTGTACGT GAGCGGTTTC
TCGAAGACCT GGGCTATGAC CGGGCTGAGG CTCGGCTACG TCGTCGCCAG AGAAGAAGTC
ATACGGGCCC TCGAAGTCTT TGCGACGAAC ATGTACAGCT GTCCGCCCGC TCCTCTCCAG
TACGGGGCGC TCAAGGCGCT CCAGCTGGGT ACGGGCTGGT TCAAGCCGCT ACTAGAGGAG
TACCGTAGGA GGAGGGACGC CGCTTTCGAA GAGCTCAGCA AGATACCCGG CGTTAGCACC
GTGAAGTCCA GGGGTGCATT CTACCTCTTC CCGAACTTCA AGGAGGTTCT CCGAGCGACG
GGCTTGAGGA GTGTGGACGA GCTTGCGAAG AGGTTACTGT TCGAGGCGGG TGTAGTTCTC
TTACCTGGGA CTGCCTTCCC CTTGAGGGGC GGTGATGGGT ACATGCGTGT CTCGTACGTT
TTACCGGTGG AGAAGATTCG TGAGGGGTTC GGTAGGGTTC GCGAATGGGT CGAGAAGAAT
GCGGGTGGGT AG
 
Protein sequence
MPKPTLVFSP LAEQLEPEGA FVYLDLAAEA RRKGIDVISF GIGQPDFQPP REALEAIKEA 
LDRGYTRYIS PLGIPELREE IARYVGEKYG VDVKPSEVAV TVGAKAALFM TISLLTRPGD
EVVVQDPAFP TYECVIRYAG GRPVFVRLAE ERGFRLSAED VERTVEGLHR VRGIVVNSPH
NPTGSALEEK DVEALLELAR RKGMFVISDE IYEDYVYEGK HASFLQAPDW RDYVVYVSGF
SKTWAMTGLR LGYVVAREEV IRALEVFATN MYSCPPAPLQ YGALKALQLG TGWFKPLLEE
YRRRRDAAFE ELSKIPGVST VKSRGAFYLF PNFKEVLRAT GLRSVDELAK RLLFEAGVVL
LPGTAFPLRG GDGYMRVSYV LPVEKIREGF GRVREWVEKN AGG
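
The gene sequence begins with TTG while the protein sequence begins with M. This is consistent with translation table 11, in which TTG is one of the permitted start codons and an initiating codon is read as methionine. The sketch below illustrates that reading. The function name `translate_cds` and the hand-built codon table are illustrative assumptions, not part of the source database, and the code does not handle ambiguous bases.

```python
from itertools import product

# Standard codon-to-amino-acid assignments in NCBI base order (T, C, A, G).
# Table 11 uses the same assignments as the standard code; it differs in
# which codons may act as translation starts.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        # An initiating start codon is read as methionine, whatever it
        # would encode at an internal position.
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]  # raises KeyError on ambiguous bases such as N
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above, spaces included.
first_line = "TTGCCTAAAC CTACGCTGGT TTTTTCTCCC CTCGCGGAGC AACTAGAGCC TGAGGGCGCC"
print(translate_cds(first_line))  # MPKPTLVFSPLAEQLEPEGA
```

The printed residues match the first 20 residues of the protein sequence. Passing the full gene sequence to the same function should allow the remaining residues to be compared in the same way.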