Gene Tpen_0081 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_0081 
Symbol 
ID4600811 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp63192 
End bp64388 
Gene Length1197 bp 
Protein Length398 aa 
Translation table11 
GC content54% 
IMG OID639772835 
Productaminotransferase, class I and II 
Protein accessionYP_919494 
Protein GI119718999 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0436] Aspartate/tyrosine/aromatic aminotransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGGCTGGG AAACAAGCTT TCGCCTAAGT AAAGCCGCCT CCCAGCTAGG AGCAGAGGAG 
GCTTTTGTCT ACCTTGCGCG GAGCCTCGAG CTCAAAAGAA GAGGGGTCGA CGTCGTCTCC
TTCGGTATAG GGCAACCGGA CTTCCAGCCT CCGCCGCACG TGATATCTGA GGCTAAGAAG
GCGATGGACG AGGGGTTCAA CGGCTACGGT CCAAGCCTGG GAATGCCGGA GTTACGGGAA
GCTATAGCGA GCTTCGTGTC AGAGGAATAC GGAGTGGACG TGAAGGCGGA GGAGGTAGCC
GTTACTGTTG GAGCTAAATC AGCGATCTTT ATGGCCATGA TATCACTGTT GGAGCCCGGA
GACGAAGTAA TAATACCTGA CCCCTCTTAC CCCCTCTATG AGTCCGTCGC ACGCTTCGCC
GGTGCCAAGC CTGTCTTCCT GCGCCTCCAC AGGGGCAACG GCTACAAGGT GACCTTCGAA
GAGGTAGAGA AGCTCGTAAC TCCCAAGACT AGAATGATCG TTCTAAACTA CCCGGAGAAC
CCCGTGGGCA CCACCATGGA TCAAAGGGAC GTGGAAGAGC TTGTCGATTT CTCGGCTAAG
CGTGGCATAG TCGTACTGTC GGACGAGATA TACGACCACT TTGTCTACGA GAAAAAGCAC
TTTTCTACTT TGCAGACGTC GAGTTGGCGC GACGCCGTCT ACTACGTGAA CGGTTTTTCG
AAGACCTTTG GGATGACGGG CTGGAGGCTG GGCTACGTCA TCTCTAATAA AGAGCTGATC
TCCAAGCTAT CAGTGGTCGC CAACAATATC TACTCCTGCC CGGTAACCTT CGAGCAGATA
GCCGCTGCGA AGGCCCTGAA GGAAGGCTTG TCTTGGTTTA AACCCATACT CGAGGGGTAC
AGGAAGAGGA GGGACCTCAT ATACAGGGAG TTTCTCTCGA TAAAAGGCGT AAAAGTCGTA
AAGCCCGAGG GGGCTTTCTA CATATTCCCG GACTTTACTG AGGTGATACG GGAGAAAGGG
CTGAAGAATG AGCGCGAGCT TGCGGACAGG CTACTAGAGG AGAGAGGCGT AGTGGTTCTG
CCTGGGACAG CTTTCCCGAA GGAGGGTGGG AAGGGGCACC TAAGGTTCTC CTTTGCTGTG
AGTGAGAACG ACATTGTAAG GGGCATTGCG AGGATTAAGG AGTGGATAGA GTCCTGA
 
Protein sequence
MGWETSFRLS KAASQLGAEE AFVYLARSLE LKRRGVDVVS FGIGQPDFQP PPHVISEAKK 
AMDEGFNGYG PSLGMPELRE AIASFVSEEY GVDVKAEEVA VTVGAKSAIF MAMISLLEPG
DEVIIPDPSY PLYESVARFA GAKPVFLRLH RGNGYKVTFE EVEKLVTPKT RMIVLNYPEN
PVGTTMDQRD VEELVDFSAK RGIVVLSDEI YDHFVYEKKH FSTLQTSSWR DAVYYVNGFS
KTFGMTGWRL GYVISNKELI SKLSVVANNI YSCPVTFEQI AAAKALKEGL SWFKPILEGY
RKRRDLIYRE FLSIKGVKVV KPEGAFYIFP DFTEVIREKG LKNERELADR LLEERGVVVL
PGTAFPKEGG KGHLRFSFAV SENDIVRGIA RIKEWIES