Gene Tpen_0611 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_0611 
Symbol 
ID4601231 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp563935 
End bp565161 
Gene Length1227 bp 
Protein Length408 aa 
Translation table11 
GC content55% 
IMG OID639773385 
Productthreonine dehydratase 
Protein accessionYP_920018 
Protein GI119719523 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1171] Threonine dehydratase 
TIGRFAM ID[TIGR01124] threonine ammonia-lyase, biosynthetic, long form
[TIGR01127] threonine dehydratase, medium form 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0341122 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTCTGTAG ACGAAAAGCT GTTCGAGGAG CTTAGCATTA GGATAAAAGA GGCGCGCGAC 
GTTCTGAGAA ACGTTATACA TAGGACTCCT CTGCAGGCTT CGAAGACTCT CTCAGACCTA
ACGAACTCGG AAGTTTACCT GAAGCTGGAA AATCTTCAAA AGACTGGGGC GTTCAAGGTT
CGAGGAGCGT ACTATAAACT GCAGAAACTA GCCAGGAGCG GCGTGAAAAG CGTTGTCGCG
GCGAGCTCCG GGAACCACGC CCAGGGCGTA GCCTACTCCG CGTCCCTGCT AGGCATTAAA
TCGACGATCG TTATGCCCAG GTATACCCCT TTCTACAAGG TAAACGCCAC TAAGAGCTAC
GGGGCAGAGG TCGTCCTCCA CGGAGAAACC TACGACGACG CTTACCTGAA GGCGCTAGAA
ATAGCTGAGA AAACAGGCTC GCCGTTCGTG CACCCCTTCA ACGATCCCGA CATCATCGCC
GGACAGGGAA CCATAGGAGT CGAGATATTC GAAGATTTAA GCAACGTAGA CCTGGTACTA
GTACCCGTAG GTGGCGGTGG ACTCATATCC GGAATAGCAG TGGCCTTGAA GAAACTAAAG
CCCGACGTAA AGGTCGTAGG GGTTCAACCG AGGGGGGCTC CCGCCATGTA CCTTTCCTTC
CACGAGAAAA GGATAGTTGA AACCCCCCAG GTCAACAGCA TAGCCGACGG TGTTATCGTT
AAGCGCCCGG GGGACCTCAC CCTCAGGATA ATGGAGGAAT TCGTCGACGA CGTAGTGCTA
GTCGACGACC GAGAAATAGC GAGGGCCATG TTCCTTCTCC TAGAGCGCGT GAAGACTGTG
GCCGAGCCCG CGGGCGCTCT CTCCGTAGCC GCGCTGACCT CCGGGGCTGT GAGTGCAGAG
GGCAAAAGGG TCGTCGCAGT AGTGAGCGGT GGCAACGTAG ACCCCGCGCT CCTAGTCAGG
ATCCTAGGAC AGGTGCTATA CGCGGAGGGG AGACAGGTGA GGATACAGGG AGTGCTCCCA
GACAAGCCTG GACAACTGAA GAAAGTCATC GATGTTGTCT CCGAGCTCGG GCTGAACATA
GTGGAGATAC AGCACGAGAG GCTAAACCCG CTGCTAAGCC CGGGTATGGC TCAGGTTACG
CTAGGCCTCG AGGTGCCCTC GCGAGAGTAC GCAGACATGC TTATCTTAAA GCTTAAAGCC
CAGGGCCTCG ACTTTAAAGT GATATAA
 
Protein sequence
MSVDEKLFEE LSIRIKEARD VLRNVIHRTP LQASKTLSDL TNSEVYLKLE NLQKTGAFKV 
RGAYYKLQKL ARSGVKSVVA ASSGNHAQGV AYSASLLGIK STIVMPRYTP FYKVNATKSY
GAEVVLHGET YDDAYLKALE IAEKTGSPFV HPFNDPDIIA GQGTIGVEIF EDLSNVDLVL
VPVGGGGLIS GIAVALKKLK PDVKVVGVQP RGAPAMYLSF HEKRIVETPQ VNSIADGVIV
KRPGDLTLRI MEEFVDDVVL VDDREIARAM FLLLERVKTV AEPAGALSVA ALTSGAVSAE
GKRVVAVVSG GNVDPALLVR ILGQVLYAEG RQVRIQGVLP DKPGQLKKVI DVVSELGLNI
VEIQHERLNP LLSPGMAQVT LGLEVPSREY ADMLILKLKA QGLDFKVI