Gene Tpen_0890 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_0890 
Symbol 
ID4600466 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp837532 
End bp838914 
Gene Length1383 bp 
Protein Length460 aa 
Translation table11 
GC content60% 
IMG OID639773668 
Producttryptophan synthase subunit beta 
Protein accessionYP_920294 
Protein GI119719799 
COG category[R] General function prediction only 
COG ID[COG1350] Predicted alternative tryptophan synthase beta-subunit (paralog of TrpB) 
TIGRFAM ID[TIGR01415] pyridoxal-phosphate dependent TrpB-like enzyme 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCAAAA AATACTTCCT AGACGAAGAG GAGCTACCGA GGGAGTGGTA CAATATACTA 
CCAGACCTGC CCGAGCCCCT ACCGCCACCG CTGAACCCCG CAACCGGGAA GCCCGTCTCG
CCAGAAGACC TTGAACCCAT ATTCCCCAAG GAGCTTATCA GGCAGGAGGT ATCGCAGGAG
AGGTTTATCC CGATACCCGA AGAGGTACTC GAGGTGTACA GGATATGGAG GCCTACCCCC
CTCATCAGGG CCACCAGGCT GGAGAAAGCC CTCAGGACCC CTGCGAGGAT CTACTACAAG
TACGAAGGGG TATCGCCCCC AGGGAGCCAT AAGCCGAACA CGGCTGTCGC GCAGGCGTAC
TACAACGCGC GCGAAGGCGT GGCGCGCCTC ACGACGGAGA CCGGGGCTGG GCAGTGGGGA
AGCGCTTTAT CGTTCGCGAC CATGCTGTTC GGGCTCAAGC TTACGGTGTA CATGGTGAGG
GTGAGCTACC AGCAGAAGCC CTACAGGGCT ATACTGATGC AGACTTGGGG CGCCGAGGTC
GTCCCGAGCC CCAGTAACAG GACTAAGGCG GGCAGGGCTT TCCTCGAGAA GGACCCCGAG
CACCCCGGCT CTCTCGGTAT AGCGATCAGC GAGGCGCTGG AAGACGCAAT AGAGCACGAG
GACACGAAGT ACAGCCTGGG TAGCGTGCTA AACCACGTGC TGCTCCACCA GACGATCATA
GGGCTCGAAG CCATCAAGCA GATGCAGGCG CTCGACGAGT TCCCCGACAT AGTGATAGGC
TGTGTGGGCG GTGGGAGCAA CTTCGCGGGA ATATCGTACC CCTTCTACTA CCTGGTGCGT
AGCGGGAAGG CACCGAAGAA GACGAGGTTC CTCGCCGTGG AGCCTGCGTC CTGCCCCTCC
ATGACCAAGG GAGAGTACAT GTACGACTTC GGCGACACGG CGAGGCTTAC ACCGCTCCTA
AAGATGTACA CGCTGGGACA CGACTTCGTG CCACCGCCGA TTCACGCCGG CGGACTACGT
TACCACGGCG CGGCTCCAAC CCTAAGCCTT CTCAAAAAAG CGGGGATCTT CGAGAGCGTT
GCCTACAACC AGGTGGAAGT CTTCGAGGCG GCGAGGCTCT TCGCCCGGAC GGAGGGCATA
GTGCCTGCTC CTGAGAGCGC CCACGCCATA AAGGCGGTCA TCGACGAGGC CCTAGAGGCT
AAGAGGAAGG GCGAGGAAAA GGTCATACTC TTCAACCTGA GCGGGCACGG GCTCTTGGAT
CTCGCCGCCT ACAGGGAGTT CCTCGAGGGA CGATTACCGG CGCACGAGTA CCCCGCCGAG
GCGATACAGA AGTCCATCGC GAAGATCAAG GAGTACCTAC TCGCGCGGGG CATACAGCCC
TAG
 
Protein sequence
MSKKYFLDEE ELPREWYNIL PDLPEPLPPP LNPATGKPVS PEDLEPIFPK ELIRQEVSQE 
RFIPIPEEVL EVYRIWRPTP LIRATRLEKA LRTPARIYYK YEGVSPPGSH KPNTAVAQAY
YNAREGVARL TTETGAGQWG SALSFATMLF GLKLTVYMVR VSYQQKPYRA ILMQTWGAEV
VPSPSNRTKA GRAFLEKDPE HPGSLGIAIS EALEDAIEHE DTKYSLGSVL NHVLLHQTII
GLEAIKQMQA LDEFPDIVIG CVGGGSNFAG ISYPFYYLVR SGKAPKKTRF LAVEPASCPS
MTKGEYMYDF GDTARLTPLL KMYTLGHDFV PPPIHAGGLR YHGAAPTLSL LKKAGIFESV
AYNQVEVFEA ARLFARTEGI VPAPESAHAI KAVIDEALEA KRKGEEKVIL FNLSGHGLLD
LAAYREFLEG RLPAHEYPAE AIQKSIAKIK EYLLARGIQP