Gene Tpen_0205 details


Gene Information

Locus tag: Tpen_0205
Symbol:
ID: 4602216
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermofilum pendens Hrk 5
Kingdom: Archaea
Replicon accession: NC_008698
Strand:
Start bp: 183990
End bp: 185009
Gene Length: 1020 bp
Protein Length: 339 aa
Translation table: 11
GC content: 53%
IMG OID: 639772959
Product: glycosyl transferase family protein
Protein accession: YP_919618
Protein GI: 119719123
COG category: [R] General function prediction only
COG ID: [COG1216] Predicted glycosyltransferases
TIGRFAM ID:
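
The coordinate and length fields above are mutually consistent, which the following minimal sketch illustrates. It assumes 1-based, inclusive coordinates and a gene length that includes the stop codon; the function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS whose length includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminating stop codon.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length_bp(183990, 185009)
    print(length)                     # 1020
    print(protein_length_aa(length))  # 339
```

Under those assumptions, 185009 - 183990 + 1 gives the listed 1020 bp, and 1020 / 3 - 1 gives the listed 339 aa.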


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.234288
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGTACCCAA AGGTAAGCAT AGTCATAGTG AACTTTAAGG GAACAGAAGC ACTGAAAAAG 
TGTCTGAGAA GCGTTTTCGA AACCGAGTAC CCCGACTACG AGGTCATAGT TGTAGACTCC
CTGACCGATA ACGTCGAGAA AGCTTTGAGG GACGAGTTCG GATCTAAGGA AAACTTGAAG
ATCATTCACT TCGACTCGAA CATCGGCGCC TCGGGATCCC ACAACGTAGG GGCGATGGCG
AGCGACCCTA ACAGTAAGTA CCTGGTGTTC CTCGACAACG ACGTTGAAGT AGAAAAGGAC
TGGTTGAAGA GGCTGGTCGA GACCGCCGAG GAGAGCCCGA GGATCGGATG CGTTCAGGCG
AAAGTGATCT CGAAGAGCAA TGAGGGTAGG ATGGATCACG CAGGGCTAGC GTTAGACTTG
ACGGCGACGT GGCTTTCTAC GTACGGGTTT AGAGAAGAGA TATTCCAGCG CCCGATAGAG
TTGTTCGTCG CCAGTTCGGC TGCGCTTTTA ACCCCACGCG AGCTCTACTT TAAGGTAGGC
GGGTTCGACA GCTCCTACTT CATATACGAC GACGACACGG ATTACACGTG GCGCGTTAGG
CTTCAGGGCT ACATCTCCCT CCTGGAGACA AGGGCTCGCG TATACCACGA GGACAAGATA
AGCTCGAGGC TACGCTTCGA CAAGCTGTAC TTCGGGTATC GGAACAGGCT TCAGAACATC
GTGAAAAACA TGGACGCGAA GAACATGGTC GTTAGCCTGC TGGTGACCCT CTACCTTGGA
TACCTGGTAA CGGTACTCCT AGCGCTTGCC GGCAGGATCA GGGAGACAGC CGCATACTTC
TTGTCGTCTA CGAGCGTCGT GTTCTCGCTA CCAAGGCTTA TGTGGAAGCG TAAGCTAGTC
TCGCTGAAGA GAAGGGTTCC CGACAGCTAC TTCGAGAAGA AAGGCTTCCT TAGGAAGGAC
CTTCTCGGGA CGATCTACAT GACGAGGGCA TTGCTTATTC GCTCGGTAAG GAAGAAGTAG
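
The GC content field of this record can be checked against the gene sequence above with a short calculation. The sketch below is one possible way to do it, assuming the sequence is pasted as plain text in 10-base blocks; `gc_content` is a hypothetical helper name.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence.

    Whitespace (the spaces and line breaks used to block the sequence)
    is ignored; case is normalized.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


if __name__ == "__main__":
    # First line of the gene sequence, as a usage example.
    first_line = "GTGTACCCAA AGGTAAGCAT AGTCATAGTG AACTTTAAGG GAACAGAAGC ACTGAAAAAG"
    print(round(gc_content(first_line), 1))
```

Passing the full 1020 bp sequence gives a percentage that can be compared with the rounded value listed in the record.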
 
Protein sequence
MYPKVSIVIV NFKGTEALKK CLRSVFETEY PDYEVIVVDS LTDNVEKALR DEFGSKENLK 
IIHFDSNIGA SGSHNVGAMA SDPNSKYLVF LDNDVEVEKD WLKRLVETAE ESPRIGCVQA
KVISKSNEGR MDHAGLALDL TATWLSTYGF REEIFQRPIE LFVASSAALL TPRELYFKVG
GFDSSYFIYD DDTDYTWRVR LQGYISLLET RARVYHEDKI SSRLRFDKLY FGYRNRLQNI
VKNMDAKNMV VSLLVTLYLG YLVTVLLALA GRIRETAAYF LSSTSVVFSL PRLMWKRKLV
SLKRRVPDSY FEKKGFLRKD LLGTIYMTRA LLIRSVRKK