Gene Tpen_1745 details


Gene Information

Locus tag: Tpen_1745
Symbol:
ID: 4601770
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermofilum pendens Hrk 5
Kingdom: Archaea
Replicon accession: NC_008698
Strand:
Start bp: 1687146
End bp: 1688207
Gene Length: 1062 bp
Protein Length: 353 aa
Translation table: 11
GC content: 52%
IMG OID: 639774518
Product: glycosyl transferase family protein
Protein accession: YP_921143
Protein GI: 119720648
COG category: [R] General function prediction only
COG ID: [COG1216] Predicted glycosyltransferases
TIGRFAM ID:
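
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the reported gene length includes the stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Number of amino acids, assuming the last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminating stop codon.
    return gene_length_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(1687146, 1688207)
length_aa = protein_length(length_bp)
print(length_bp, "bp;", length_aa, "aa")
```

Under these assumptions the computed values match the reported Gene Length (1062 bp) and Protein Length (353 aa).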


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGGCTCGGG CTGGCGAGTA CCCGTTGATC ACGGTCGCGA TGGTGGCGCT GAACAGGGAG 
TGGATTATCG GGCACGCCTT GAAGAGCCTG CTTTCCCAGA CGTACCCCCA CGACAGGATT
TTCTTCGTCC TCGTGGACGG CGGGAGCGTT GACCGCACGG TGGAGGTTGT GCGCGAAGCC
CTGAGCGGGT CCGACCTAGC AGGCTTCGAG GTCGTCGTGC AGAGGAGCAA TATCCCTGAG
GCTAGGAACA TAGCTATCGA GAGGATGAGG GGGGAGGCGA TATTCTTCTG GGACAGCGAC
GTACTCCTAG AGCCGGACGG TTTGAAAATG CTATACGAAG CCGCCTCAGA CTATGGGATC
GATATCTTGT CCGCAGATAC CCTGTCCATT AAAGTCTCCA GTGTAGAAGA AGGTTGGAGA
GTTTTAGAAG AGCTGTGTAA GAAGAGGGGA AAAATCGGCG TAGAGCTTGT TCCCGCTGTC
GCTATGGGTG CAACGCTTAT TCGAAAAATC GTGTTGGATA GGGTTAGGTT TGACCCCGAG
CTTACCTTTG GCGAGGACGC TGATTTCTGC GTAAAGGCAC GTTCCCTCGG TTATAGAGTA
GCCGTGCATA GAGGCGTGGT GGCTGTCGAT GTGAACGTTA CCGGTAGAGC TGGTAGCGAC
ATATACGTTT CTAAACCTCT CACCGAACTT TTAAAGGGTA TTCGGAAGAA GGCTAAAGTA
AAGGTTCTCG GTCTCAGCTT CGATCCGGGT CTCAAAGATG TACTCGTTTA CATGTGGAGG
TACAAGAGGT ACCTTTACTA CCTCGGCTAC CTTCCGATGC TTGTAGCGCT AATAGTCGGA
CTTGCGAAAA ATCTCCCGAT GCTCACTTTA CTCTTCCCTG CCTATGTACT TCCCTACCTA
CTATACCAGG CTAAGAGGAG GGGTATCCTC TTGGGCTTAA AGACCTTTAT AGCGAGCCTC
GTTGTCGGTC TTCCATTGTC GCTTTCAATG CTAGCATATG CTACTGCTAG AAGCTTGGAA
CGCTTCGTAA GCACGCCGCG CAGAAAGAAG CATGTCCGTT AG
 
Protein sequence
MARAGEYPLI TVAMVALNRE WIIGHALKSL LSQTYPHDRI FFVLVDGGSV DRTVEVVREA 
LSGSDLAGFE VVVQRSNIPE ARNIAIERMR GEAIFFWDSD VLLEPDGLKM LYEAASDYGI
DILSADTLSI KVSSVEEGWR VLEELCKKRG KIGVELVPAV AMGATLIRKI VLDRVRFDPE
LTFGEDADFC VKARSLGYRV AVHRGVVAVD VNVTGRAGSD IYVSKPLTEL LKGIRKKAKV
KVLGLSFDPG LKDVLVYMWR YKRYLYYLGY LPMLVALIVG LAKNLPMLTL LFPAYVLPYL
LYQAKRRGIL LGLKTFIASL VVGLPLSLSM LAYATARSLE RFVSTPRRKK HVR
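
The gene sequence begins with GTG while the protein begins with M. This fits translation table 11, where GTG is an alternative start codon that is read as methionine at the initiation position. The following is a minimal sketch of how the protein could be derived from the gene sequence and how a GC percentage could be computed. It is not the database's own pipeline: the function names are hypothetical, the codon table is built from the standard NCBI amino-acid ordering (which table 11 shares with table 1), and the start codon set is the one assumed here for table 11.

```python
BASES = "TCAG"
# Amino acids in NCBI order (first base T, C, A, G; then second; then third).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Start codons assumed for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate a complete coding sequence; a start codon in first position becomes M."""
    dna = clean(seq)
    if not dna or len(dna) % 3 != 0:
        raise ValueError("sequence must be non-empty and a multiple of 3 bases long")
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    protein = [CODON_TABLE[codon] for codon in codons]
    if codons[0] in START_CODONS:
        protein[0] = "M"
    if protein[-1] == "*":
        protein.pop()  # drop the terminating stop codon
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    if not dna:
        raise ValueError("sequence is empty")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First seven codons of the gene sequence above, in the same 10-base display blocks.
print(translate_cds("GTGGCTCGGG CTGGCGAGTA C"))
```

Passing the full gene sequence to `translate_cds` should, under the same assumptions, give a sequence that can be compared against the protein sequence listed above, and `gc_content` can be compared against the reported GC content.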