Gene Tpen_1724 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_1724 
Symbol 
ID4601749 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp1666467 
End bp1667618 
Gene Length1152 bp 
Protein Length383 aa 
Translation table11 
GC content48% 
IMG OID639774497 
Productglycosyl transferase, group 1 
Protein accessionYP_921122 
Protein GI119720627 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.4293 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGTGAAA CCAGAATAAT ATTTTTAACG GACTACTTAA CAACTAAAAG TGGCGGTGTA 
GGTTTTCTCT ACGAAGTAAT GAAAAAAATA GCTGGAAAAT ATCGCATAGA TATAATTGCG
GGGCGTGTTG AGGAAAGCCT CCAAAAGGAA GATTATGTAA GAATTTTGAA TCTTAATGTG
TACCGGGATG ATCTTCCATC AGCACAACCC GAGAATGCAG TAAAATTTCT GAATTTATCG
ACGAAAATGC TCAAAAGAAT AGTTAAGTGC ACCGGTGAGG AAGAGTTAAT CCTACACTTC
AACAACCATT TTCCCAACTT AATACCCTGG TTTATTGTAA ATAGTGTGCC AAAAGTATGT
TCAATACATC ACCTCGAAGA AACAGCGCAA TTCTCCGGGG TGATACCCAA GCTCGCCAAG
GTAGCGGTAC AGGATGTATT CGAGGTCAAC AGCCCCTGCA CCGTCGTACT TACAGTCTCA
AAGAGCGTCA GGCAAAAGCT AGCCTCGCTC AGAGCCGTGA GGAAGGGCGG CATCGTTGTG
ATCCCTCCGG GCATAGATAC TGGGAAGTAC CTCTCGGTAC GCAGAGATCC AGAGGAAAAC
ACCTTCATCA TGGTGGGAAG GCTGGAGAAA AGGAAGCACT ACGACCACGC GATAGTAGCC
TTCAAAGCGG TAGCCAAGGC AGAGCCCAAC GCCAAGCTTC TCATAGTGGG CGAGGGGCCT
CTACGACCGT ACCTAGCCCA GCTCATAAGA AAGTTTTCAC TCGTTAGGAA CGTTCAGTTG
CTGGGATCAG TAAGCGAAGA GGAAAAGCTG AGTCTGCTTT CAAAAGCTCA GGCGCTGATC
CACCTCGGGT ACCCCGAGGG ATTCGGCATC GTGCTCATAG AAGCGCTCGC CGCCGGAGTA
CCAGTAATAG CCTACGACAT ACCACCGCTC AACGAAGTCG TGGAGCACGG TGCAACAGGC
ATACTTGTGC CAAAGGATGA CGTAAGAGTG CTGGCCAGAG CTATAGTCAG GTTCAACAGC
TATACCTTCG AGGAGAAAAC ACTGAGAAAG AGAGCCGAGC GCTACGACAT CAACATTATT
GCAAGGGAGT TCGCCAGACT CTACGATACG CTAGCCTGCT GTAGGAGAAA TAATGCTGGA
AGTATTCAAT GA
 
Protein sequence
MRETRIIFLT DYLTTKSGGV GFLYEVMKKI AGKYRIDIIA GRVEESLQKE DYVRILNLNV 
YRDDLPSAQP ENAVKFLNLS TKMLKRIVKC TGEEELILHF NNHFPNLIPW FIVNSVPKVC
SIHHLEETAQ FSGVIPKLAK VAVQDVFEVN SPCTVVLTVS KSVRQKLASL RAVRKGGIVV
IPPGIDTGKY LSVRRDPEEN TFIMVGRLEK RKHYDHAIVA FKAVAKAEPN AKLLIVGEGP
LRPYLAQLIR KFSLVRNVQL LGSVSEEEKL SLLSKAQALI HLGYPEGFGI VLIEALAAGV
PVIAYDIPPL NEVVEHGATG ILVPKDDVRV LARAIVRFNS YTFEEKTLRK RAERYDINII
AREFARLYDT LACCRRNNAG SIQ