Gene Tpen_1722 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_1722 
Symbol 
ID4601747 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp1664147 
End bp1665331 
Gene Length1185 bp 
Protein Length394 aa 
Translation table11 
GC content43% 
IMG OID639774495 
Productglycosyl transferase, group 1 
Protein accessionYP_921120 
Protein GI119720625 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.664722 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAAACAGG TACAGGAATT CTCGCATAAA GGAAAATACT GCAAAAAGCT TAGAATAGCA 
TTCGTACACA ACTACTACAT CCACTATAGG GTTCCCCTAT TTAAAGCACT AAGCAGAGTT
TTTCATGTAA AATTCTTCTT CGATGATGTC TACGAATACG TGAAGAAACC GGAAAAAGAA
CTAGACTTCG TGATAAATAA AGGACCTAGG ATAAAAGGGA TAAGGCTTCC AGTCACACTT
TTATTCCACT TGTTAAAGCA GAGACCACAT CTAGTAATTG CCGGCGACTC AACGTACCCC
AGCACGCTAA TAGCCTTTCT TACCTCCAAG ATGCTAAGAG CGAAGTTTAT CCTGTGGGAA
GAGAGGTGGT TCTGGCACAG CAGCCTCCTT TCAAATCTTC TATGGCCTTT CTCACGTACG
GTGGCACTAA AAGCCGATGC GCTCATAGTA CCTGGCACGC TATCCAAAGA GTTTTACAAG
AACATAGGAG TCGAGAAGGG AAGGATTTTC GTAGCACCGA ATGCAAGCTA CGTCGACATA
AACGAGGAAA TTAAAAACAG GGCTAGAGCT CTTAGAAGAA AACTAGGCTT GGACAATAAG
ATCGTAGTAC TTTACATTGG GAGAGTAATT CCTTTAAAAG GTGTCCACCT AATTCTCAAA
GCTTTAACAA AAATAAATGA ATATAACCTG CACCTGTTGA TTCCCGGCAC GTTCGTCGAT
CCTTGGTACA AGAGACTTCT TGACGATATT GTAAAGGCAA GCAAACTTGA AAGCAAGGTA
ACAATGCTTA GTCTTAAGTT CGTCAGGGTA GAGGACAGGG GAATCTACTA CGAGCTAGCA
GACATAGTTT GTTACCCTTC GTACTACGAG GCCTGGGGTA TGGTGGTTAA CGAGGCGGCA
TATGCCGGGA AACCGGTAAT ATCCACGAGA ACGTGCGCCG CGGCATACGA CATACTCTTC
GGACATCCCG AACTCGTAAT ACCTCCAGGA AACGTTGAAG AACTAGCCAA AAGCCTAAAA
CTTTTAGCAA TGGATGCCAA TAAAAGAAAG GCTATCGGAA TGGAATTGAA ACGCTTAATA
AGCGAGAAGT ACTCCTACGA GGAAATGCTG AAAGGCTTCC TCAAAGCCAT AAAATACACC
TTGGTAAATC AGCTTACCGA GCAGTCACAA AACAAGCTAG ATTAG
 
Protein sequence
MKQVQEFSHK GKYCKKLRIA FVHNYYIHYR VPLFKALSRV FHVKFFFDDV YEYVKKPEKE 
LDFVINKGPR IKGIRLPVTL LFHLLKQRPH LVIAGDSTYP STLIAFLTSK MLRAKFILWE
ERWFWHSSLL SNLLWPFSRT VALKADALIV PGTLSKEFYK NIGVEKGRIF VAPNASYVDI
NEEIKNRARA LRRKLGLDNK IVVLYIGRVI PLKGVHLILK ALTKINEYNL HLLIPGTFVD
PWYKRLLDDI VKASKLESKV TMLSLKFVRV EDRGIYYELA DIVCYPSYYE AWGMVVNEAA
YAGKPVISTR TCAAAYDILF GHPELVIPPG NVEELAKSLK LLAMDANKRK AIGMELKRLI
SEKYSYEEML KGFLKAIKYT LVNQLTEQSQ NKLD