Gene Tpen_1725 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_1725 
Symbol 
ID4601750 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp1667611 
End bp1668675 
Gene Length1065 bp 
Protein Length354 aa 
Translation table11 
GC content54% 
IMG OID639774498 
Productglycosyl transferase, group 1 
Protein accessionYP_921123 
Protein GI119720628 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.24734 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGTTT GCTTTGTTTG CTCGCATTTA TGGCCTCACT CCTACGGCGG GGCAGAGAGG 
AGGTACTACA TCCTCGCCAA AGAGCTTATG AAGAGAGGGC ACGACGTCGT ATACGTCACG
TACGACTACG GCGAAAGCGA AATCCCCCTA GCGACCGTGG GACCCCCACC TAGGCTCTAC
GACGAGAAGG GCAGGAGGCG CCTACTCCCA GCGCTGGAGT TCGGCTGGAA GACCGCGAAG
CTCGTTGAAA AGCTGAAGTG CGACGTGGTA GACGTCACGG TGCCGTACAC GACCGCGCTT
TTCATGAAGC CACGTAGCTT CGTGCTGACG TACCACGAGT ACTGGGGAAA ATATTGGGAG
CACTACTTCT CCAAGCCTCT GGGAACACTC GCCGCCATCG CGGAGAAAAG GCTACTGAGG
AAAGCCGTGC TCGTAATCAC ACCTTCCAAG CTCGTGGCCA ACAGGATCCT AAGCGAGGTG
GGCACCGTGA AGGTAGCACC GGTACCCATA GGTCTGAACC CCGACGACTA CGCGAAGTAC
AGGGGGCGAA GCAGGGACAT AGACGTCACC ATGATCGGCA GGTTCACGTA CACAAAGGGC
TGGTGGAGGC TAATAGAAGT ACTCAGGCAG GTAGACAAGC CCCTCCGAGT AGCAGTAGTA
GGAGACGGCC CACTCTACAA CGAAGTAACC GCGCAGCTCG AAAAGCTACA CCACGAGGTG
CACAGCTACA GGAAGGCAAG CGAAGAGGAA AAACTCGAAC TACTCGCGCG CAGCAAGTAC
TACCTAAACC TCTCAGACGC CGAAGGATTC AGCATAGCAA CCCTCGAAGC AATCCTATGC
GGAGCAACAC CCATAGTCCT CGACACAGGG CTCAATGCTG CAGTCGAAAT AGTAGAAGAG
ACAGGTTGCG GCTACATAGC CAAAAACCTG AGAGAAATTG CGGAACAAGT AGCCAAGGAA
ACCAGTACCT GCACGCCCAA TATATCGGGC TACACCATTC AGACTTTCGT CAACAATTAC
CTCAGATTTT TAGGAGAGCT TCAGACGATG AGAGTAAATG CGTGA
 
Protein sequence
MKVCFVCSHL WPHSYGGAER RYYILAKELM KRGHDVVYVT YDYGESEIPL ATVGPPPRLY 
DEKGRRRLLP ALEFGWKTAK LVEKLKCDVV DVTVPYTTAL FMKPRSFVLT YHEYWGKYWE
HYFSKPLGTL AAIAEKRLLR KAVLVITPSK LVANRILSEV GTVKVAPVPI GLNPDDYAKY
RGRSRDIDVT MIGRFTYTKG WWRLIEVLRQ VDKPLRVAVV GDGPLYNEVT AQLEKLHHEV
HSYRKASEEE KLELLARSKY YLNLSDAEGF SIATLEAILC GATPIVLDTG LNAAVEIVEE
TGCGYIAKNL REIAEQVAKE TSTCTPNISG YTIQTFVNNY LRFLGELQTM RVNA