Gene Tpen_1743 details

Gene Information

Locus tag: Tpen_1743
Symbol:
ID: 4601768
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermofilum pendens Hrk 5
Kingdom: Archaea
Replicon accession: NC_008698
Strand:
Start bp: 1684774
End bp: 1686036
Gene Length: 1263 bp
Protein Length: 420 aa
Translation table: 11
GC content: 50%
IMG OID: 639774516
Product: glycosyl transferase, group 1
Protein accession: YP_921141
Protein GI: 119720646
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
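
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes the coordinates are 1-based and inclusive and that the gene length includes the terminal stop codon (the gene sequence in this record ends in TAA).

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa of the product, assuming the CDS includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
length = gene_length(1684774, 1686036)
print(length, protein_length(length))  # prints: 1263 420
```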


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCATTCGA GTAAGCCTCA ACTAGCGGTG GTCGCTAACC CCTCGGAGCA CGGAAGCGGC 
GGCGAGCTTA GGGCTTTAAG GTCTGTAAAG GAGTATGCTA AACATTTCCA CGTCTACCTT
TTCACCCCTC TACGCGCGGT AGGCTCTACG TCCTTAAGGA GCCTTGCGCG GCTGAAGAGC
TTAGGCGTAG CTCTCTCGGG ATGCATTACG TATCCGAGAA TTGGGAAGCT GGAGCCCGAA
TTCAAGCTTT TTTTCCCGCA ATTAACTAGG CTAACGACGA CAAAACTCCC CTTCTTCGAC
GGCATCGTAG TGCTACACGA GAATATAGAC TATCTATACG CTGGCTACCT TCTCGGCAAA
GCTAGCGGAG CGCCGGCGAT GGTTCTCCTG CAGAACCCCC CTCTTTTCGG GTCAAAGAAA
AGGCTCAGTG AGATACTCAA AGCTGTATAC CTGTGGGAGA AGCTCACTTC AGCTTCCACT
CTCGAGGAAG CTTATGCTAC CGTGCGCCTT GCAAGGCTAC AAGTGAGAAG ACCAATAGAG
GAGGCCAGAA TTCGAATGTT GCTCAACAAG TACTCGCTCG TAGTAGGGGT GAGCAGAGCA
ACAGTTCTAG AAATGGGGGA ACCCTGGTTC AGCAAGGCTT TCTACCTAGA TCCGGGTGTA
AATCTAAACG AGGAGGAAGT ACATTTATTG AGCAGGATTC GCAAAAGTGT TAAAGAGAAG
GAAAACTGCA TACTGTACAA AGGCGGCCTT ACTCCCGTGA AAGGTATCTT AGACGTTCTC
CTTGCCTACA GACTCATTAG GAAGGAGAGA GGCGATCTGA AACTCGTCAT CACAGGGAAA
CTGGACAGCA AAACTCATAG AAAGCTTCTC GTAGCACTTA AAAAGCTAAA CCTCAAGGAC
TACGTTGTCC TTACAGGATT TATATCAAGG GAGCAGCTCT TCGAGCTCAA CGCTAAGGCA
AGGCTCCTGC TCCACCCAAG CCATATGGAT TCATTTCCAT ACACTGTCCT GGAGTCTTTA
CACGCAGGAA CCCCCGTCGT AGGCTACGAT ATACCCGCGC TCAAAATATA CTACGGCGGG
CTTCCCGGAG TCAGGCTTGT ACGGGAAGGC GACGTAGAAG CCTTAGCAAC AGAGGCTATC
GACATGCTGG AGAGAAGCCC CAAAGAGGTT AATCCTCCAA GCTTGAAAAG CTGGAACGAG
ATAATAGAGG AGGAGCTTAG CTTGGTTAAA AAAAAATTGC TAGGCGAACA TAGACACTCG
TAA
 
Protein sequence
MHSSKPQLAV VANPSEHGSG GELRALRSVK EYAKHFHVYL FTPLRAVGST SLRSLARLKS 
LGVALSGCIT YPRIGKLEPE FKLFFPQLTR LTTTKLPFFD GIVVLHENID YLYAGYLLGK
ASGAPAMVLL QNPPLFGSKK RLSEILKAVY LWEKLTSAST LEEAYATVRL ARLQVRRPIE
EARIRMLLNK YSLVVGVSRA TVLEMGEPWF SKAFYLDPGV NLNEEEVHLL SRIRKSVKEK
ENCILYKGGL TPVKGILDVL LAYRLIRKER GDLKLVITGK LDSKTHRKLL VALKKLNLKD
YVVLTGFISR EQLFELNAKA RLLLHPSHMD SFPYTVLESL HAGTPVVGYD IPALKIYYGG
LPGVRLVREG DVEALATEAI DMLERSPKEV NPPSLKSWNE IIEEELSLVK KKLLGEHRHS
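
The protein sequence can be reproduced from the gene sequence by translating it codon by codon, and the GC content field can be recomputed by counting G and C bases. The following is a minimal sketch, not the database's own pipeline: the helper names are hypothetical, it accepts the space-separated 10-base blocks used in this record, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which start codons are permitted, which does not matter here because the gene starts with ATG). Codons containing characters other than A, C, G, T are not handled.

```python
# Amino acid assignments in NCBI order (bases T, C, A, G), as used by table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        amino_acid = CODON_TABLE[s[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
first_line = "ATGCATTCGA GTAAGCCTCA ACTAGCGGTG"
print(translate(first_line))  # prints: MHSSKPQLAV
```

The translated fragment matches the first ten residues of the protein sequence above. Passing the full gene sequence to `translate` and `gc_content` would allow the Protein Length and GC content fields to be checked in the same way.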