Gene Tpen_1778 details


Gene Information

Locus tag: Tpen_1778
Symbol:
ID: 4601939
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermofilum pendens Hrk 5
Kingdom: Archaea
Replicon accession: NC_008698
Strand:
Start bp: 1720614
End bp: 1722164
Gene Length: 1551 bp
Protein Length: 516 aa
Translation table: 11
GC content: 60%
IMG OID: 639774551
Product: glycosyl transferase, group 1
Protein accession: YP_921176
Protein GI: 119720681
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0058] Glucan phosphorylase
TIGRFAM ID:
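
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1720614
end_bp = 1722164
gene_length_bp = 1551
protein_length_aa = 516

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# A coding sequence should be a whole number of codons (3 bp each).
codons, remainder = divmod(span_bp, 3)

# Assumption: the last codon is a stop codon and is not part of the protein.
residues = codons - 1

print(span_bp, remainder, residues)  # 1551 0 516
```

Under these assumptions the span matches the reported gene length, and the codon count minus the stop codon matches the reported protein length.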


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGACTCCG AGCCCAGGGT TGTGGTTAGC GTTACGCCCG AGCTGGCCCT TGACGACGGC 
TACACGTTCG CCGGAGGCCT AGGCGTGCTG GAAGGCGACA AGTTCTACGC GGCGGCGAAG
CTGGGGCTGA AGTACTACGC CTTGACGCTG TTCTACAGGA ACGGCTACGT GGACTACGCG
TTCGATGACT CGCTGAACCC GGTCGCCAAG CCGCAACCCC AGCCTGCGAG CTTCCTGGGG
TCTCTCAAGG ACGGCGGCGA GCTGGAGGTC TTCCTGAAGG GGGAGAAGGT AGCCGTCAAG
GCATGGGAGT ACGAGCATGG AAGCGCTAAG GCTGTGTTCT TCGAGCCCGT AAGCCCGGAT
TGGGCGCGTA GCCTCGGCGA GAGGGTTTTC CTGGAGAGGG ACGCGGAGGA GAGGTTCTAC
AAGTACATCC TCCTCGCGCG CGCCGCCGTA GCCTACATGA AGGACAGGAT AGGCCTGGAG
AACATAGCGT ACATAGACCT CCAGGAGGCG TACACCGCCG TGATACCCCT AGTGTTCAAG
ATACCCGGGA GGTACCGGCT GGTGATACAC ACTCCCGGGC CCTGGGGGCA CCCGTCCTTC
CCGAGGGACC TCTTTGCCAA GGAGCTCGGC TACCGGTTCA TCGAGAACCC CGTTGTGCTG
ACAAGTATCG GGGCAGCCAC CGCCTACGAG GTAGTAATGG TCAGCTCGAA GCACTTCGAC
ATAATGAGGC GCGTGATACC CCAGTACTAC CACAAAGCGA GGTTCGTGAC TAACGGCGTA
AACATAGATA GGTGGATGAA CCCCAAGCTG AGAAACCTGT TCGCGAGCGG GAGCCTCGAC
GTCGCTACGC TGAGAGGCGT CAGGCTCGAG ATGCGGGACC AGCTCGTCAG GTTCCTGAAG
TCCAGGAAAC AGGTCAACGT TGACCAGGAC ACCTTTATCT TCGCGTGGAC GCGCAGAGTG
ACGAAGTACA AGAGGCCCTA CTTCCCCGTC AGGCTCATAG AGGAGCTCGG CGACAGGGAC
ACGCTCTTCG TTCTCGGCGG GAAAGCGCAC CCGGAGGACA AGGAGGGGTT GCAGTACATG
AGGAAGTTCA AGGAGCTGGA GAAAACGCGG CCCAACGTTG TCTACGTCCA CGACTACTCC
GTGGAGAGCG CTAAGATCAT ACTCTCGGGG GCCGATGTGC TGGCATTTAC GCCTTTCCCC
GGGTGGGAGG CTTCGGGGAC GAGCTTCATG AAGGCAGGGG TTAACGCTGT CCCGTCCATC
GCTTCGCGCG ACGGCGCCGT AGTAGAACTC CTCACGGACG GGGTGAACGG GTGGCTGTTC
GGGGAGGACA TAAGGGAACT GATAGACTTC GGGAAAGACC CCCGCGTGAG CGAGATCGAC
GAGAAGGACT ACGAGGAGTT CAAGAGGAAG TACGCCCAGG CTAAGGATCT CTACGCGAAC
GACAGGGAAG GCTTCCTCAA GGTCGCGCTG AGCGCTGTCC TCTCGCTGAC GATGCGCGTC
GACATAGTGA GGGCACTGAG GGAGTACTAC CCGGACCTCG TACAAACCTA G
 
Protein sequence
MDSEPRVVVS VTPELALDDG YTFAGGLGVL EGDKFYAAAK LGLKYYALTL FYRNGYVDYA 
FDDSLNPVAK PQPQPASFLG SLKDGGELEV FLKGEKVAVK AWEYEHGSAK AVFFEPVSPD
WARSLGERVF LERDAEERFY KYILLARAAV AYMKDRIGLE NIAYIDLQEA YTAVIPLVFK
IPGRYRLVIH TPGPWGHPSF PRDLFAKELG YRFIENPVVL TSIGAATAYE VVMVSSKHFD
IMRRVIPQYY HKARFVTNGV NIDRWMNPKL RNLFASGSLD VATLRGVRLE MRDQLVRFLK
SRKQVNVDQD TFIFAWTRRV TKYKRPYFPV RLIEELGDRD TLFVLGGKAH PEDKEGLQYM
RKFKELEKTR PNVVYVHDYS VESAKIILSG ADVLAFTPFP GWEASGTSFM KAGVNAVPSI
ASRDGAVVEL LTDGVNGWLF GEDIRELIDF GKDPRVSEID EKDYEEFKRK YAQAKDLYAN
DREGFLKVAL SAVLSLTMRV DIVRALREYY PDLVQT
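
The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is a minimal illustration, not the database's own pipeline. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the two tables differ in which codons may act as start codons), so alternative start codons are not handled here. The helper names `clean`, `gc_percent`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG order; the amino acid assignments
# are those of the standard code, which table 11 shares.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence above.
first_row = "ATGGACTCCG AGCCCAGGGT TGTGGTTAGC GTTACGCCCG AGCTGGCCCT TGACGACGGC"
print(translate(first_row))  # MDSEPRVVVSVTPELALDDG
```

The printed residues correspond to the first two blocks of the protein sequence above. Passing the full gene sequence to `translate` and `gc_percent` would allow the complete protein sequence and the reported GC content to be compared in the same way.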