Gene Tpen_1730 details

Gene Information

Locus tag: Tpen_1730
Symbol:
ID: 4601755
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermofilum pendens Hrk 5
Kingdom: Archaea
Replicon accession: NC_008698
Strand:
Start bp: 1674348
End bp: 1675610
Gene Length: 1263 bp
Protein Length: 420 aa
Translation table: 11
GC content: 67%
IMG OID: 639774503
Product: glycosyl transferase family protein
Protein accession: YP_921128
Protein GI: 119720633
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis
TIGRFAM ID:
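
The start, end, gene length, and protein length fields can be cross-checked against each other. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive and that the CDS ends with a stop codon; the variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 1674348
end_bp = 1675610
protein_length_aa = 420

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS includes a terminal stop codon, so the number of
# encoded residues is the codon count minus one.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_length = codons - 1

print(gene_length_bp)           # 1263
print(remainder)                # 0
print(expected_protein_length)  # 420
```

Under these assumptions the computed values agree with the listed Gene Length (1263 bp) and Protein Length (420 aa).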


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.461107
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACGTCGG AGCCGTCAGC GCTACTCGGA GCCGTAGCCC CTCCGGGATT GCTCCCGGAA 
GCCTTCTACC TGCTTCTAGA CGCCCTCTCC CTCCTCACCC TCGCGGGCAT ACTCGCGTGG
TCCGCCTACC ACGCCCCCAT AATCGTCGCC GGGCTACTAG CCCCCCGGGG CAGCGGGGAC
GACCCGGGGA ACGGGCTTCC CAGGGTGACA GTCATAGTGC CGTCGAAGGA CGAGGGGAGG
CGCGTCGAGC GTTGCCTCAA CGCTATCCTA TCCTCGGACT ACCCCCTGGA GAAGCTCGAA
GTGATAGTCG TCGACGCCAG CTCGGACGGG TACGTCGAGG AGATAGTGCG GAGAGCCGGA
GAAAGGTACC CGGGCGCGGT CAGGTTGATA AGGGAGGAGG AGCCCCGCGG GAAGCCCGCC
GCGCTGAACA GGGCGCTTAG GGAGGCGACG GGGGAGGTCG TAGCGGTGTT CGACGCGGAC
AGCGTCCCCG AGAGGGACGC CATAAGGCGC GCCGTGAAGC ACCTCGAGGA GCCGGGGGTC
GCCGCGGTCC AGGGGAAGAC GCTGGTACTC AACGAGCGCG AATCGGTGCT CGCGAGGGTA
GCCTCCAAGG AGGAGAAGGC CTGGTTCCAC GCGCTCATAC GCGGGAGGGA GAGGCTCGGG
CTCTTCGTAC CGCTCACCGG GAGCTGCCAG TTCGTTAAGA GGAGCGCGCT CGAGGAGGTC
GGGGGGTGGA GGGAGGACGC CCTCGCGGAG GACCTGGAGC TATCGATGGA CCTCCTCGCC
AGGGGCTACA GGGTGAAGTA CGCGAACGAC GTCGTCTCCT GGCAGGAGGC GCCGACCTCG
CTGAGGAGCC TCGCCGTGCA GAGGAACAGG TGGTATAGGG GGTACATGGA GGCATTCGCG
AGGCACCTGC GCCTCGCCCT CGCGGGCAGG AGGGGGCTGG ACGCCGCCAT CCTCTCGGCG
GGGCCCTACC TGATGGCGCT CAGCCTCCTA GCGGTGGCCG CCTGGCTCGC CTCGACGGCT
TTGCCCCACG TAAACCACTT CTCGACACCC GCCGCCCTCG TCGCCGCGCT GAACGCCGTG
TCCCTCTTCT CGGTCAGCGT CGCCCTCGCG CTCAGCGAGA GGCCCGTAAG CGCGAAGAAC
CTAGCCTGGG TCCCGGTTAT CTACGCGTAC TGGTTCACGC TCTCCGCGGT CGCCCTCCAC
GCCCTCGCCG AGATAATCCT GAGAAGGCCG CGCGTCTGGA GAAGGACTCC GAAGCCCATA
TAA
 
Protein sequence
MTSEPSALLG AVAPPGLLPE AFYLLLDALS LLTLAGILAW SAYHAPIIVA GLLAPRGSGD 
DPGNGLPRVT VIVPSKDEGR RVERCLNAIL SSDYPLEKLE VIVVDASSDG YVEEIVRRAG
ERYPGAVRLI REEEPRGKPA ALNRALREAT GEVVAVFDAD SVPERDAIRR AVKHLEEPGV
AAVQGKTLVL NERESVLARV ASKEEKAWFH ALIRGRERLG LFVPLTGSCQ FVKRSALEEV
GGWREDALAE DLELSMDLLA RGYRVKYAND VVSWQEAPTS LRSLAVQRNR WYRGYMEAFA
RHLRLALAGR RGLDAAILSA GPYLMALSLL AVAAWLASTA LPHVNHFSTP AALVAALNAV
SLFSVSVALA LSERPVSAKN LAWVPVIYAY WFTLSAVALH ALAEIILRRP RVWRRTPKPI
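
As a rough illustration of how the gene sequence relates to the protein sequence, the sketch below translates the first 30 nucleotides of the gene. It assumes translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the two differ in permitted start codons), and it builds the codon table by hand instead of relying on any library. The names `CODON_TABLE` and `translate` are hypothetical helpers introduced for this example.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
# Assumption: table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, ignoring spaces."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First three 10-base groups of the gene sequence listed above.
first_30_nt = "ATGACGTCGG AGCCGTCAGC GCTACTCGGA"
print(translate(first_30_nt))  # MTSEPSALLG
```

The result matches the first ten residues of the listed protein sequence. Translating the full gene the same way would end in `*` for the terminal TAA stop codon.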