Gene Tpen_0640 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_0640 
Symbol 
ID4601437 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp591559 
End bp593754 
Gene Length2196 bp 
Protein Length731 aa 
Translation table11 
GC content54% 
IMG OID639773413 
Productoligosaccharyl transferase, STT3 subunit 
Protein accessionYP_920046 
Protein GI119719551 
COG category[R] General function prediction only 
COG ID[COG1287] Uncharacterized membrane protein, required for N-linked glycosylation 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.252493 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGCGAGAG GCGAGAAGGT TGTTGGAAAG CTGGTATCTG CCTTGGAGTT TCTAGGCAGC 
CCGAAGGTTG TCGTCGCTGT TCTCCTACTT ATGTCGTTCA CAGTAACGTT GCTTGCAAGG
CTCACACCCA TGCATTGGGG CGTGTATCTC AACGAGTTCG ACCCTTACTA CGAGTACTAC
CTTTCTGAGC AGTTACTCGC ACACGGCAAC GGCAACTACT TCGCAGGGGT TGCGTGGTGG
TATCACTGGT GGTTCGAAAA CCCCAAACCT AGAGACACGC TTTTCTGGGC TCCGGAAGGG
AGAGACCTTA GAGGGACAAG CCAACCTGGA CCAGCGTTCT TTAGCGCTGG TGTCTATACT
CTCTTGAGGA CTCTCGGCTT CGACGTTTCG CTGTACTACG TCCACGCTTT CCTAGTACCC
TTTGTAGCGT CACTAGCGGT GTTCACGGCC TACTTGCTGG GGAGCGAGCT AAAGGACTAC
AGAGCGGGCG TGCTAGCGTC CGTACTCATA GCTCTAAGCT GGGCGTACAT GTACAGAACG
AATCTAGGCG CGAAGCACGA GGCGATAGCC ATTCCATTCA TGTTGCTTGG TTTTTACCTG
TTCCTTAAAG GCTACAAGAA AAAGTCGCTA CTACTGTCGA TACTGGCCGG GCTCTCCCTC
GGAGTAGTAG TTCTCGCGTG GGGAGCCTAC GTTTACCCGT GGAACCTACT GGCGCTCGTC
GTCCTCTTCT GGCTATTCTT CCACCCGGAC GATACCGCGA TAGCCAGGTC GTACGTAGCA
ACAAACCTTG TGGTAACATT CTTCGTGGCT ACGACGCCCA GGTTCGGCCC CTCCGTAGCG
TTCATGTCTT TCTTGGGCTT CCTGCCGCTA GTGGCCACCC TAGCTTCGAT ACTGGTAATC
TTCGGCATAA GGGTTCCCTC GAGTAGCCTT GGGGCCAAGA AGACACGTAG GACTTTACTC
GTACTTCTCG TCGTCTTGCT CGTAGTTTTC GCCTTGGGGA CTTACCTCGG AGTGTTCCGG
GGGCTTGCCG GCAGAATAAT GGCTGTAGTC ATGCCCCTTG TACGCGAGCC TGGCGTTACC
ACGGTGGCCG AGCACCAGGT TCCCACGTGG AACCAGCTCT TCGACGACTT CCAGACCTCG
CTTATATTCG CGTTTTTCGC AGGCTACCTC TACTTCCTTA AGTCGAAGGA TGACTTCAAC
AGCGCCTTTA TCTCGCTCTT CATAGCTACG GCGGTGTACT TCTCCGCCTC CATAGTTAGG
CTTCTTCTAC TCCTCTCCCC CGCCGTCGCG ATTGCAGGGT CTCTCGGTCT CGTCGAAATA
CTGGATAGGC TGGCACTACC TGCGGAGAGA AGCTACGACA AGAGACACAG AGGAGCCTCC
ACCCAGACAA CGCGCCACCT AGCGATCCTG ATAGCGGTAA TCCTGTTGCT CCTCTTCTCC
CCGTCTATAC TCGCCTCAAA AATACCGCTG AACTCCCACC AACCACCACT CATACTCACG
TCTTCCGTGC CCCTAGTGCG GTACGACTAC GAGTACATGG ACTGGCTGTC CGCCCTTCAG
TGGATATCCG AGAACGTGCC TAGGGATGCG ACGATAGCGA CCTGGTGGGA TTACGGGTAC
TGGATAAGCG TGAACACGGG AAGGAAGACT ACGTGTGACA ACGCTACTAT AGACACGAAG
CAGATACAGA AGATAGCTAA GGCCTTTATG AGCGACGAAG ACACCGCGCT AAGGATATTC
AAGGAGCTGA ACGTCTCCTA CGTCGTAGTG TTCGAGCCTC TCCAACAGCT ACAGCTCTCG
AACGGCCTCA CGGTATGGTT CTCGATGATG CACCCCGCGC TCGGCGGAGA CATAGCTAAA
TCTCCGCAAA TGCTCAAGTG GATCGGGCTC AGCGCTAACG ACTACATATA CGGGTATAAC
AACGGATCTT TCGCCTACCT GCAACAAGGG GGCTACACGC TCTACCTGAT ACTTCCAGCC
AACACCCCGC AAGCGCTCAA CGCGACGCTC TACAGGATGG TATACACGAG AAACCATAAG
CAGCAAGTAT TCATCTTTGA CCCCTTCCTC TACCAGCTCT TCGGGCTCCA GGGATACAAA
GGACCCACGT ACACGTTCGC CCCCCTCAAG AACTTCGAGC TCGTATACGT GTCAGAGCCA
AACGGGTGGG TAAAAGTTTT CAGGGTCAAG GGGTAG
 
Protein sequence
MARGEKVVGK LVSALEFLGS PKVVVAVLLL MSFTVTLLAR LTPMHWGVYL NEFDPYYEYY 
LSEQLLAHGN GNYFAGVAWW YHWWFENPKP RDTLFWAPEG RDLRGTSQPG PAFFSAGVYT
LLRTLGFDVS LYYVHAFLVP FVASLAVFTA YLLGSELKDY RAGVLASVLI ALSWAYMYRT
NLGAKHEAIA IPFMLLGFYL FLKGYKKKSL LLSILAGLSL GVVVLAWGAY VYPWNLLALV
VLFWLFFHPD DTAIARSYVA TNLVVTFFVA TTPRFGPSVA FMSFLGFLPL VATLASILVI
FGIRVPSSSL GAKKTRRTLL VLLVVLLVVF ALGTYLGVFR GLAGRIMAVV MPLVREPGVT
TVAEHQVPTW NQLFDDFQTS LIFAFFAGYL YFLKSKDDFN SAFISLFIAT AVYFSASIVR
LLLLLSPAVA IAGSLGLVEI LDRLALPAER SYDKRHRGAS TQTTRHLAIL IAVILLLLFS
PSILASKIPL NSHQPPLILT SSVPLVRYDY EYMDWLSALQ WISENVPRDA TIATWWDYGY
WISVNTGRKT TCDNATIDTK QIQKIAKAFM SDEDTALRIF KELNVSYVVV FEPLQQLQLS
NGLTVWFSMM HPALGGDIAK SPQMLKWIGL SANDYIYGYN NGSFAYLQQG GYTLYLILPA
NTPQALNATL YRMVYTRNHK QQVFIFDPFL YQLFGLQGYK GPTYTFAPLK NFELVYVSEP
NGWVKVFRVK G