Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_0640 |
Symbol | |
ID | 4601437 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 591559 |
End bp | 593754 |
Gene Length | 2196 bp |
Protein Length | 731 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 639773413 |
Product | oligosaccharyl transferase, STT3 subunit |
Protein accession | YP_920046 |
Protein GI | 119719551 |
COG category | [R] General function prediction only |
COG ID | [COG1287] Uncharacterized membrane protein, required for N-linked glycosylation |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.252493 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGCGAGAG GCGAGAAGGT TGTTGGAAAG CTGGTATCTG CCTTGGAGTT TCTAGGCAGC CCGAAGGTTG TCGTCGCTGT TCTCCTACTT ATGTCGTTCA CAGTAACGTT GCTTGCAAGG CTCACACCCA TGCATTGGGG CGTGTATCTC AACGAGTTCG ACCCTTACTA CGAGTACTAC CTTTCTGAGC AGTTACTCGC ACACGGCAAC GGCAACTACT TCGCAGGGGT TGCGTGGTGG TATCACTGGT GGTTCGAAAA CCCCAAACCT AGAGACACGC TTTTCTGGGC TCCGGAAGGG AGAGACCTTA GAGGGACAAG CCAACCTGGA CCAGCGTTCT TTAGCGCTGG TGTCTATACT CTCTTGAGGA CTCTCGGCTT CGACGTTTCG CTGTACTACG TCCACGCTTT CCTAGTACCC TTTGTAGCGT CACTAGCGGT GTTCACGGCC TACTTGCTGG GGAGCGAGCT AAAGGACTAC AGAGCGGGCG TGCTAGCGTC CGTACTCATA GCTCTAAGCT GGGCGTACAT GTACAGAACG AATCTAGGCG CGAAGCACGA GGCGATAGCC ATTCCATTCA TGTTGCTTGG TTTTTACCTG TTCCTTAAAG GCTACAAGAA AAAGTCGCTA CTACTGTCGA TACTGGCCGG GCTCTCCCTC GGAGTAGTAG TTCTCGCGTG GGGAGCCTAC GTTTACCCGT GGAACCTACT GGCGCTCGTC GTCCTCTTCT GGCTATTCTT CCACCCGGAC GATACCGCGA TAGCCAGGTC GTACGTAGCA ACAAACCTTG TGGTAACATT CTTCGTGGCT ACGACGCCCA GGTTCGGCCC CTCCGTAGCG TTCATGTCTT TCTTGGGCTT CCTGCCGCTA GTGGCCACCC TAGCTTCGAT ACTGGTAATC TTCGGCATAA GGGTTCCCTC GAGTAGCCTT GGGGCCAAGA AGACACGTAG GACTTTACTC GTACTTCTCG TCGTCTTGCT CGTAGTTTTC GCCTTGGGGA CTTACCTCGG AGTGTTCCGG GGGCTTGCCG GCAGAATAAT GGCTGTAGTC ATGCCCCTTG TACGCGAGCC TGGCGTTACC ACGGTGGCCG AGCACCAGGT TCCCACGTGG AACCAGCTCT TCGACGACTT CCAGACCTCG CTTATATTCG CGTTTTTCGC AGGCTACCTC TACTTCCTTA AGTCGAAGGA TGACTTCAAC AGCGCCTTTA TCTCGCTCTT CATAGCTACG GCGGTGTACT TCTCCGCCTC CATAGTTAGG CTTCTTCTAC TCCTCTCCCC CGCCGTCGCG ATTGCAGGGT CTCTCGGTCT CGTCGAAATA CTGGATAGGC TGGCACTACC TGCGGAGAGA AGCTACGACA AGAGACACAG AGGAGCCTCC ACCCAGACAA CGCGCCACCT AGCGATCCTG ATAGCGGTAA TCCTGTTGCT CCTCTTCTCC CCGTCTATAC TCGCCTCAAA AATACCGCTG AACTCCCACC AACCACCACT CATACTCACG TCTTCCGTGC CCCTAGTGCG GTACGACTAC GAGTACATGG ACTGGCTGTC CGCCCTTCAG TGGATATCCG AGAACGTGCC TAGGGATGCG ACGATAGCGA CCTGGTGGGA TTACGGGTAC TGGATAAGCG TGAACACGGG AAGGAAGACT ACGTGTGACA ACGCTACTAT AGACACGAAG CAGATACAGA AGATAGCTAA GGCCTTTATG AGCGACGAAG ACACCGCGCT AAGGATATTC AAGGAGCTGA ACGTCTCCTA CGTCGTAGTG TTCGAGCCTC TCCAACAGCT ACAGCTCTCG AACGGCCTCA CGGTATGGTT CTCGATGATG CACCCCGCGC TCGGCGGAGA CATAGCTAAA TCTCCGCAAA TGCTCAAGTG GATCGGGCTC AGCGCTAACG ACTACATATA CGGGTATAAC AACGGATCTT TCGCCTACCT GCAACAAGGG GGCTACACGC TCTACCTGAT ACTTCCAGCC AACACCCCGC AAGCGCTCAA CGCGACGCTC TACAGGATGG TATACACGAG AAACCATAAG CAGCAAGTAT TCATCTTTGA CCCCTTCCTC TACCAGCTCT TCGGGCTCCA GGGATACAAA GGACCCACGT ACACGTTCGC CCCCCTCAAG AACTTCGAGC TCGTATACGT GTCAGAGCCA AACGGGTGGG TAAAAGTTTT CAGGGTCAAG GGGTAG
|
Protein sequence | MARGEKVVGK LVSALEFLGS PKVVVAVLLL MSFTVTLLAR LTPMHWGVYL NEFDPYYEYY LSEQLLAHGN GNYFAGVAWW YHWWFENPKP RDTLFWAPEG RDLRGTSQPG PAFFSAGVYT LLRTLGFDVS LYYVHAFLVP FVASLAVFTA YLLGSELKDY RAGVLASVLI ALSWAYMYRT NLGAKHEAIA IPFMLLGFYL FLKGYKKKSL LLSILAGLSL GVVVLAWGAY VYPWNLLALV VLFWLFFHPD DTAIARSYVA TNLVVTFFVA TTPRFGPSVA FMSFLGFLPL VATLASILVI FGIRVPSSSL GAKKTRRTLL VLLVVLLVVF ALGTYLGVFR GLAGRIMAVV MPLVREPGVT TVAEHQVPTW NQLFDDFQTS LIFAFFAGYL YFLKSKDDFN SAFISLFIAT AVYFSASIVR LLLLLSPAVA IAGSLGLVEI LDRLALPAER SYDKRHRGAS TQTTRHLAIL IAVILLLLFS PSILASKIPL NSHQPPLILT SSVPLVRYDY EYMDWLSALQ WISENVPRDA TIATWWDYGY WISVNTGRKT TCDNATIDTK QIQKIAKAFM SDEDTALRIF KELNVSYVVV FEPLQQLQLS NGLTVWFSMM HPALGGDIAK SPQMLKWIGL SANDYIYGYN NGSFAYLQQG GYTLYLILPA NTPQALNATL YRMVYTRNHK QQVFIFDPFL YQLFGLQGYK GPTYTFAPLK NFELVYVSEP NGWVKVFRVK G
|
| |