Gene Tpen_0964 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_0964 
Symbol 
ID4600438 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp914482 
End bp916284 
Gene Length1803 bp 
Protein Length600 aa 
Translation table11 
GC content61% 
IMG OID639773742 
Producthypothetical protein 
Protein accessionYP_920367 
Protein GI119719872 
COG category[S] Function unknown 
COG ID[COG3410] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.624537 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGTTGACC CGGACCCAAG CAAAGCGAGA GACGTCGTCG GGGCGCTAGT CGAGGGCTAC 
AGAAGGTTCT ACGGCGAAGA CCCCTCCGGC GAGCTCGTAG CGTCGTGGAG TAGCAGCGTT
GCCCGCGTCC TAGGCGTCCT GGAGAGGGCG GGAGGCTTCC CCGCGGTGCT CGAGCTACCG
CTGTTCGGGT CCGAGAGGGC CGACTTCGTA GTCGTCGGCA GGGGTAGGGC GCTCGTAGTC
GAGGCGAAGG GCTGGAGCAC GGTTGAGAAG CTGAACTACG TGGTGCAGGT CGACGGGCTG
AGAGAGGTGG ATCCGTGCTA CCAGGTGGAG AACTACGTTT CGAAGCTCAA GTACTTCAGC
ACCGCCGCGG ACAGGGTGAG GCACTTCGAC GGGGTAGCGT ACCTCTACGG AGGCGCGAGC
TACTCGGATG GCTGCAGGAT CGCGAGGAGC GACGCGGAGC TGGAGGAGTA TGTAGGCTCC
CTCGGCTCGC CCGGCGACGA GGGAGACGTC GAGGCGGTAG CAAGCGCCAA GTTCACGGTG
AGGAGGGATA TAGTCGAGTT CCTGCGGAGC CACAGAGACA AACTCCTCAA GGAGGCGGCG
CGCTTCCTAG CCTCGGAGGG GTACGGGCTG GGCAGGGAGC AGCTAGTCCT GGTGCACGAC
GTCCTAGAGG CGCTGGAGGC GGGGTCTAGG AAGGCTTTCT TCGTCAGGGG CGGGACCGGC
TCCGGGAAGA CGCTCGTCGC CCTAACCCTC CTCTTCGAAG CGGTTTCGAG GGGCTACCAC
GCCGTCCTGG CTTACAAGAA CAACAGGTTG CTCAACACGC TCAGGTACGC CCTCTCGTTG
CGCGCACCGC GCGGTGCGCC CAAGCTCAGC GCACTCATAG TGTACTACTC TACCGGCAGA
GGGCACGGGC TCGGGGAGAG AAGAGCGTAC GAGAAGGGAC TTTACAGAAA CCTGAATCTC
GCGGTGCTCG ACGAAGCGCA GAGGATGACG CTCGAGAACA TCGAGTACAC AATGAAGAGC
GCCCCCGTCA CCGTCTACTT CTACGACGAC AAGCAGATAC TCATAGGCTA CGAGGAGGGC
TTCAGGGAAA ACTTCCTCGA AGCGGCGGAG AGGCTCGGGC TCGCCTACGA CGAGAGGGAG
CTGAAAACGC TCTACAGGGT CCCGCCCGGC TACGTGAAGC TCGTGGAAAG CCTGGTCTAC
AGCGGGGCAG TCGCCCAGCA GGACGTCCAG GGCTACGACA TCAAAGTATT CGATAACCCG
GCAGACATGC TTGAAGCCCT CCAGGAGAAG GCGAACAAAG GCTTCAAGGT AGCCCTCGTG
TGCGCCTTCA CGGAGACGAG GGGCGACAAG AACGACCTGA ACAGCCCGGA GAACAGGAGG
CTCACAGTCA AGCGCGGAGA CCGCGAAGAA GTAGTCACGT GGCTCATGGA CGAAAAAGAG
GAGTACCCCA AGTATTGGTG CGGAGAGCTG GGAAACCCCC TCACGCGCTG CGCCTCAGTC
TACGGGGCAC AAGGCTTCGA GGCAGACTAC GTCGGAGTAG TATGGGGCAG AGACATGGTC
TGGAGGTGCG GGCCGCTGGG CTGTGGGTGG AGCGTAAACC CCGACGCCAT AACAGACTAC
GTAGGCGGGC AGTACTCGCT GGAGAAACTA GCCAGGAAAG ACCCCGGCAA AGCCCTCGAA
CTACTCAAAA ACAGGTACTA CATCATGCTG ACAAGAGGAA TCAAGGGAAC CTACATATAC
CCCGAAGACG GGGAAACAGG GCGCCTACTC AGAGAAGTAG TCGAAAAGCT ACAGCAACAC
TAA
 
Protein sequence
MVDPDPSKAR DVVGALVEGY RRFYGEDPSG ELVASWSSSV ARVLGVLERA GGFPAVLELP 
LFGSERADFV VVGRGRALVV EAKGWSTVEK LNYVVQVDGL REVDPCYQVE NYVSKLKYFS
TAADRVRHFD GVAYLYGGAS YSDGCRIARS DAELEEYVGS LGSPGDEGDV EAVASAKFTV
RRDIVEFLRS HRDKLLKEAA RFLASEGYGL GREQLVLVHD VLEALEAGSR KAFFVRGGTG
SGKTLVALTL LFEAVSRGYH AVLAYKNNRL LNTLRYALSL RAPRGAPKLS ALIVYYSTGR
GHGLGERRAY EKGLYRNLNL AVLDEAQRMT LENIEYTMKS APVTVYFYDD KQILIGYEEG
FRENFLEAAE RLGLAYDERE LKTLYRVPPG YVKLVESLVY SGAVAQQDVQ GYDIKVFDNP
ADMLEALQEK ANKGFKVALV CAFTETRGDK NDLNSPENRR LTVKRGDREE VVTWLMDEKE
EYPKYWCGEL GNPLTRCASV YGAQGFEADY VGVVWGRDMV WRCGPLGCGW SVNPDAITDY
VGGQYSLEKL ARKDPGKALE LLKNRYYIML TRGIKGTYIY PEDGETGRLL REVVEKLQQH