Gene Tpen_0646 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_0646 
Symbol 
ID4601512 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp595998 
End bp598124 
Gene Length2127 bp 
Protein Length708 aa 
Translation table11 
GC content63% 
IMG OID639773418 
Producthypothetical protein 
Protein accessionYP_920051 
Protein GI119719556 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000518672 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCAATG ACCCCAGACA GCCCGGCCGC GGGAGGACGG GGGAGGAGGG GGCCGAGGCG 
CAAGCAAAGA CTAGGCCTGG GGGAGAAACC CTGCAATCCC CTGCCGCTAA AATTGAATTT
TTTTCCCGCG TGTCTAACCC AACCCTCCCA TCATTGGGGG CCCTCCCCGA CAGGGAGGGG
TTCAATGAGC CCCGCGCGCG GGGAGGCCCG GCGCGGGGCG AGGGAGGCGT AGCCCGCGGG
GAGCCCCGGG GTTACGCCTC TCCGGGGCCC AGTGAGCGCC CCGGGGTGGG GAGGCCCTCC
CGGGCGCGAG GGGAGCGCTT GGCCGCGGGG AGGCCCGGGC CAGGCGCTCT AGACACGCGG
GGGCGGCGGG GGAGGCGGGG CTCTGGGGCC CCAGCCCCGG GCTGGGGCGT CTCTCCCCGG
TTGGGGAGAG ACCGGCCTAG GGGCCGATTC GTGAAGCCCG ACACGATGAG CCGAAAGGTG
ACGGTCGCCC CTAGTAAACG ACGCCGGCCT TCCCGGGGGG CCGGGCAAAG GCGGGCGAGG
CCTTGGCCCC AGCGGGCCCG CCCGAGAAAG CAATCACTTC GAAGCACTCA CCGCTCACGC
CTCCCTCCCG CCACGCTGAC CCCTGATAGC GCGCCCTCGC GGGGCGCCCC ACCCGCCACG
TGGAGGGTGG GTGATATGAG TGAGGCCTTG ATGAGGGGAA ATGGAGCCGT GGGGGAGGGG
GTCGAGAAGA CTGTTAGGGA GGAGGTTTGG AGGCGTGCGG CTCTAAGCTT GTATGCCACC
AGGACAGAGG ATAAGAGGAG GTCCAGGGGT AAGAAGAAGA AGGGCGAGAT ACACTACCGC
GGCCTGCACG ATGCTGTGAC CGAGGTCGAC TGGGACTTTA CCAGGTTCGC GGCACACGCG
CTGACCGTCG TGCCGGACGA CGTCTACCCG AGGTTTTACA GGTTCATCGA CATCGACGCG
AGGAAGTACC TCTTGCTGAA CAGCGACGAG AGGCCACGCG AAGGAAGCGC TGTGACGGAG
CTCAAAGGAA GGTTGCAAGC AATAGTCGAC GCCGGCGCCG ACGGTCTTAG AGCTGAGAAG
CATGGCAGAG TCTGGCACGT ATACATACCT AGAGAGAACT GGTATGTCGT CGTGAGCAAG
CCCACTAACG GCTGGTCCGT GCACGTACCG CTGAAGGGTT ACTGGGTTGA ATCAGAGTTC
CCGGAGGTTC TCGTGAGGAC ACCACCAGAC GTGCTTAGAA GCCTGCAGAA AGGGTGGGTC
TTAACGGATG TGACACCCCC TCGTGGACGC TTCAGCGACG TGAGTTTCGC CACGACGCAA
CCGTGGCAAC TCCCCAGTAC GCTCGCAAGC TTTCCGAGCG ACGATATCAA GCTCGGCGTC
ACGGCGGGCA TACTCGGCAG TACCAGGCTG AGCATCCAGT GGCACGCGCG TGTCTACGGC
TACGAGGAGG CGCTCTCCTG GGCTTCAAGG CTTATCGGCG AAGTTAAACG TGCAGAGTTC
CGCAGGTTGG TCGAGGAGTG CAAAGCGCTC AACGGCGACC CAGTGGCTCT TTTCACTGGC
TTCTTGGGGG ACGGCGGGCT CGAATACTTT CTAAGGCTTA GATGGCTCTA CTTCAAGGTT
GGACACGAGC TTCTCTACTT GCCGGCGGAG AGCGCCATTG TCAATGCTCG CTTGGCCGTG
GAGAGGGCAA GCGAGTACGT GCGCTTTGTC TCGCTGGTAA CTAAGTGTCC GAAGGTCAAA
CACTTCCTAT ACGTCGGCTA CGGGATGCTG CAGAAGAGGG GCAGGAAAAA CGGACAGAGA
AATAACACGT TCTATGCCGA GGTAGCGGGA GCTAGGCTAC ACCTAGTCTA CATCTCTACG
AAGAACCACG TCTACGCCCG TATCGCGGTC GAAGCTGTGC CTCAAGGCTG GGTGGAGGAG
GCGCGCGCTC AAGGCTGGGA CGTCCGGGTG GTTAACATGG GAGGCAGGGA GTACTACCAG
GTGACTCACA GTTCTCTCTT TGAACACGCG CGTAGCGACA CAGAGCTACG CGCAACACTC
CTCGCCTTCG CGAAGCACAA GGCTGAGCAG TACCCCAAGG CGCAGAGCCT CGTAGAACGC
CTCGAAAAAC TGGGGACAGA AGACTAA
 
Protein sequence
MSNDPRQPGR GRTGEEGAEA QAKTRPGGET LQSPAAKIEF FSRVSNPTLP SLGALPDREG 
FNEPRARGGP ARGEGGVARG EPRGYASPGP SERPGVGRPS RARGERLAAG RPGPGALDTR
GRRGRRGSGA PAPGWGVSPR LGRDRPRGRF VKPDTMSRKV TVAPSKRRRP SRGAGQRRAR
PWPQRARPRK QSLRSTHRSR LPPATLTPDS APSRGAPPAT WRVGDMSEAL MRGNGAVGEG
VEKTVREEVW RRAALSLYAT RTEDKRRSRG KKKKGEIHYR GLHDAVTEVD WDFTRFAAHA
LTVVPDDVYP RFYRFIDIDA RKYLLLNSDE RPREGSAVTE LKGRLQAIVD AGADGLRAEK
HGRVWHVYIP RENWYVVVSK PTNGWSVHVP LKGYWVESEF PEVLVRTPPD VLRSLQKGWV
LTDVTPPRGR FSDVSFATTQ PWQLPSTLAS FPSDDIKLGV TAGILGSTRL SIQWHARVYG
YEEALSWASR LIGEVKRAEF RRLVEECKAL NGDPVALFTG FLGDGGLEYF LRLRWLYFKV
GHELLYLPAE SAIVNARLAV ERASEYVRFV SLVTKCPKVK HFLYVGYGML QKRGRKNGQR
NNTFYAEVAG ARLHLVYIST KNHVYARIAV EAVPQGWVEE ARAQGWDVRV VNMGGREYYQ
VTHSSLFEHA RSDTELRATL LAFAKHKAEQ YPKAQSLVER LEKLGTED