Gene Tpen_1866 details


Gene Information

Locus tag: Tpen_1866
Symbol:
ID: 4600331
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermofilum pendens Hrk 5
Kingdom: Archaea
Replicon accession: NC_008696
Strand:
Start bp: 15432
End bp: 17306
Gene Length: 1875 bp
Protein Length: 624 aa
Translation table: 11
GC content: 59%
IMG OID: 639772464
Product: hypothetical protein
Protein accession: YP_919124
Protein GI: 119709784
COG category:
COG ID:
TIGRFAM ID:
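
The start, end, and length fields above can be cross-checked with a few lines of arithmetic. The sketch below assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the function name `lengths_from_coords` is hypothetical and not part of any database tool.

```python
def lengths_from_coords(start_bp, end_bp):
    """Return (gene_length_bp, protein_length_aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and exactly one terminal stop
    codon, which is excluded from the protein length.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1
    return gene_length, protein_length


gene_len, prot_len = lengths_from_coords(15432, 17306)
print(gene_len, prot_len)  # 1875 624
```

The result agrees with the Gene Length (1875 bp) and Protein Length (624 aa) fields of this record.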


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGCATGA AAAGGTACAT ACCCCTGTTA CTGCTACTGG TAACGGTAGC CTTGGGCATC 
GCCACCGTAA AGGCGGCTCC CGTCACGACT TACACGGTCT TCGTCAACTA CACCATGTAC
ACGGGCACCG CTCCAGCCGC GGGTAGCACC TACTGGTTCA ACTTCACAAT CACTCCAAGC
CTCGACGTCG TCATCGTGAC GGTAGAGGAC TTGGGAGGCG CTAGCCAGAG GACGGTCTAC
GTGGACAACG TGGCTGTAAC CCTGCCCTAC AAGCTGAAGG CTGGAGAGAC CCACAGGGTA
GCCGTCAAGG TCTACTTCCC GAGCGCGCTG ACTGTAGCGT GGTTCGGCAA GACGATACTC
GCCTACAACA GGACTGAGGA CGTAAACCTG CAGGTAAAGT ACGTGGGCTA CGGCTTCGAC
TCCGTGTCCT ACGGGAGCTA CGTGGGTAGC GTGGACAACG TGGCAACGCT GACCGTCAAG
CTGGACACGC CTTACACCAT GGGGGTCACA CACACGTTCA GCTCTGTGTC CACGGTCGTC
TGGCTCAGGT TCCACCCGGC GATGCCGATT AAGGGCTACA CGGGCACCGC CTACTACACC
ACGACCTTCA ACGCCTACAC GATTGGAAGC CAGCCCGCGG GCACCGATAG CTACACCATC
TCCAACAACC CGGTAACCGT CACGGTTGAC TGTCCACCGC CAGTAGGGCT CTACGAGCTT
GTCTGGTACG TGAGCTGGGG AGGCTTGACG GAGAAGATGC TCGGGGCGCC GGGCACCCCC
ACCCCCAACA AGAACTACGC TAGGCTGGTC GGCGCCACGT TTACTTGGAA GTTCACCCCC
AACGCCACGT ACTTCCCCAC GACCAAGGTT GACGAGTACC TCTTGGTGAA CGGTAGCAAG
GCTTCGAGCC TCGCGCTGAG CAAGAAGGGG CTCTACAACG CCACCTTCGT CAACATCGGA
AAGATGAACA CCACGTTCTA CGGAGTGCTG GCATACCTGC CCCCCGTGGT CAGGGCGGGT
ACCATGTACG TCTACGCCGC GGAGCTTAAG GTCAACGTGC TCTCTCAGGT AGGCACGGTG
GGAGCCCCGA GCCCCGGCTT CACTCTCAGC TTAGCCGGCG CGAACGCTCC CGGCTACTTC
GCGCTTCAGG TCAAGGTGCC CGAGGGAGTC GTCACGGTCG AAAAGGCTGA GGGAGTGGCG
TGGAACAGGA CTGTGCTCCC CGGCATAGCC AAGGACGTCA AGACGAGCCT CGACGCCGAT
AAGCTGGTCT ACACCGTGAA CTACACGTAC ACGCTGTACT ACGCGCCTAT AGTGGGAGGC
GTCAACTTCC TAGGCGCGTG GGGCATAGAG GAGACTCCGA GCCTTACGCC GACGAGCCCG
ACTATAGCCA CTCTGACTGC CAGCAAGCCA GTGGTGCTCA ACGCTACTAG AAACGTGATA
AGGGTCTTGG ACTCCGCGGG CAACGACGTC AGCTTCGTCG GGAAAGCCAT AGCGATAAAG
AGTAGCGGCA CCTACACCGT CAAGCTTGAG ACCGTGATAA AGGTCGTCAA CCTCTACCAA
GGCAAGAAGA TACCCGCCAC GGTCAGGCTC TACGACGCTA AGGGCTTGCG CCTAGCGGAG
AAGACCGGGG AGGAGGTAAC GTTCACGGTT GAGCCCGGAC TGCTCTACAC GGTCGAATCG
GACAACGGTA ACGAGGTGCT GACTCAGCGC GTCACGCCTA CACAGGACGT CGATGTAACA
ATGGAGTTCA CGAAGCCTCC CGCCGTGGTA ATTCCGTGGG AGTGGGTGTG GCTAGCGCTG
GCCATAGTGT TCCTAGTGGT TCTGATATAC TTCGCTAAGA GGCTCAAGGA GGGTCTAGAG
ATAGTAGTGG GGTAA
 
Protein sequence
MGMKRYIPLL LLLVTVALGI ATVKAAPVTT YTVFVNYTMY TGTAPAAGST YWFNFTITPS 
LDVVIVTVED LGGASQRTVY VDNVAVTLPY KLKAGETHRV AVKVYFPSAL TVAWFGKTIL
AYNRTEDVNL QVKYVGYGFD SVSYGSYVGS VDNVATLTVK LDTPYTMGVT HTFSSVSTVV
WLRFHPAMPI KGYTGTAYYT TTFNAYTIGS QPAGTDSYTI SNNPVTVTVD CPPPVGLYEL
VWYVSWGGLT EKMLGAPGTP TPNKNYARLV GATFTWKFTP NATYFPTTKV DEYLLVNGSK
ASSLALSKKG LYNATFVNIG KMNTTFYGVL AYLPPVVRAG TMYVYAAELK VNVLSQVGTV
GAPSPGFTLS LAGANAPGYF ALQVKVPEGV VTVEKAEGVA WNRTVLPGIA KDVKTSLDAD
KLVYTVNYTY TLYYAPIVGG VNFLGAWGIE ETPSLTPTSP TIATLTASKP VVLNATRNVI
RVLDSAGNDV SFVGKAIAIK SSGTYTVKLE TVIKVVNLYQ GKKIPATVRL YDAKGLRLAE
KTGEEVTFTV EPGLLYTVES DNGNEVLTQR VTPTQDVDVT MEFTKPPAVV IPWEWVWLAL
AIVFLVVLIY FAKRLKEGLE IVVG
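
Both sequences above are printed in space-separated blocks of ten characters, so they need to be joined before use. The following is a minimal sketch of joining the blocks, computing a GC fraction, and translating a coding sequence. The helper names (`join_blocks`, `gc_fraction`, `translate`) are hypothetical. The codon table is the standard one, which is assumed to be sufficient here because translation table 11 uses the same codon-to-amino-acid assignments and differs only in its permitted start codons.

```python
# Standard codon table built in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def join_blocks(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate seq in frame 1, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First row of the gene sequence shown above.
first_row = join_blocks(
    "ATGGGCATGA AAAGGTACAT ACCCCTGTTA CTGCTACTGG TAACGGTAGC CTTGGGCATC"
)
print(translate(first_row))  # MGMKRYIPLLLLLVTVALGI
```

The translated first row matches the first twenty residues of the protein sequence listed in this record. Passing the full joined gene sequence to `gc_fraction` would give a value to compare against the GC content field.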