Gene Tpen_0347 details

Gene Information

Locus tag: Tpen_0347
Symbol:
ID: 4601457
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermofilum pendens Hrk 5
Kingdom: Archaea
Replicon accession: NC_008698
Strand:
Start bp: 318487
End bp: 320415
Gene Length: 1929 bp
Protein Length: 642 aa
Translation table: 11
GC content: 54%
IMG OID: 639773107
Product: hypothetical protein
Protein accession: YP_919759
Protein GI: 119719264
COG category:
COG ID:
TIGRFAM ID:
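
The start, end, gene length and protein length fields above are mutually consistent, which a short sketch can check. It assumes 1-based inclusive coordinates and an unspliced CDS whose final codon is a stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


# Values taken from the record above.
length_bp = gene_length(318487, 320415)
print(length_bp)                  # 1929
print(protein_length(length_bp))  # 642
```

Both results match the Gene Length and Protein Length fields listed in the record.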


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.379451
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGAGGTA TGAAGAAAAT ACTCGGCGTA GTGGTGTTAG CACTTGTCTT GCCAGTAATA 
CTTGGGGCTG TCCTCGGAGA ACCATCCTCC ACGCCCCAGT TTTACGCACG TATAGACATC
ACGCTTAGTA CCGAGGAATT CTTGACGCTC GACTTTGGGA GTAAGAGTAT CATCAGCATA
GCGGGGGCAG CAGAGCCTCC GGGCTTCAGT GTAGACAGGG TAGTCGTGGT CTTCCAGGGC
GGAGCGCCGA GCGGGCTTAT TCCCGTCAAG TACGACAGTA TAAGCGAGTC AATGGGGAGA
ATAACCTCTA TCTCTTCCCG TAGCGGAGAG GTAGTAGTAG CCTCCAATGG GTTTAACGGG
ACAGTCCCCG TGAGGGTCAT AACGTACTTT AGGAAAACTT CCTGGAAACC GATTAAGGGC
AACAACATAA CCGTGGACAC GTCAGAGTTT ACGGGCCTCA ATCTACCCAG CGTGCGCCTC
AAGGTAACAC TGGACAACTA CGCCCCCTAC AGCGTAGCGG GAGTACTAGG ACCTTCCGGG
GAAAACCTGT TAGACGTAGA CTTGCAGGAG AAGCTGGGAC CCAGCGTTAT AAAGTTCGAC
CCGAAGCATG TAGAGGTAGA CGTATCGAAG GTAGGCTTCG GCGTTTACAC CGTTAAGCTT
GCCCAAGGAG AGGAGAATAA GCTTCCCAAC GCGATGCTCG TAGTGGAGGA CACCTACATC
GAAACGAGCG TTCCAGCTAA GTCCTCCAAA GTGTTTAACC TTAGAGGGCG TACAGGCTGG
AACCCGCTAG GCTTCATAGT CGTAGTGTAC TCCGTAGCCC CCGGACCCCT CTCTGCAAAC
GTTCGAGTAG AGTCTGAAAT GACGAACTAT GTGTTTAGCA GGGCGGAAGA ATTCGATATA
AGAGGGGCGT CCCTGCTCAT ACCTCCTCTG CTTATGCACT ACTGGATAAA GGGATACATA
GCGTTCGGCC AGGCGGTCAA GGTGGTTAAC AACGAGAACC GCGACATCCA GGTGCTGCTC
GTACCAGTGT ACTACAAGGA GGTAGGAACG TGGACGCCTA GAGGATTAAT AGCCACAATC
TCGAAGGCCG ATATAGGCAA TGCGTACTCC GCGTTCCTGG TCGTGCAGGT ACCGTCTATA
GCCAGGATAA CCTCCATCGA AACTCCAAGC GGGCAGGTAT TGCAAGGCAA GGAGAACTAC
ACTGGGGCGT GGCTCGGCAC CTGGAGAACC GCCGTGATCG AGCCCGGCGA AGCCGCCGTC
ATGGTCAAGA ACGGCGACGC CGTTGAAGAC GGGACATACA AGGTCAACAT TGAGTGGAGG
CCGCTCAGAG TGAAGTTCGT GGACTCCAAG GGGTACCCAA TCTCCGGAGT AGAAGCCACG
CTTAAAGGCT CCGTAAGCGC CTCTGCAGTC AGCGGAGCGG ACGGTGTCGC CACCCTGAAT
GTTTATGCGC CGGGAGTCTA CACTCTGACC GGCGTATACA AGGGGTCAAA TATAGCGTCC
ATGGTACTCG GTACGCTCAT AGACACTGAC TTGGAGATAA AATGCCCCGT GTACAACCTG
AACGTCAAGG TCGTGAATGC TCTCGGAGCG CCGATCACTG GCGCAACGGT AACGGTTTCG
AACAACGGTG GCTTTACGCA GTCCATGGAG ACGGACGCGA GCGGGAAGGC ACTCTTCCAG
CAACTCCCCG GGGCACAGTA TACAATCGAA GTGAACTACA AGAGGATATC CACTAAGTCT
ACGCTCACCC TGACCCAAGA CCAAGATATA ACGATAAACA CGGGAGTACT CTTCGAAATC
CCGTTACTTG GCCCCATAAC AGTGATGGAG ACGCTGACTC TCGGAGCCGC AGCACTGCTA
ACCTCCGCCC TATACTTCGG TGTCAGGAAG GGCAGAGAGG AGGAAGTGGC AGAGATAGAG
ATAGACTAA
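
The gene sequence is printed in blocks of 10 bases, 60 per line, so it needs to be joined before any analysis. The following is a minimal sketch of that cleanup plus a GC-fraction helper. The function names are hypothetical, and only the first line of the listing is used as sample input, so the GC value it prints is not the whole-gene figure.

```python
def clean_sequence(blocked: str) -> str:
    """Join a blocked sequence listing by removing all whitespace and uppercasing."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; returns 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence listing above (sample input only).
first_line = "ATGCGAGGTA TGAAGAAAAT ACTCGGCGTA GTGGTGTTAG CACTTGTCTT GCCAGTAATA"
seq = clean_sequence(first_line)

print(len(seq))               # 60
print(seq.startswith("ATG"))  # True
print(gc_fraction(seq))       # GC fraction of this 60-base sample only
```

Applying the same two functions to the full listing would give the length and GC fraction to compare against the Gene Length and GC content fields.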
 
Protein sequence
MRGMKKILGV VVLALVLPVI LGAVLGEPSS TPQFYARIDI TLSTEEFLTL DFGSKSIISI 
AGAAEPPGFS VDRVVVVFQG GAPSGLIPVK YDSISESMGR ITSISSRSGE VVVASNGFNG
TVPVRVITYF RKTSWKPIKG NNITVDTSEF TGLNLPSVRL KVTLDNYAPY SVAGVLGPSG
ENLLDVDLQE KLGPSVIKFD PKHVEVDVSK VGFGVYTVKL AQGEENKLPN AMLVVEDTYI
ETSVPAKSSK VFNLRGRTGW NPLGFIVVVY SVAPGPLSAN VRVESEMTNY VFSRAEEFDI
RGASLLIPPL LMHYWIKGYI AFGQAVKVVN NENRDIQVLL VPVYYKEVGT WTPRGLIATI
SKADIGNAYS AFLVVQVPSI ARITSIETPS GQVLQGKENY TGAWLGTWRT AVIEPGEAAV
MVKNGDAVED GTYKVNIEWR PLRVKFVDSK GYPISGVEAT LKGSVSASAV SGADGVATLN
VYAPGVYTLT GVYKGSNIAS MVLGTLIDTD LEIKCPVYNL NVKVVNALGA PITGATVTVS
NNGGFTQSME TDASGKALFQ QLPGAQYTIE VNYKRISTKS TLTLTQDQDI TINTGVLFEI
PLLGPITVME TLTLGAAALL TSALYFGVRK GREEEVAEIE ID