Gene Tpen_0873 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_0873 
Symbol 
ID4600355 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp822959 
End bp824431 
Gene Length1473 bp 
Protein Length490 aa 
Translation table11 
GC content57% 
IMG OID639773651 
Producthypothetical protein 
Protein accessionYP_920277 
Protein GI119719782 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTGGTA GAGGCATGAC GGGGGGTGAA GAGGGCGAGG AGGTTGTTAG GAAGCTTGTC 
TACGGGAGGG CGGCTCAAAG CTTGTATGCC ACCAGGACAG AGGATAAGAG GAGGTCTAGG
GGCGGGAAGA AGGGCAGAGT ACATTACCGC GGCCTGCATG ATGCTGTGAC CGAAATTAAC
TGGGACTTTA CGCGCTTCGT GGCGCACGCG CTGAGTGTCG TGCCGGACGA GGTTTACCCC
AGGTTTGGCA GGCTGTTTGA CTACGATTTG AGGCAGTACC TACTACTGGG AGACAGCGAG
AGACCGAGAG AGGGAAGCGC GGTCGTGGAG CTCAAAGGAA AGCTACAAGC AATCGTCGAT
GCAACAGAAG ACGGCCTTAA AGTTGAGAGG CACGGAAAGA TTTGGCGCGT CTACCCGCCC
AGGGAGAACT GGCATGTGAC AGTCTCAAGG CCCAATGCAC ATAGCTGGAC TGTCCACGTC
CCATTGGAGG GTTACTGGGT TGAGTCAGAG TTCCCGGAGG TACTCGCGAG GACTTCGCCC
GACATCCTAA GAAGCCTGCA GAAAGGGTGG CTTCTTACGG ACGTAATGCC GCCACGTGAA
AGTTACCTTT ACGTTTCCTT CAGCACTACC CAGGCGTGGC AACTTCCAGC CACGCTCGCC
GACTTTCCAA GCGACGATGT CCACCTCGGC ATTAGAGCAG GAGTTCTCGG CTCCACCAAG
ATGAGCATTA TGTGGGGAGT GAGCATCTAT GGCTATGCGG AGGAGCTTTC CTGGGCTTCA
AGGCTTGTCG GCGAGGTTAA GCGCGCTGAG TACCGCAGTT TGGTCGAGGA GTGCAAGGCG
CTTAATGGTG ACTCGGTGGC GCTGATAACC GCCTTCCTGG GGGACGGCAT CCTTGCGTTC
TTCCTAAGGC TTCGATGGCT CTACTTCAGG GTTGGCAACG AGATAGTTTA CTTACCTACG
GAGAGCGCTG TCTTTAACGC TCGTGTTGCT GTCGAACGGG CCAGCGAATA CGTAGCATTT
GTGGGCAAAG TTACTAGGTG CGCAAAGGTT AGGCACGTCC TATTCGTTGC CTACGGAGCA
CCGGGTAAGA GGGGCAGGAA GCCCGGGCAG AAGGTGGCTT GCCAGTGGCT CGGCCTCTAT
GCGCCGGTCG CCGGCGCGTT GCTTAACCTC GTTATGGTTA CTGTCGGCGA CGAGTATGCC
TACATCTACG CCCGTATACC CGTTGATAGC GCGCCTCCAG GCTGGTACCA GCGCGCGCTG
GAGGAAGGCT GGGACGTCCG GGTGGTTCGC ATGAGTGACA GGGAGTACTA CCAGATCCCC
CAAGATTCGT TATTTGAGCA CGCGGGCGAG AACCCGGAGC TCTGGGAGGC GCTATACCGC
TTCGCAGTTG CAAAGGCGCA GGCTAAGCCG GCCGCGAGGA AGCTAGTCGA AGAACTGCTA
AGATTTAAAC CAGCTGGGGC AAAGCAAGAA TAA
 
Protein sequence
MSGRGMTGGE EGEEVVRKLV YGRAAQSLYA TRTEDKRRSR GGKKGRVHYR GLHDAVTEIN 
WDFTRFVAHA LSVVPDEVYP RFGRLFDYDL RQYLLLGDSE RPREGSAVVE LKGKLQAIVD
ATEDGLKVER HGKIWRVYPP RENWHVTVSR PNAHSWTVHV PLEGYWVESE FPEVLARTSP
DILRSLQKGW LLTDVMPPRE SYLYVSFSTT QAWQLPATLA DFPSDDVHLG IRAGVLGSTK
MSIMWGVSIY GYAEELSWAS RLVGEVKRAE YRSLVEECKA LNGDSVALIT AFLGDGILAF
FLRLRWLYFR VGNEIVYLPT ESAVFNARVA VERASEYVAF VGKVTRCAKV RHVLFVAYGA
PGKRGRKPGQ KVACQWLGLY APVAGALLNL VMVTVGDEYA YIYARIPVDS APPGWYQRAL
EEGWDVRVVR MSDREYYQIP QDSLFEHAGE NPELWEALYR FAVAKAQAKP AARKLVEELL
RFKPAGAKQE