Gene Tpen_1121 details


Gene Information

Locus tag: Tpen_1121
Symbol:
ID: 4600863
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermofilum pendens Hrk 5
Kingdom: Archaea
Replicon accession: NC_008698
Strand:
Start bp: 1059776
End bp: 1062214
Gene Length: 2439 bp
Protein Length: 812 aa
Translation table: 11
GC content: 59%
IMG OID: 639773897
Product: molybdopterin oxidoreductase
Protein accession: YP_920522
Protein GI: 119720027
COG category: [C] Energy production and conversion
COG ID: [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing
TIGRFAM ID:
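
The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (the record does not state the convention) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1059776
end_bp = 1062214

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
assert gene_length % 3 == 0

# Assumption: the last codon is a stop codon and is not counted as a residue.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, "bp")     # 2439 bp
print(protein_length, "aa")  # 812 aa
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.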


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.721348
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCAGAAC TTACAAGGAG AGATGCAATC AAGCTCGGGG CACTGCTCGC CATAGCCTCT 
TCGCTTAATC TACCAGTAAA GGTTCAGAGA CCATCCCGGG AAGTAGAAGC CTCCGCGGCT
ACAGCCAGGA TACCCGTAAT GTGCAGTATG TGCGCCGCGG GATGCGGGAT TCTCCTAGTC
AAGGAGGGAA ACGGCACGGT CTACGTTGAG CCGAACCTCG AGCACCCGCA GCCGGGGCTC
TGCGCGAGGG CGGCTTCCGC GCTCCAGCTG TGGAACCACC CCTTGAGGCT GAAGAAGCCC
TTGAAGAGGG TTGGCGAAAG GGGTGAGGGC AAGTTCCAGG AGGTCGACTG GGATACTGCT
CTCAACGAGA TAGCTACGAA GCTGAAGGAG ATAATCTCCA AGTATGGTCC CGAGTCTGTC
GTATTCACGT ACCACGACTT CTACGCCTGG CACATGCCGC TCATAGCGTT CACGCTCGGC
ACGCCTAACC ACGTTCAGCA CGCGTCTTGT TGCCACAACG CAAGCACGTA CGCGAGGATG
CTGGTTCTAG GGGCAGGTGG ACCTCCGACC GTCGACCCGG ACTACGAGCG GGCACGCTAC
GTAGTGTTCG TGGGCCGAGT TCTGTGCGCG GCGATGGGCA TGGTTCAAAG GCTTCAGAAA
GCGCGGGAGT CCGGCGTGAA GCTAGTCTTC GTGGACCCCA GGATGGGCAA CGCGGCGATG
GCAGAGGGTG AGTGGGTCCC CATACTGCCC GGTACGGACG CCGCCTTCCT GCTCTCCATG
ATCCACGTCA TACTCAGCGA GAAGCTGTAC GACGAGTCCT TTTTGAAGAA GTACACGAAC
GCGACTTTCC TCATAAAGCC CGACGGAAGC CCCCTCACAG AGAAAGACCT CGGCAGGGAG
GGCTCAGACT ACGTTGTCTA CGACGCCGAC GCGAAAGACT TCCTGAGCTA CAAGAAGTCG
AAGAACCCGG CCCTCGAGTG GGAGGGCGAC GTCGCCGGCT TCCACGTGAA GACAGCGTTC
CTGCTACTCA AGGAGAGGGC TTCGCAGTAC GCCCCTGAGC AGGCCGAGAA GATATGCGGG
GTCCCGGCGG ACACTATCAG GAGGATTGCC CGCGAGTTCG CCAACGCTAG GGGAGTGGTG
GAGGACGGCT GGTGGTCCGC GAAGAACGCG AACGACTCAG ATGCTTATAG GGCCGCGCTT
ACACTCAACG CCCTCGTCGG TAGCATCGAG ACTGCGGGGG GTCTCTACAT AAAGCTGGGG
TCTAAGATGC CCCCCTCAGC CACGGCGACT GCCGAAAAAG TAACCACGAT TACGGGCGGA
ACGCTTCCCG GGATCAGGGC GAAGAGGATA GACACGCAGA AGTACCCAGC TGTACCCCAT
GTTTTCGACG CCGTGCTTGA CGCCGCGCTC GAAGGCAAGC CGTACCCGGT TAAAGCTCTG
TTCATCGTGG GCGCCGAGCC TTTCACTAGG GATGTGAACA CCGAGAAACT GAAAAAGGCC
CTCAAAGCCA TGGAGCTCGT CGTGGTAATC GACGTTGTAC CGAACGATAG CGTTGACTAC
GCGGACTACG TGCTCCCGGA CAACATCTTC CTGGAAAGGG AGGAGCTCAC AGACGTCAAG
TTTACTCCGC ACGCGGCGAT ACAGCTGTCG CACAAAGCCC TCGACCCGCC TCCCGGCATA
GACGCGCGGA ACGGCTTCTG GATCATGATG GAAATCCTGC GCCGAACAGT CCCCGAGAGA
GCCAAGGCAG TAGGCTACAC CGAGGAGTAT TCCAGCTACG AGAAGTTCAA GGAGTTCGAA
GCGCTTGTCA AGCGGAAAGT CTTGGAGTCC CTCTCGAAGA CGTGGAACGT CCCGGTGGAA
GACATAGAGA AGTCCCTTGA GGAGAAAGGC TTCTATGTGT TCAAGCACTG GATGCCGAAG
GCGGGCCCGG GAACCCTCCC CACGCCCAGC GGCCTCGTCG AGATATACAG CCTGGCAGCG
CTAAAGTACA ACGACGACCC GCTACCAAAG TGGAAGCGCC CGCCGTACAC ACTTCCGAGC
AACCCGGACG AGTTCTACCT CGTGAGCGGA AGGGACCAGT TCGTCACCGC GCACGCTGTG
TGGACGAAGA ACATTATCCA CCTAGTAGAC AGGAGGGTCT GGATGAACCC CAACGACGCT
AAGCGCCTGG GCATAAAGGA CGGCGACCTC ATAGAGCTCG AAGGGCTGGA CAACCACTAC
AAGGCCCGCG CGAGGGTAAA GGTGACGAAC AGGGTTAGGG AAGGCGTTCT GTTCGTGTAC
AGCAGAGCGG GAGGACGCTT CTCGAGGCTT ATCACGGGCG AGTACGAAGT TATGAAGGAG
GGGATAAACC CGAACATGTT CACTCTGAGC TGGCTGGAGC CGCTCAACGG GTCGACGGGT
CTTAACTCTA CGGTTAAGGT TAGAAGGGTG GGAGCATGA
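
The gene sequence is displayed in blocks of 10 bases, 60 per line, so the spaces and line breaks have to be removed before any analysis. The sketch below shows one way to do that and to compute a GC fraction comparable to the GC content field. The helper names `clean_sequence` and `gc_content` are hypothetical and not part of any database tool.

```python
def clean_sequence(text: str) -> str:
    """Remove the display spacing and line breaks, and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_content(text: str) -> float:
    """Return the fraction of G and C bases in a (possibly spaced) sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First display line of the gene sequence, copied from the record.
first_line = "ATGGCAGAAC TTACAAGGAG AGATGCAATC AAGCTCGGGG CACTGCTCGC CATAGCCTCT"

print(len(clean_sequence(first_line)))   # 60 bases per display line
print(f"{gc_content(first_line):.1%}")   # GC fraction of this line only
```

Passing the full sequence text to `gc_content` gives a value to compare with the reported GC content, and `len(clean_sequence(...))` gives one to compare with the reported gene length.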
 
Protein sequence
MAELTRRDAI KLGALLAIAS SLNLPVKVQR PSREVEASAA TARIPVMCSM CAAGCGILLV 
KEGNGTVYVE PNLEHPQPGL CARAASALQL WNHPLRLKKP LKRVGERGEG KFQEVDWDTA
LNEIATKLKE IISKYGPESV VFTYHDFYAW HMPLIAFTLG TPNHVQHASC CHNASTYARM
LVLGAGGPPT VDPDYERARY VVFVGRVLCA AMGMVQRLQK ARESGVKLVF VDPRMGNAAM
AEGEWVPILP GTDAAFLLSM IHVILSEKLY DESFLKKYTN ATFLIKPDGS PLTEKDLGRE
GSDYVVYDAD AKDFLSYKKS KNPALEWEGD VAGFHVKTAF LLLKERASQY APEQAEKICG
VPADTIRRIA REFANARGVV EDGWWSAKNA NDSDAYRAAL TLNALVGSIE TAGGLYIKLG
SKMPPSATAT AEKVTTITGG TLPGIRAKRI DTQKYPAVPH VFDAVLDAAL EGKPYPVKAL
FIVGAEPFTR DVNTEKLKKA LKAMELVVVI DVVPNDSVDY ADYVLPDNIF LEREELTDVK
FTPHAAIQLS HKALDPPPGI DARNGFWIMM EILRRTVPER AKAVGYTEEY SSYEKFKEFE
ALVKRKVLES LSKTWNVPVE DIEKSLEEKG FYVFKHWMPK AGPGTLPTPS GLVEIYSLAA
LKYNDDPLPK WKRPPYTLPS NPDEFYLVSG RDQFVTAHAV WTKNIIHLVD RRVWMNPNDA
KRLGIKDGDL IELEGLDNHY KARARVKVTN RVREGVLFVY SRAGGRFSRL ITGEYEVMKE
GINPNMFTLS WLEPLNGSTG LNSTVKVRRV GA
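
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below is a simplified illustration: it uses the standard codon assignments, which translation table 11 shares (table 11 differs in which start codons it permits, and this sketch does not handle alternative start codons). The function name `translate` and the table construction are illustrative only.

```python
from itertools import product

# Standard codon assignments in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text: str) -> str:
    """Translate a DNA sequence codon by codon, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGGCAGAAC TTACAAGGAG AGATGCAATC"))  # MAELTRRDAI
```

The first ten residues produced match the start of the protein sequence, and the final three codons of the gene (GGA GCA TGA) give the closing "GA" followed by a stop.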