Gene Tpen_0781 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_0781 
Symbol 
ID4601842 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp731014 
End bp732885 
Gene Length1872 bp 
Protein Length623 aa 
Translation table11 
GC content60% 
IMG OID639773557 
Productthiamine pyrophosphate binding domain-containing protein 
Protein accessionYP_920186 
Protein GI119719691 
COG category[C] Energy production and conversion 
COG ID[COG4231] Indolepyruvate ferredoxin oxidoreductase, alpha and beta subunits 
TIGRFAM ID[TIGR03336] indolepyruvate ferredoxin oxidoreductase, alpha subunit 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGAGCG TAATCGAAGG GTCCCCGGGA GAGGTGAAGC TTCTCCTGGG TAACGAGGCT 
ATAGCGAGGG GTGCCCTGGA GGCCGGCATC TCGGTTGCAA CGGCTTACCC CGGGACGCCG
TCCACAGAGA TCGTGGAGAC ACTGGCAGAG GTCGGCGAGA GGTACGGCGT CTACGTTGAG
TGGAGCACGA ACGAGAAGGT AGCACTAGAG ATAGCCATCG GCGCGTCCAT GATGGGCCTT
AGAGCGTTAA CAGCGATGAA GCACGTCGGC GTGAACGTGG CCTCGGACCC CTTGATGAGC
TTAGGGTACA CCGGAGTCGT TGGGGGGCTC GTCATAGTCA CGGCGGATGA TCCGAACGCA
CACAGTAGCC AGAACGAGCA GGACAACAGG ATCTACGGGC TTCACTCGTA TATCCCCGTT
TTCGAGCCTT CATCGCCCCA GGAGGCTAAG GACATGGTGA GAGATCTCTA CGATCTCTCG
GAGAAGTACT CCACCGCCGT CTTTCTGAGA ACAACTACGA GGCTGTCCCA CAGCAGGGGT
GAAGTGACTC TGGGGGAGTT GCGGGGCGCG GGTAGGGAGC CCCGTTTCCA CAGGGACCCA
GAGCGGTGGG CGTTGCTGCC GCCCTACAAT CTTGTGAAGC ACCGCGAGGC CGTTAACAGG
ATCAAGAGGC TAGAGGAAGA CCTCTCCAGC TTTAAGTACA ACTGGGTCGA GCCGGGCGAC
AGCATGGTCG CCGTAGTGGC AGTAGGGGCT ACCTACGCGT ACGTCAAGGA GGCTGTCTCC
AAGCTCGGGG TGAAACCGAC CATTTTCAAG CTTTCATCCA CGTACCCGGT CCCGAGGGGG
TTCGCCGTTA AGGCTCTCTC CTACGAGAGG TTGCTGGTGG TAGAGGAGCT GGAGCCGTTC
GTCGAGAAGG AGCTGAAGGT GATAGCCTTC GAGGAGGGCA TGAAGCCCGA AATACATGGC
AAGGATCTCC TGCCGAGAGT AGGGGAGCTC TCCACGGCGC TTGTTGCGCA AGCCATTGCG
AAGTTCCTCG GAGTTCCCTA CGAGCCGCCG AGGACGTACA CACCCGGCGT GGAGTTGCCC
AGGAGGCCCC CCGTTCTATG CGCAGGTTGC GGCCACAGGT CGACGTACTA CGCGGTGAAG
CTCGCGGCTG CGAGGGCTAG GGTGAAGCCG GTCTACGCGA ACGACATAGG TTGCTACACG
CTAGGCTTCT ACCCGCCCTT CGAGATGGCG GACTTCACGT GGAGCATGGG ATCGGCGCTC
GGGATAGGGA TGGGTATATC CAAGTTCAGC AAGGAACCCG TCATCGCTTT CATAGGCGAC
TCTACGTTCT ACCACGCGGG CATACCCGGG CTCATAAACG CGGTTTACAA CAGGATACCA
CTGGTGGTCG TCGTCATGGA TAACGGGATA ACAGCCATGA CCGGGCATCA GCCCCACCCT
GGTAGCGGGT TCGGCCCCGC CGGGGAGCCG AGGCCCGTCG TGAAGATAGA GGACATAGCC
AAGGCTGTGG GCGTCGAGTT CGTAGAGGTG GTGGATGCCT ACGACGTGCC GGCGGTCAGG
GATGCGGTAG AGAGGGCTAT CAGGTACGTG GTAGAGAAGA GTAGGCCGGC TGTCGTGGTC
TCCAGGAGGC CTTGTGCCCT GATGGAGCTT AGGAGGAAGC GGCTGAGCGG GGAGAGGGTC
GTGCCCTACT ACGTAGACCA GGAGAGGTGC GTGAGGTGCG GTATATGCGT GGACAAGTTC
TCGTGCCCGG CTATTGTCCG CGAGGAGGAC GGCAGGGTAG TTATCCTCCC GGAGGTCTGT
GTCGGGTGCG GTGTGTGCGC AACGATATGC CCCGCGAAGG CTATACACCC TGTAGGCGGG
TCTTCGGGGT GA
 
Protein sequence
MMSVIEGSPG EVKLLLGNEA IARGALEAGI SVATAYPGTP STEIVETLAE VGERYGVYVE 
WSTNEKVALE IAIGASMMGL RALTAMKHVG VNVASDPLMS LGYTGVVGGL VIVTADDPNA
HSSQNEQDNR IYGLHSYIPV FEPSSPQEAK DMVRDLYDLS EKYSTAVFLR TTTRLSHSRG
EVTLGELRGA GREPRFHRDP ERWALLPPYN LVKHREAVNR IKRLEEDLSS FKYNWVEPGD
SMVAVVAVGA TYAYVKEAVS KLGVKPTIFK LSSTYPVPRG FAVKALSYER LLVVEELEPF
VEKELKVIAF EEGMKPEIHG KDLLPRVGEL STALVAQAIA KFLGVPYEPP RTYTPGVELP
RRPPVLCAGC GHRSTYYAVK LAAARARVKP VYANDIGCYT LGFYPPFEMA DFTWSMGSAL
GIGMGISKFS KEPVIAFIGD STFYHAGIPG LINAVYNRIP LVVVVMDNGI TAMTGHQPHP
GSGFGPAGEP RPVVKIEDIA KAVGVEFVEV VDAYDVPAVR DAVERAIRYV VEKSRPAVVV
SRRPCALMEL RRKRLSGERV VPYYVDQERC VRCGICVDKF SCPAIVREED GRVVILPEVC
VGCGVCATIC PAKAIHPVGG SSG