Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_0781 |
Symbol | |
ID | 4601842 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 731014 |
End bp | 732885 |
Gene Length | 1872 bp |
Protein Length | 623 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 639773557 |
Product | thiamine pyrophosphate binding domain-containing protein |
Protein accession | YP_920186 |
Protein GI | 119719691 |
COG category | [C] Energy production and conversion |
COG ID | [COG4231] Indolepyruvate ferredoxin oxidoreductase, alpha and beta subunits |
TIGRFAM ID | [TIGR03336] indolepyruvate ferredoxin oxidoreductase, alpha subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGAGCG TAATCGAAGG GTCCCCGGGA GAGGTGAAGC TTCTCCTGGG TAACGAGGCT ATAGCGAGGG GTGCCCTGGA GGCCGGCATC TCGGTTGCAA CGGCTTACCC CGGGACGCCG TCCACAGAGA TCGTGGAGAC ACTGGCAGAG GTCGGCGAGA GGTACGGCGT CTACGTTGAG TGGAGCACGA ACGAGAAGGT AGCACTAGAG ATAGCCATCG GCGCGTCCAT GATGGGCCTT AGAGCGTTAA CAGCGATGAA GCACGTCGGC GTGAACGTGG CCTCGGACCC CTTGATGAGC TTAGGGTACA CCGGAGTCGT TGGGGGGCTC GTCATAGTCA CGGCGGATGA TCCGAACGCA CACAGTAGCC AGAACGAGCA GGACAACAGG ATCTACGGGC TTCACTCGTA TATCCCCGTT TTCGAGCCTT CATCGCCCCA GGAGGCTAAG GACATGGTGA GAGATCTCTA CGATCTCTCG GAGAAGTACT CCACCGCCGT CTTTCTGAGA ACAACTACGA GGCTGTCCCA CAGCAGGGGT GAAGTGACTC TGGGGGAGTT GCGGGGCGCG GGTAGGGAGC CCCGTTTCCA CAGGGACCCA GAGCGGTGGG CGTTGCTGCC GCCCTACAAT CTTGTGAAGC ACCGCGAGGC CGTTAACAGG ATCAAGAGGC TAGAGGAAGA CCTCTCCAGC TTTAAGTACA ACTGGGTCGA GCCGGGCGAC AGCATGGTCG CCGTAGTGGC AGTAGGGGCT ACCTACGCGT ACGTCAAGGA GGCTGTCTCC AAGCTCGGGG TGAAACCGAC CATTTTCAAG CTTTCATCCA CGTACCCGGT CCCGAGGGGG TTCGCCGTTA AGGCTCTCTC CTACGAGAGG TTGCTGGTGG TAGAGGAGCT GGAGCCGTTC GTCGAGAAGG AGCTGAAGGT GATAGCCTTC GAGGAGGGCA TGAAGCCCGA AATACATGGC AAGGATCTCC TGCCGAGAGT AGGGGAGCTC TCCACGGCGC TTGTTGCGCA AGCCATTGCG AAGTTCCTCG GAGTTCCCTA CGAGCCGCCG AGGACGTACA CACCCGGCGT GGAGTTGCCC AGGAGGCCCC CCGTTCTATG CGCAGGTTGC GGCCACAGGT CGACGTACTA CGCGGTGAAG CTCGCGGCTG CGAGGGCTAG GGTGAAGCCG GTCTACGCGA ACGACATAGG TTGCTACACG CTAGGCTTCT ACCCGCCCTT CGAGATGGCG GACTTCACGT GGAGCATGGG ATCGGCGCTC GGGATAGGGA TGGGTATATC CAAGTTCAGC AAGGAACCCG TCATCGCTTT CATAGGCGAC TCTACGTTCT ACCACGCGGG CATACCCGGG CTCATAAACG CGGTTTACAA CAGGATACCA CTGGTGGTCG TCGTCATGGA TAACGGGATA ACAGCCATGA CCGGGCATCA GCCCCACCCT GGTAGCGGGT TCGGCCCCGC CGGGGAGCCG AGGCCCGTCG TGAAGATAGA GGACATAGCC AAGGCTGTGG GCGTCGAGTT CGTAGAGGTG GTGGATGCCT ACGACGTGCC GGCGGTCAGG GATGCGGTAG AGAGGGCTAT CAGGTACGTG GTAGAGAAGA GTAGGCCGGC TGTCGTGGTC TCCAGGAGGC CTTGTGCCCT GATGGAGCTT AGGAGGAAGC GGCTGAGCGG GGAGAGGGTC GTGCCCTACT ACGTAGACCA GGAGAGGTGC GTGAGGTGCG GTATATGCGT GGACAAGTTC TCGTGCCCGG CTATTGTCCG CGAGGAGGAC GGCAGGGTAG TTATCCTCCC GGAGGTCTGT GTCGGGTGCG GTGTGTGCGC AACGATATGC CCCGCGAAGG CTATACACCC TGTAGGCGGG TCTTCGGGGT GA
|
Protein sequence | MMSVIEGSPG EVKLLLGNEA IARGALEAGI SVATAYPGTP STEIVETLAE VGERYGVYVE WSTNEKVALE IAIGASMMGL RALTAMKHVG VNVASDPLMS LGYTGVVGGL VIVTADDPNA HSSQNEQDNR IYGLHSYIPV FEPSSPQEAK DMVRDLYDLS EKYSTAVFLR TTTRLSHSRG EVTLGELRGA GREPRFHRDP ERWALLPPYN LVKHREAVNR IKRLEEDLSS FKYNWVEPGD SMVAVVAVGA TYAYVKEAVS KLGVKPTIFK LSSTYPVPRG FAVKALSYER LLVVEELEPF VEKELKVIAF EEGMKPEIHG KDLLPRVGEL STALVAQAIA KFLGVPYEPP RTYTPGVELP RRPPVLCAGC GHRSTYYAVK LAAARARVKP VYANDIGCYT LGFYPPFEMA DFTWSMGSAL GIGMGISKFS KEPVIAFIGD STFYHAGIPG LINAVYNRIP LVVVVMDNGI TAMTGHQPHP GSGFGPAGEP RPVVKIEDIA KAVGVEFVEV VDAYDVPAVR DAVERAIRYV VEKSRPAVVV SRRPCALMEL RRKRLSGERV VPYYVDQERC VRCGICVDKF SCPAIVREED GRVVILPEVC VGCGVCATIC PAKAIHPVGG SSG
|
| |