Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_1071 |
Symbol | |
ID | 4601649 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 1009107 |
End bp | 1010720 |
Gene Length | 1614 bp |
Protein Length | 537 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 639773848 |
Product | NADH-ubiquinone oxidoreductase, chain 49kDa |
Protein accession | YP_920473 |
Protein GI | 119719978 |
COG category | [C] Energy production and conversion |
COG ID | [COG0852] NADH:ubiquinone oxidoreductase 27 kD subunit [COG3261] Ni,Fe-hydrogenase III large subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGGGGAG GTAGGATACA GGGTAAGGTT CAGGTTCCTC CCGAAGGCAT ACGGGAAGCT GTTAGAAGGC TCGTCGCCGA AGGTTACGCT CACCTAACGG CGATAACGGC GGTCGACCTC CAGGACTCCA TAGAGCTGAT ATACCACTTC GACGGGGGCG GCGGCTACAG ACACGTCTCC GTGAAGCTAC CCAGGGCTAA CCCGACCCTT CCCTCCATCG CCCAGGTAGT CCCCGGCGCC GACTACTACG AGAAAGAGGT ACGCGACTTG TTCGGCGTCA ACTTCCAGGA CGGCGCGGAG AAGAGGCGCT TCATACTCCC GGAGTGCTAC CCGCCCGACG CGCCTCCCCC CTTACTAAAG GACGTCGACC CTGCCTCGGT GAAAGCGCTA GTCGCCGAGG CGCCGACGTG CCAGGTTGAC GCCGTGGGTT TGCACGCGCC TCTTTCCCCA GAAAGCGTGG TCGTACCCGT AGGCCCGTAC CACCCGGCCT TCAAGGAGCC GGAGTACTTC AGCCTGGTGG TAGACGGGGA GAGGATAGTG AAGGCCTTTG TACGCATAGG GTTCGTGCAC AGGGGTATAG AGAAGGCTGC GGAGTCTAGG AGCTTCTTCA GGGACATCTT CCTCGTGGAG AGGATATGCG GGATATGCAG TACTTCCCAC GCGTGGTGCT TCGTAGAGGC TGTCGAGAGG CTCTTAAACA TGGAGGTGCC CAGGAGGGCG GCCTACCTTC GAACCCTCGT CGCGGAGCTG GAGAGAATAC ACTCCCACGC TCTCTGGCTG GGCCTCGTGG GCTACTGGAC GGGCTTCGAG ACAATGTTCA TGTGGGTCTG GGGGCTCCGA GAGACCATAA TGGACCTACT CGAGGAGATC TCCGGGAACA GGGTCCACAA GTCCTTCGTA ACCATAGGCG GGGTGCGGCG CGACGTTCCC GACGCCCTGC TGAGACGCGT CTCCGAGAGG GTTGAGACGT TCCGCAGGGA GTTCGAGTCC CTGCTTGAAA GCCTGTCGTA CTCCGAGGAG ATAATCTCCA GGACGAAGGG GATAGGGAGG TACAGCCTGG ATGATGCCAG GAAGTACTGC GCTGTGGGGC CTGTTAGGAG GGCTGCGGGA GACCCGTTCG ACGTGAGAAG AATAGAGCCC TACGGGGCCT ACCCGGAGGT GGAATTCGAG GTCCCCACGT CCAACGAGGG AGACGTGTAC AACCTCATTA GGGTCCGAGT AAAGGAGGTC GTAGAGTCGG CCAGCATAAT CCAGCAGTTG GTCGAAAAGA TGCCTTCGGG CAACCCCGTA CCTCCTAAGC CGTTCATGGG CACGGTTCCC GAGGGTGAGG CTTACAGCAG GATCGAGGCT CCGAGGGGCG AGCTGTTCTA CTACGTAAAG GCGAATAACA CACACAACCC CTACAGGGTT AAGGTTAGAA CCCCGACCCT GGCGAACATC CAGCTCGCGG CGAAGATCCT CGAGGGGCAG ACGCTGTCCG ACTTCCCAGT CGTCGTCACG AGCATAGACC CCTGCTTCAG CTGTATGGAC AGGGTGACTA TAATCGACTT GTCCGGGAAG AGAAGGGAAG TTTCTTCCAA GTTTTTCGAG TCGCTTAGAG GGGTGAGGAG ATGA
|
Protein sequence | MGGGRIQGKV QVPPEGIREA VRRLVAEGYA HLTAITAVDL QDSIELIYHF DGGGGYRHVS VKLPRANPTL PSIAQVVPGA DYYEKEVRDL FGVNFQDGAE KRRFILPECY PPDAPPPLLK DVDPASVKAL VAEAPTCQVD AVGLHAPLSP ESVVVPVGPY HPAFKEPEYF SLVVDGERIV KAFVRIGFVH RGIEKAAESR SFFRDIFLVE RICGICSTSH AWCFVEAVER LLNMEVPRRA AYLRTLVAEL ERIHSHALWL GLVGYWTGFE TMFMWVWGLR ETIMDLLEEI SGNRVHKSFV TIGGVRRDVP DALLRRVSER VETFRREFES LLESLSYSEE IISRTKGIGR YSLDDARKYC AVGPVRRAAG DPFDVRRIEP YGAYPEVEFE VPTSNEGDVY NLIRVRVKEV VESASIIQQL VEKMPSGNPV PPKPFMGTVP EGEAYSRIEA PRGELFYYVK ANNTHNPYRV KVRTPTLANI QLAAKILEGQ TLSDFPVVVT SIDPCFSCMD RVTIIDLSGK RREVSSKFFE SLRGVRR
|
| |