Gene Tpen_1071 details

Gene Information

Locus tag: Tpen_1071
Symbol:
ID: 4601649
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermofilum pendens Hrk 5
Kingdom: Archaea
Replicon accession: NC_008698
Strand:
Start bp: 1009107
End bp: 1010720
Gene Length: 1614 bp
Protein Length: 537 aa
Translation table: 11
GC content: 60%
IMG OID: 639773848
Product: NADH-ubiquinone oxidoreductase, chain 49kDa
Protein accession: YP_920473
Protein GI: 119719978
COG category: [C] Energy production and conversion
COG ID: [COG0852] NADH:ubiquinone oxidoreductase 27 kD subunit
        [COG3261] Ni,Fe-hydrogenase III large subunit
TIGRFAM ID:
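
The start, end, gene length, and protein length fields above are consistent with one another. The following is a minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon (both are assumptions, though they match the listed values). The function names are hypothetical and not part of any database tool.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a complete CDS: codon count minus one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1009107, 1010720)
print(length, protein_length(length))  # prints: 1614 537
```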


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGGGGAG GTAGGATACA GGGTAAGGTT CAGGTTCCTC CCGAAGGCAT ACGGGAAGCT 
GTTAGAAGGC TCGTCGCCGA AGGTTACGCT CACCTAACGG CGATAACGGC GGTCGACCTC
CAGGACTCCA TAGAGCTGAT ATACCACTTC GACGGGGGCG GCGGCTACAG ACACGTCTCC
GTGAAGCTAC CCAGGGCTAA CCCGACCCTT CCCTCCATCG CCCAGGTAGT CCCCGGCGCC
GACTACTACG AGAAAGAGGT ACGCGACTTG TTCGGCGTCA ACTTCCAGGA CGGCGCGGAG
AAGAGGCGCT TCATACTCCC GGAGTGCTAC CCGCCCGACG CGCCTCCCCC CTTACTAAAG
GACGTCGACC CTGCCTCGGT GAAAGCGCTA GTCGCCGAGG CGCCGACGTG CCAGGTTGAC
GCCGTGGGTT TGCACGCGCC TCTTTCCCCA GAAAGCGTGG TCGTACCCGT AGGCCCGTAC
CACCCGGCCT TCAAGGAGCC GGAGTACTTC AGCCTGGTGG TAGACGGGGA GAGGATAGTG
AAGGCCTTTG TACGCATAGG GTTCGTGCAC AGGGGTATAG AGAAGGCTGC GGAGTCTAGG
AGCTTCTTCA GGGACATCTT CCTCGTGGAG AGGATATGCG GGATATGCAG TACTTCCCAC
GCGTGGTGCT TCGTAGAGGC TGTCGAGAGG CTCTTAAACA TGGAGGTGCC CAGGAGGGCG
GCCTACCTTC GAACCCTCGT CGCGGAGCTG GAGAGAATAC ACTCCCACGC TCTCTGGCTG
GGCCTCGTGG GCTACTGGAC GGGCTTCGAG ACAATGTTCA TGTGGGTCTG GGGGCTCCGA
GAGACCATAA TGGACCTACT CGAGGAGATC TCCGGGAACA GGGTCCACAA GTCCTTCGTA
ACCATAGGCG GGGTGCGGCG CGACGTTCCC GACGCCCTGC TGAGACGCGT CTCCGAGAGG
GTTGAGACGT TCCGCAGGGA GTTCGAGTCC CTGCTTGAAA GCCTGTCGTA CTCCGAGGAG
ATAATCTCCA GGACGAAGGG GATAGGGAGG TACAGCCTGG ATGATGCCAG GAAGTACTGC
GCTGTGGGGC CTGTTAGGAG GGCTGCGGGA GACCCGTTCG ACGTGAGAAG AATAGAGCCC
TACGGGGCCT ACCCGGAGGT GGAATTCGAG GTCCCCACGT CCAACGAGGG AGACGTGTAC
AACCTCATTA GGGTCCGAGT AAAGGAGGTC GTAGAGTCGG CCAGCATAAT CCAGCAGTTG
GTCGAAAAGA TGCCTTCGGG CAACCCCGTA CCTCCTAAGC CGTTCATGGG CACGGTTCCC
GAGGGTGAGG CTTACAGCAG GATCGAGGCT CCGAGGGGCG AGCTGTTCTA CTACGTAAAG
GCGAATAACA CACACAACCC CTACAGGGTT AAGGTTAGAA CCCCGACCCT GGCGAACATC
CAGCTCGCGG CGAAGATCCT CGAGGGGCAG ACGCTGTCCG ACTTCCCAGT CGTCGTCACG
AGCATAGACC CCTGCTTCAG CTGTATGGAC AGGGTGACTA TAATCGACTT GTCCGGGAAG
AGAAGGGAAG TTTCTTCCAA GTTTTTCGAG TCGCTTAGAG GGGTGAGGAG ATGA
 
Protein sequence
MGGGRIQGKV QVPPEGIREA VRRLVAEGYA HLTAITAVDL QDSIELIYHF DGGGGYRHVS 
VKLPRANPTL PSIAQVVPGA DYYEKEVRDL FGVNFQDGAE KRRFILPECY PPDAPPPLLK
DVDPASVKAL VAEAPTCQVD AVGLHAPLSP ESVVVPVGPY HPAFKEPEYF SLVVDGERIV
KAFVRIGFVH RGIEKAAESR SFFRDIFLVE RICGICSTSH AWCFVEAVER LLNMEVPRRA
AYLRTLVAEL ERIHSHALWL GLVGYWTGFE TMFMWVWGLR ETIMDLLEEI SGNRVHKSFV
TIGGVRRDVP DALLRRVSER VETFRREFES LLESLSYSEE IISRTKGIGR YSLDDARKYC
AVGPVRRAAG DPFDVRRIEP YGAYPEVEFE VPTSNEGDVY NLIRVRVKEV VESASIIQQL
VEKMPSGNPV PPKPFMGTVP EGEAYSRIEA PRGELFYYVK ANNTHNPYRV KVRTPTLANI
QLAAKILEGQ TLSDFPVVVT SIDPCFSCMD RVTIIDLSGK RREVSSKFFE SLRGVRR
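
The sequences above are printed in space-separated blocks of ten characters. As a rough sketch of how the record could be checked, the code below strips that layout, computes a GC fraction, and translates the coding sequence. It uses the standard codon assignments, which translation table 11 shares with table 1; table 11's alternative start codons are not handled, which does not matter here because this gene starts with ATG. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and only the first row of the gene sequence is used as input.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq_text: str) -> str:
    """Remove the block spacing and line breaks from a printed sequence."""
    return "".join(seq_text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in seq) / len(seq)


def translate(cds: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First row of the gene sequence listed above.
first_row = "ATGGGGGGAG GTAGGATACA GGGTAAGGTT CAGGTTCCTC CCGAAGGCAT ACGGGAAGCT"
cds = clean(first_row)
print(len(cds), translate(cds))  # prints: 60 MGGGRIQGKVQVPPEGIREA
```

The 20 residues produced from the first row match the start of the listed protein sequence. Passing the full gene sequence through `clean` and `gc_fraction` would give a value to compare against the GC content field.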