Gene Tpen_0183 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_0183 
Symbol 
ID4600885 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp157287 
End bp158990 
Gene Length1704 bp 
Protein Length567 aa 
Translation table11 
GC content54% 
IMG OID639772937 
ProductNADH dehydrogenase (ubiquinone), 30 kDa subunit 
Protein accessionYP_919596 
Protein GI119719101 
COG category[C] Energy production and conversion 
COG ID[COG3261] Ni,Fe-hydrogenase III large subunit
[COG3262] Ni,Fe-hydrogenase III component G 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACAATG GGATGGAGAG ACTAGAGGAA AAGATCAGAA AGGAAATCGG AGGGCTGTTA 
CTAGGAGTTG AGAAACCTTA TCCCGATCAG CTATACCTAT ACGTCGAGAG GACGAAGCTC
CCAGAAGTCT GCTCGTACGT TTACTACGAG CTGGGAGGCT TCCTCTCTAC CATGGCCGGA
AGTGACGAGA GGAGCCTGCA CGGAGCGTAT AGACTCTACT ACGTTTTCTC CATAGAGGAG
GGTTACGAAG ATGGCAAGAA ACCGTGGGTT GTGGTAGTAA CGAATATACC GCCCCACGAT
ACGAAGTTCT CCTCAGTAAC GCCCAAGATT CCCGCGGCTT CCTGGTACGA GAGGGAGGTG
AGAGACCTCT TAGGGCTAGT TCCCGAGAAC CATCCGGATC CGCGCAGACT GGTATTGCCC
GACGACTGGC CCGAGGGTGT TCATCCTCTT AGAAAGGAGT TCACGTACAC GGAGAGGCCT
CCTAGCGTAG CTAGAAAGTG CGAGTTTAGA CCGAGCGTAG AAGGAGAAGG CATAATGCAA
GTGCCTATAG GCCCCGTACA CGCCGTTGCC GACGAGCCCG GGCAGTTCAG AGTGTTCCTT
GATGGAGAGA AGGTAGTCGA CGTCGACTAC AGGATGTTCT ACGTCCACCG CGGCATTGAA
AAACTCGCGG AGAGCAGGCT GACCTACAAC CAAGTACCGT TCATCGCGGA GAGAATATGT
GGCATATGCG GGTATGCTCA TTCGTGCGCG TACTGCCAGG CAGTCGAGCA GGCGCTAGGC
ATAGAGGTTC CAGAGAGGGC ATTGTACATA AGGACGCTCA TGCTGGAGGT GGAGAGGCTG
CACAGCCACC TGCTGAACTT AGGCCTAGCA TGCCACCTCG CGGGCTTCGA CTGGGGCTTC
ATGGCCTTCT TCAAGGCTAG AGAGAAAGTA ATGTACATGG CAGAGCTCCT CACGGGCGGC
AGGAAGACTT ACGGTATGAA CGTGGTCGGG GGCGTGAGAA GGGACATAAC CGAGGACAGA
GCGAAGAAGG CCCTCGAAAT CCTGAAGGAA GTGGAAAAGG AGTATAAGGC TGTCCTAGAC
GCTGTTCTCA GCACGTCTAC GCTTGTAAGC CGCGCAAAGG ACGTTGGAGT ACTGCCTAGG
GACGTCGCAA GAAAGGTCAG CGTGGTTGGA CCGGTTGCCC GCGGCTCTTC GATAAAGAGG
GACACGAGGA AGGATCACCC CTACGCCGCG TACAGCGAGG TGGACTTTAA GGTCCCAGTG
CACAGCGAGG GAGACGTCCT CGCGAGACTC AGCGTAAGAG CCGAGGAAAC TTTCGAAACC
ATAAGTATCA TTAGGCAGGT GCTTGAGAAC ATGCCGGGAG GACCTATACA AGCCGAGGTG
AAGGAGTACA GACCTTACGC CAGGGGACTG GGCTACGTTG AAGCTCCGAG AGGTGAAGAC
GTGCACTTTG TCATAACCGG CCCGCACAGC AAGGTTTACA GGTGGAGGGT GAGAGCCTCA
ACGTACAATA ACTGGCCGGC AATACCCTAC ATGTGCCGAG GCTATACGCT GGCAGATCTC
CCGTTGATAA TTGGGAGCAT TGACCCCTGC TATAGTTGTA CGGAGAGAGT GATCGTTGTG
GACGTAAAGA GTGGGAGGAC CCGCGTTTTT CCGTACGAGT ATTTGGTTTC TCTCTCTCGT
AAAGGTGGTA AGCCATGGGC ATAA
 
Protein sequence
MNNGMERLEE KIRKEIGGLL LGVEKPYPDQ LYLYVERTKL PEVCSYVYYE LGGFLSTMAG 
SDERSLHGAY RLYYVFSIEE GYEDGKKPWV VVVTNIPPHD TKFSSVTPKI PAASWYEREV
RDLLGLVPEN HPDPRRLVLP DDWPEGVHPL RKEFTYTERP PSVARKCEFR PSVEGEGIMQ
VPIGPVHAVA DEPGQFRVFL DGEKVVDVDY RMFYVHRGIE KLAESRLTYN QVPFIAERIC
GICGYAHSCA YCQAVEQALG IEVPERALYI RTLMLEVERL HSHLLNLGLA CHLAGFDWGF
MAFFKAREKV MYMAELLTGG RKTYGMNVVG GVRRDITEDR AKKALEILKE VEKEYKAVLD
AVLSTSTLVS RAKDVGVLPR DVARKVSVVG PVARGSSIKR DTRKDHPYAA YSEVDFKVPV
HSEGDVLARL SVRAEETFET ISIIRQVLEN MPGGPIQAEV KEYRPYARGL GYVEAPRGED
VHFVITGPHS KVYRWRVRAS TYNNWPAIPY MCRGYTLADL PLIIGSIDPC YSCTERVIVV
DVKSGRTRVF PYEYLVSLSR KGGKPWA