Gene Tpen_1086 details


Gene Information

Locus tag: Tpen_1086
Symbol:
ID: 4600934
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermofilum pendens Hrk 5
Kingdom: Archaea
Replicon accession: NC_008698
Strand:
Start bp: 1021055
End bp: 1022557
Gene Length: 1503 bp
Protein Length: 500 aa
Translation table: 11
GC content: 62%
IMG OID: 639773863
Product: proton-translocating NADH-quinone oxidoreductase, chain M
Protein accession: YP_920488
Protein GI: 119719993
COG category: [C] Energy production and conversion
COG ID: [COG1008] NADH:ubiquinone oxidoreductase subunit 4 (chain M)
TIGRFAM ID: [TIGR01972] proton-translocating NADH-quinone oxidoreductase, chain M
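
The Start bp, End bp, Gene Length and Protein Length fields can be cross-checked against each other. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the coding sequence ends with a single stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1021055
END_BP = 1022557
GENE_LENGTH_BP = 1503
PROTEIN_LENGTH_AA = 500


def span_length(start, end):
    """Length of a coordinate interval, assuming 1-based inclusive ends."""
    return end - start + 1


def expected_protein_length(gene_length_bp):
    """Number of codons in an unspliced CDS, minus one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1503
print(expected_protein_length(GENE_LENGTH_BP))  # 500
```

Both results agree with the Gene Length and Protein Length fields listed for this record.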


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGTGTAC CGTACCTCTG GTTGGCGTTG CTGGTACCCT TAGCGGCGTC ACTGGCCTCG 
CTGGCGTTAA AGAGCAGGAA AGCTCTAGCC GCGCTGAACT CCTCTGCCCT AGCGTTCTCG
GCCGCCGTCC TCCTCCTACT CTACCTCACC AACGGCTCGA ACAGGTGGTT CGACCCCCTC
TCGTTCAAGC TGGGAAGCCT GGGGACTTTC TCGCTGGTTA TGGACCCCAT GGTATTCCTG
GTAGCGTTCA GCGTGGCGGT TACAACCTCT GTCATCGCGC TCTACAGCTC CCCCTACATG
GAGCATAGAT TCGAGGAGCT TGAACGCGAA GGGGTCTCGG CGCCCGGGTG GGGGGCCTAC
TACTTCCTCT ACACGCTCTT TGCCCTCTCG ATGATGGGCA CCGTCATGTC CACGAACATA
ATTGAGTTCT ACGTGTTCCT CGAGCTAACG CTCATCCCCA GCTTCCTGCT GATAGCCTTC
TACGGCTACG GCGAGAGGCT GAAGATAGCC ATAATGTACC TCATCTGGAC CCACGTGGGC
GCCCTCCTCT TCCTCATAGG TGCGCTGACC GTGGGCTCCA AGGTGGGCTT CGACTTCGTA
GATCCCGAGA AGGGCTTCCT CCTGGGGCTG GGCGAAGGCG TGGGCGTGCT CGCCTTCTGG
CTTATGGTGG TAGGGCTCTC GGTGAAGCTA GCGGCGGCGG GCCTGCACAT GTGGCTCCCC
TACGCCCACG CTGAGGCCCC CACGCCCATC TCGGCGCTCC TAAGCCCGAA CCTCATCGGG
CTGGGAGGAG CGATGATGTT CCGCGTCGTC TACGTGCTTT TCCCGAAAAC CTTCGCCGCC
GCCTCGCCGG TACTGATGGC GTGGGCCCTC GTAACGATGA TCTACGGCGG GCTTATGGCG
CTGAGCCAGT CCGATTTCAA GAGGCTCCTC GCGTACAGTA GCATCAGCCA GATGGGCTAC
CTCCTGCTGG GGCTCGCCTC GGTCGACGTG TACGGCGTCG CCGGGACGTT CCTGCACTAC
ATGGTGCACG CCTTCGGCAA GGCTATACTC TTCGCCGTGG CGGGCATACT GATAGCCACG
TACCACGGCC TGCGGGACAT AACGAGGATG GGAGGCCTCG CCTCGAAGAT GCCCTACACG
GCCTCGCTGG CGCTCATCGG CTTCATGCAC ATCACGGGTA TACCTCCAAC CCTGGGCATC
TGGAGCGAGT ACCTGATACT AAGAGGAGCC GTCGCGCACG CCCTAGCCCT TGGAGCCCCC
TCGTACGTGC TCCTGGCGGC GGCCCTTCTC GTGGGTATAG GGCTCTCGAC AGCCTACTCC
TTCCTGACGA TGAGGAGGGT GTTCTACGGG CCCCTAAAGG TACCTGAGGC GCGTGAGGCC
GGTAAAGCCC TCTGGGCGCC GCTCCTAGCC TTCGCAGTGC TGGGCGTGCT GTTCTTCGTG
TGCGCCTCCC TGCTCATAGA CCCGCTGGTC TCGTCGCTCG GAGGGCTCGG GCTGGGTGGT
TGA
 
Protein sequence
MGVPYLWLAL LVPLAASLAS LALKSRKALA ALNSSALAFS AAVLLLLYLT NGSNRWFDPL 
SFKLGSLGTF SLVMDPMVFL VAFSVAVTTS VIALYSSPYM EHRFEELERE GVSAPGWGAY
YFLYTLFALS MMGTVMSTNI IEFYVFLELT LIPSFLLIAF YGYGERLKIA IMYLIWTHVG
ALLFLIGALT VGSKVGFDFV DPEKGFLLGL GEGVGVLAFW LMVVGLSVKL AAAGLHMWLP
YAHAEAPTPI SALLSPNLIG LGGAMMFRVV YVLFPKTFAA ASPVLMAWAL VTMIYGGLMA
LSQSDFKRLL AYSSISQMGY LLLGLASVDV YGVAGTFLHY MVHAFGKAIL FAVAGILIAT
YHGLRDITRM GGLASKMPYT ASLALIGFMH ITGIPPTLGI WSEYLILRGA VAHALALGAP
SYVLLAAALL VGIGLSTAYS FLTMRRVFYG PLKVPEAREA GKALWAPLLA FAVLGVLFFV
CASLLIDPLV SSLGGLGLGG
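
The gene and protein sequences above are printed in blocks of 10 characters. The sketch below strips that spacing, translates a nucleotide string codon by codon, and computes GC content. It uses the standard codon assignments, which translation table 11 shares. It does not apply table 11's alternative start codons, which is not an issue here because this gene starts with ATG. The helper names (`clean`, `gc_content`, `translate`) are hypothetical, chosen only for this illustration.

```python
# Standard codon assignments, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def clean(seq):
    """Remove the block spacing and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq):
    """Translate from the first base in frame, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        amino_acid = CODON_TABLE[s[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence, copied from this record.
first_line = "ATGGGTGTAC CGTACCTCTG GTTGGCGTTG CTGGTACCCT TAGCGGCGTC ACTGGCCTCG"
print(translate(first_line))  # MGVPYLWLALLVPLAASLAS
```

The printed residues match the first 20 residues of the protein sequence listed above. Passing the full gene sequence to `translate` and `gc_content` would allow the same comparison against the complete protein and the reported GC content.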