Gene Tpen_1802 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_1802 
Symbol 
ID4601795 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp1741983 
End bp1743161 
Gene Length1179 bp 
Protein Length392 aa 
Translation table11 
GC content58% 
IMG OID639774575 
ProductFAD-dependent pyridine nucleotide-disulphide oxidoreductase 
Protein accessionYP_921200 
Protein GI119720705 
COG category[C] Energy production and conversion 
COG ID[COG1252] NADH dehydrogenase, FAD-containing subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00556002 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAGCGTG TAGTCATCAT AGGAGGCGGT GGAGGAGGAG CCATACTGGC CAACCTTCTC 
CCGGAGGAGT TCAAGGTAAC GGTCGTCGAT AAAAGCGAGG TACACTTCTT CCAGCCGGGC
AACCTCTGGA TAGCGTTCAA GGGGGTTAGG AAGGAGAAGT TTCTCAGGCC TCTACGCTCC
CTCCTGAAAC CCAGGGTAGA ATTCGTCCAC GACGAGGTCG TAAGCGTGGA TCTCAACGAG
AGGGTTGTGA AGACGGCCTC CGGGAAAAGC TTGAGCTACG ACTACGTGGT TTTCGCCAGC
GGGGCGGAGC TGGACTACGG CTCCGTGCCC GGCCACAGAG AGCTACTCGA GAGGTTCGGG
GACTTCTACT CCACGCCCGA GAACGCCGAG AAGCTGCACG CCTCGCTGAG AGGCTTAAAG
GAGGGTAGGT TCGTGATAGG GATAGCGGAT CCTGTGTACA AGTGCCCTCC GGGGCCGCAC
AAGGCGGCCT TCTTGTCCTG GGAGTTCTTC GCCAGGAGGG GTCTAAGTGA CAAGGTGAAG
GTTGTCCTCG CCGTGCCAGT ACCCCACGCG TACCCGTCTA AAACGATCGC GGACATAATA
GAGCCCGAGC TGAACTCTCG CGGTATAGAG CTGCACACGT TTTTCACTGT GAACGAGGTG
GACGTGGCGA ACAAGAGGAT AGTCAGCCTT GAAGGCGAAG AACTCTCCTT CGACGTGGCA
GCCGTAGTTC CGCCGCACAG GGGTCCTAGC TACGCCGTTA ACCCGGCGGA GGTTAAGGAC
GGGAGTGGCT ACATAAAGAT AGACAAGTAC ACTAGCCGGG TGGAGGGCTT CGACGATGCC
TACGCCATAG GGGACTGTAC AAACGCGCCT ACCTCTAAGA GTGGCGTCAC GGCCCACCTA
CAGGCAGAAG TCGTAGCCGC GAGGCTTCAG GGGATCGATG CCAGGTACAG CGGTAGGACG
AACTGCCCCC TGATAACCGA CGGTAAAGGG TTGTTCGTTA TAAGCGACTA CGACCACCCG
CCGATACCCG TAAGACTCTC GAAGTTCAAG CGGCTCATGG AGGACTTCTT CGTGGCTACC
TACTGGAGCG CTGTGAGAAG CCCCGAGCTC TGGAGCCCCA TATTCAGGGC TTACTTCGAG
GCGACGGACG AGTTTATAAG GAGGGGGGAG GGGTGGTAG
 
Protein sequence
MERVVIIGGG GGGAILANLL PEEFKVTVVD KSEVHFFQPG NLWIAFKGVR KEKFLRPLRS 
LLKPRVEFVH DEVVSVDLNE RVVKTASGKS LSYDYVVFAS GAELDYGSVP GHRELLERFG
DFYSTPENAE KLHASLRGLK EGRFVIGIAD PVYKCPPGPH KAAFLSWEFF ARRGLSDKVK
VVLAVPVPHA YPSKTIADII EPELNSRGIE LHTFFTVNEV DVANKRIVSL EGEELSFDVA
AVVPPHRGPS YAVNPAEVKD GSGYIKIDKY TSRVEGFDDA YAIGDCTNAP TSKSGVTAHL
QAEVVAARLQ GIDARYSGRT NCPLITDGKG LFVISDYDHP PIPVRLSKFK RLMEDFFVAT
YWSAVRSPEL WSPIFRAYFE ATDEFIRRGE GW