Gene Information |
Locus tag | Tpen_1802 |
Symbol | |
ID | 4601795 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | - |
Start bp | 1741983 |
End bp | 1743161 |
Gene Length | 1179 bp |
Protein Length | 392 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 639774575 |
Product | FAD-dependent pyridine nucleotide-disulphide oxidoreductase |
Protein accession | YP_921200 |
Protein GI | 119720705 |
COG category | [C] Energy production and conversion |
COG ID | [COG1252] NADH dehydrogenase, FAD-containing subunit |
TIGRFAM ID | |
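The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and an unspliced CDS whose final codon is a stop codon; the function names `gene_length` and `protein_length` are hypothetical helpers for illustration, not part of any database tool.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the Start bp / End bp rows of this record.
length = gene_length(1741983, 1743161)
print(length)                  # 1179
print(protein_length(length))  # 392
```

Both results agree with the Gene Length (1179 bp) and Protein Length (392 aa) rows.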
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00556002 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAGCGTG TAGTCATCAT AGGAGGCGGT GGAGGAGGAG CCATACTGGC CAACCTTCTC CCGGAGGAGT TCAAGGTAAC GGTCGTCGAT AAAAGCGAGG TACACTTCTT CCAGCCGGGC AACCTCTGGA TAGCGTTCAA GGGGGTTAGG AAGGAGAAGT TTCTCAGGCC TCTACGCTCC CTCCTGAAAC CCAGGGTAGA ATTCGTCCAC GACGAGGTCG TAAGCGTGGA TCTCAACGAG AGGGTTGTGA AGACGGCCTC CGGGAAAAGC TTGAGCTACG ACTACGTGGT TTTCGCCAGC GGGGCGGAGC TGGACTACGG CTCCGTGCCC GGCCACAGAG AGCTACTCGA GAGGTTCGGG GACTTCTACT CCACGCCCGA GAACGCCGAG AAGCTGCACG CCTCGCTGAG AGGCTTAAAG GAGGGTAGGT TCGTGATAGG GATAGCGGAT CCTGTGTACA AGTGCCCTCC GGGGCCGCAC AAGGCGGCCT TCTTGTCCTG GGAGTTCTTC GCCAGGAGGG GTCTAAGTGA CAAGGTGAAG GTTGTCCTCG CCGTGCCAGT ACCCCACGCG TACCCGTCTA AAACGATCGC GGACATAATA GAGCCCGAGC TGAACTCTCG CGGTATAGAG CTGCACACGT TTTTCACTGT GAACGAGGTG GACGTGGCGA ACAAGAGGAT AGTCAGCCTT GAAGGCGAAG AACTCTCCTT CGACGTGGCA GCCGTAGTTC CGCCGCACAG GGGTCCTAGC TACGCCGTTA ACCCGGCGGA GGTTAAGGAC GGGAGTGGCT ACATAAAGAT AGACAAGTAC ACTAGCCGGG TGGAGGGCTT CGACGATGCC TACGCCATAG GGGACTGTAC AAACGCGCCT ACCTCTAAGA GTGGCGTCAC GGCCCACCTA CAGGCAGAAG TCGTAGCCGC GAGGCTTCAG GGGATCGATG CCAGGTACAG CGGTAGGACG AACTGCCCCC TGATAACCGA CGGTAAAGGG TTGTTCGTTA TAAGCGACTA CGACCACCCG CCGATACCCG TAAGACTCTC GAAGTTCAAG CGGCTCATGG AGGACTTCTT CGTGGCTACC TACTGGAGCG CTGTGAGAAG CCCCGAGCTC TGGAGCCCCA TATTCAGGGC TTACTTCGAG GCGACGGACG AGTTTATAAG GAGGGGGGAG GGGTGGTAG
|
Protein sequence | MERVVIIGGG GGGAILANLL PEEFKVTVVD KSEVHFFQPG NLWIAFKGVR KEKFLRPLRS LLKPRVEFVH DEVVSVDLNE RVVKTASGKS LSYDYVVFAS GAELDYGSVP GHRELLERFG DFYSTPENAE KLHASLRGLK EGRFVIGIAD PVYKCPPGPH KAAFLSWEFF ARRGLSDKVK VVLAVPVPHA YPSKTIADII EPELNSRGIE LHTFFTVNEV DVANKRIVSL EGEELSFDVA AVVPPHRGPS YAVNPAEVKD GSGYIKIDKY TSRVEGFDDA YAIGDCTNAP TSKSGVTAHL QAEVVAARLQ GIDARYSGRT NCPLITDGKG LFVISDYDHP PIPVRLSKFK RLMEDFFVAT YWSAVRSPEL WSPIFRAYFE ATDEFIRRGE GW
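The sequences above are printed in space-separated blocks of ten characters, so they need light cleaning before any computation. The following is a minimal sketch of that cleanup plus a GC-fraction calculation and a basic CDS sanity check. The helper names (`clean_sequence`, `gc_fraction`, `looks_like_complete_cds`) are hypothetical, and requiring an ATG start is a simplification: translation table 11 also permits alternative start codons such as GTG and TTG.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(raw: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """Simplified check: length divisible by 3, ATG start, terminal stop codon."""
    return len(seq) % 3 == 0 and seq[:3] == "ATG" and seq[-3:] in STOP_CODONS


# First two blocks of the gene sequence in this record, used as a small example.
prefix = clean_sequence("ATGGAGCGTG TAGTCATCAT")
print(len(prefix))          # 20
print(gc_fraction(prefix))  # 0.45
```

Passing the full Gene sequence field through `clean_sequence` and `gc_fraction` gives a value that can be compared against the GC content row of the record.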