Gene Information |
Locus tag | Tneu_0407 |
Symbol | |
ID | 6166157 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermoproteus neutrophilus V24Sta |
Kingdom | Archaea |
Replicon accession | NC_010525 |
Strand | + |
Start bp | 368542 |
End bp | 370458 |
Gene Length | 1917 bp |
Protein Length | 638 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 641667565 |
Product | nickel-dependent hydrogenase large subunit |
Protein accession | YP_001793801 |
Protein GI | 171184882 |
COG category | [C] Energy production and conversion |
COG ID | [COG0374] Ni,Fe-hydrogenase I large subunit |
TIGRFAM ID | |
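The coordinate and length fields above are mutually consistent, and the relationship can be checked in a few lines. This is a minimal sketch, assuming 1-based inclusive coordinates (which the listed values imply) and an unspliced CDS that ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(368542, 370458)
print(length, protein_length(length))  # 1917 638
```

Both printed values match the Gene Length and Protein Length fields of this record.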
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.781594 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.0807519 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGTCTACGG TAAAGATCTG GATTGACCCC ATTACGCGTA TTGAGGGACA TCTAGCTTTA TACGCCGAGA TAAACTCGGC CAGCAGAGCT GTGCAGACGG CTAGAACTAC CGTCATGATG TTCCGCGGGT TCGAGGTCTT CCTAAGGGGG AGGCCACCCG AGGACGCTCC CCACATAGTT TCTCGTACAT GTGGCGTCTG CGGCGCGGCC CACGCCAACG CGTCTGTGAG GGCATGCGAC GTAGCCGCCG GAATGACGCC GTATCCCATG GGAAACGTGT TGAGGACTCT GGCCTACGCC ATGACCGACT ACACCTACGA CCACCCGCTG ATTTTGAACA TGTTGGAGGG GCCGGACTAC AGCGAGGCTA TCGTCAGCAA GCTGACGCCG TCGGTGTGGA AGATCGCCCA GGAGACGCCG GCTCAGTACT CCGCCATTCA CGGATACCGG ACGATTGCAG ATATCATGAA GGATCTGAAC CCCATCACCG GCAGGATATG GCAGTTGACT GTGAAGTACC AGAGGATCGC GAGGGAGGCC GGCGTCTTGA TCTACGGCAG ACACTCCCAC CCGTCTACGT TGATTCCGGC GGGCATTTCG ACGGACATCT CCAATCTGCC GTCTCTCATC CAGGAGTACT ATGCGAGGCT TTCGCAACTT ACCGCCTGGG TTAAGTTCGT CTGGGCTATC TGGCAGGACC TCTACGAGTT CTACCGCGAC CACGTAACTA CGCCGGATGG TAAGCCCTAC GCCACTACCC AAGGCAAGAC CCACGACCCG CCGGTGATGC TCGCCGGCGG TTTTGCAGAC GACCCCGAGG TGTACAGCAA CATAACCGAC GAGGCGAAGG GCGACTGGAG GGAGCTCTAC GCGAGGCTTG ACCAGGCCTA CAACGCGAGG GGGGAGAAGC CGGGCTTCGC CATCGGCCAC GATATATACA GCAAGAACCC GACGGAGATC CAGCTCGGCT ATGTGGAGTT CGCCGACTCT TCCTTCTACG AGGACTGGGT GAAGAGCAAC GTGGCGCCGC CCACCGGCTG GATAAAGGAG GACCCCATCG GCCGGCCGTT GGTAAATGGA ACAGAGCTCT TCAAGTACCA CATGTGGAAT AGGACCACTA TCCCGAAGCC CGGCGCCATC AACTTCGCTG AGAAGTATAG CTGGGCGGCT GAGCCTAGGC TTGTGCTTAA GGACGGCCGC ATCGCGCCTA TTGAGACCGG CCCCATCTCG CAACTGTGGC TTGACACGCT ACACGCCACG AAGTTTGAGC TGGCCGGCTT CAAGGCCTGG GAGTCCAACG GTAGCCAGAT GAAGATCTAT CTCCCCGGCG GGACCGAGGC GCCCGACCTG CCGCCCGGCA CCAAGGACGA GCTCGTCATT ACGTGGAATC TGCCCAAGTA CTCAACGACG TTTGAACGTC TGCTGGCCCG CGCGGTTCAC CTCGCCGTGG TGAACGCCAT CGCTTGGGCC AACCTGCTGT ATAGCCTACA GCTGGTGAAC GCCGGCAAGA TACAGACCTC TAGGCCGTGG AGCTACGGCA AGTGGCCCGA CTTCAGCTAC AGCTTCGGCT GGTGGCAGGT GCCGCGTGGC AACTGTATGC ACTGGCTCGT GCAGAAGGGC GGGAGGATCG TGAACTACCA GTACGAGGCT CCGACGACGC CTAACGTCAG CCCGTCTAAT AACCGCTGTA CTGACCCGTG GAAGGGCCAG TGCGCCGGGC CCTTCGAGAT GTCTGTAAGG AACAGCGTGG TGACGGAGGA GCTTCCGCCT GACCAGTGGA CTGGCCTCGA CCAGGTGAGG 
GCAATCAGGA GCTTCGACCC ATGTCTCGCC TGTGCGGTGC ACTTCGAGGC CAAGGGCGAA GGGGGCAGAG TTATGAACGT GATTGAGAAG GTGATCTGGA ATGCCTGCGC CATTTAA
Protein sequence | MSTVKIWIDP ITRIEGHLAL YAEINSASRA VQTARTTVMM FRGFEVFLRG RPPEDAPHIV SRTCGVCGAA HANASVRACD VAAGMTPYPM GNVLRTLAYA MTDYTYDHPL ILNMLEGPDY SEAIVSKLTP SVWKIAQETP AQYSAIHGYR TIADIMKDLN PITGRIWQLT VKYQRIAREA GVLIYGRHSH PSTLIPAGIS TDISNLPSLI QEYYARLSQL TAWVKFVWAI WQDLYEFYRD HVTTPDGKPY ATTQGKTHDP PVMLAGGFAD DPEVYSNITD EAKGDWRELY ARLDQAYNAR GEKPGFAIGH DIYSKNPTEI QLGYVEFADS SFYEDWVKSN VAPPTGWIKE DPIGRPLVNG TELFKYHMWN RTTIPKPGAI NFAEKYSWAA EPRLVLKDGR IAPIETGPIS QLWLDTLHAT KFELAGFKAW ESNGSQMKIY LPGGTEAPDL PPGTKDELVI TWNLPKYSTT FERLLARAVH LAVVNAIAWA NLLYSLQLVN AGKIQTSRPW SYGKWPDFSY SFGWWQVPRG NCMHWLVQKG GRIVNYQYEA PTTPNVSPSN NRCTDPWKGQ CAGPFEMSVR NSVVTEELPP DQWTGLDQVR AIRSFDPCLA CAVHFEAKGE GGRVMNVIEK VIWNACAI
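The protein sequence can be cross-checked against the gene sequence by translating codons. The sketch below is a hedged illustration, not the tool used to produce this record. It builds the codon table by hand, relying on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which start codons are permitted. Alternative start codons are not handled, which does not matter here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with the standard amino-acid string.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA (whitespace ignored) up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    # Walk complete codons only; any trailing 1-2 bases are ignored.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, as grouped in the record.
print(translate("ATGTCTACGG TAAAGATCTG GATTGACCCC"))  # MSTVKIWIDP
# Last 18 bases of the gene sequence, ending in the TAA stop codon.
print(translate("AATGCCTGCGCCATTTAA"))  # NACAI
```

The two outputs agree with the first ten and last five residues of the protein sequence listed above. Passing the full gene sequence to `translate` would allow a complete comparison.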