Gene Tneu_0407 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTneu_0407 
Symbol 
ID6166157 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermoproteus neutrophilus V24Sta 
KingdomArchaea 
Replicon accessionNC_010525 
Strand
Start bp368542 
End bp370458 
Gene Length1917 bp 
Protein Length638 aa 
Translation table11 
GC content60% 
IMG OID641667565 
Productnickel-dependent hydrogenase large subunit 
Protein accessionYP_001793801 
Protein GI171184882 
COG category[C] Energy production and conversion 
COG ID[COG0374] Ni,Fe-hydrogenase I large subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.781594 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.0807519 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTACGG TAAAGATCTG GATTGACCCC ATTACGCGTA TTGAGGGACA TCTAGCTTTA 
TACGCCGAGA TAAACTCGGC CAGCAGAGCT GTGCAGACGG CTAGAACTAC CGTCATGATG
TTCCGCGGGT TCGAGGTCTT CCTAAGGGGG AGGCCACCCG AGGACGCTCC CCACATAGTT
TCTCGTACAT GTGGCGTCTG CGGCGCGGCC CACGCCAACG CGTCTGTGAG GGCATGCGAC
GTAGCCGCCG GAATGACGCC GTATCCCATG GGAAACGTGT TGAGGACTCT GGCCTACGCC
ATGACCGACT ACACCTACGA CCACCCGCTG ATTTTGAACA TGTTGGAGGG GCCGGACTAC
AGCGAGGCTA TCGTCAGCAA GCTGACGCCG TCGGTGTGGA AGATCGCCCA GGAGACGCCG
GCTCAGTACT CCGCCATTCA CGGATACCGG ACGATTGCAG ATATCATGAA GGATCTGAAC
CCCATCACCG GCAGGATATG GCAGTTGACT GTGAAGTACC AGAGGATCGC GAGGGAGGCC
GGCGTCTTGA TCTACGGCAG ACACTCCCAC CCGTCTACGT TGATTCCGGC GGGCATTTCG
ACGGACATCT CCAATCTGCC GTCTCTCATC CAGGAGTACT ATGCGAGGCT TTCGCAACTT
ACCGCCTGGG TTAAGTTCGT CTGGGCTATC TGGCAGGACC TCTACGAGTT CTACCGCGAC
CACGTAACTA CGCCGGATGG TAAGCCCTAC GCCACTACCC AAGGCAAGAC CCACGACCCG
CCGGTGATGC TCGCCGGCGG TTTTGCAGAC GACCCCGAGG TGTACAGCAA CATAACCGAC
GAGGCGAAGG GCGACTGGAG GGAGCTCTAC GCGAGGCTTG ACCAGGCCTA CAACGCGAGG
GGGGAGAAGC CGGGCTTCGC CATCGGCCAC GATATATACA GCAAGAACCC GACGGAGATC
CAGCTCGGCT ATGTGGAGTT CGCCGACTCT TCCTTCTACG AGGACTGGGT GAAGAGCAAC
GTGGCGCCGC CCACCGGCTG GATAAAGGAG GACCCCATCG GCCGGCCGTT GGTAAATGGA
ACAGAGCTCT TCAAGTACCA CATGTGGAAT AGGACCACTA TCCCGAAGCC CGGCGCCATC
AACTTCGCTG AGAAGTATAG CTGGGCGGCT GAGCCTAGGC TTGTGCTTAA GGACGGCCGC
ATCGCGCCTA TTGAGACCGG CCCCATCTCG CAACTGTGGC TTGACACGCT ACACGCCACG
AAGTTTGAGC TGGCCGGCTT CAAGGCCTGG GAGTCCAACG GTAGCCAGAT GAAGATCTAT
CTCCCCGGCG GGACCGAGGC GCCCGACCTG CCGCCCGGCA CCAAGGACGA GCTCGTCATT
ACGTGGAATC TGCCCAAGTA CTCAACGACG TTTGAACGTC TGCTGGCCCG CGCGGTTCAC
CTCGCCGTGG TGAACGCCAT CGCTTGGGCC AACCTGCTGT ATAGCCTACA GCTGGTGAAC
GCCGGCAAGA TACAGACCTC TAGGCCGTGG AGCTACGGCA AGTGGCCCGA CTTCAGCTAC
AGCTTCGGCT GGTGGCAGGT GCCGCGTGGC AACTGTATGC ACTGGCTCGT GCAGAAGGGC
GGGAGGATCG TGAACTACCA GTACGAGGCT CCGACGACGC CTAACGTCAG CCCGTCTAAT
AACCGCTGTA CTGACCCGTG GAAGGGCCAG TGCGCCGGGC CCTTCGAGAT GTCTGTAAGG
AACAGCGTGG TGACGGAGGA GCTTCCGCCT GACCAGTGGA CTGGCCTCGA CCAGGTGAGG
GCAATCAGGA GCTTCGACCC ATGTCTCGCC TGTGCGGTGC ACTTCGAGGC CAAGGGCGAA
GGGGGCAGAG TTATGAACGT GATTGAGAAG GTGATCTGGA ATGCCTGCGC CATTTAA
 
Protein sequence
MSTVKIWIDP ITRIEGHLAL YAEINSASRA VQTARTTVMM FRGFEVFLRG RPPEDAPHIV 
SRTCGVCGAA HANASVRACD VAAGMTPYPM GNVLRTLAYA MTDYTYDHPL ILNMLEGPDY
SEAIVSKLTP SVWKIAQETP AQYSAIHGYR TIADIMKDLN PITGRIWQLT VKYQRIAREA
GVLIYGRHSH PSTLIPAGIS TDISNLPSLI QEYYARLSQL TAWVKFVWAI WQDLYEFYRD
HVTTPDGKPY ATTQGKTHDP PVMLAGGFAD DPEVYSNITD EAKGDWRELY ARLDQAYNAR
GEKPGFAIGH DIYSKNPTEI QLGYVEFADS SFYEDWVKSN VAPPTGWIKE DPIGRPLVNG
TELFKYHMWN RTTIPKPGAI NFAEKYSWAA EPRLVLKDGR IAPIETGPIS QLWLDTLHAT
KFELAGFKAW ESNGSQMKIY LPGGTEAPDL PPGTKDELVI TWNLPKYSTT FERLLARAVH
LAVVNAIAWA NLLYSLQLVN AGKIQTSRPW SYGKWPDFSY SFGWWQVPRG NCMHWLVQKG
GRIVNYQYEA PTTPNVSPSN NRCTDPWKGQ CAGPFEMSVR NSVVTEELPP DQWTGLDQVR
AIRSFDPCLA CAVHFEAKGE GGRVMNVIEK VIWNACAI