Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nther_0967 |
Symbol | |
ID | 6316856 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natranaerobius thermophilus JW/NM-WN-LF |
Kingdom | Bacteria |
Replicon accession | NC_010718 |
Strand | - |
Start bp | 1025658 |
End bp | 1028537 |
Gene Length | 2880 bp |
Protein Length | 959 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 642643339 |
Product | protein of unknown function UPF0182 |
Protein accession | YP_001917139 |
Protein GI | 188585594 |
COG category | [S] Function unknown |
COG ID | [COG1615] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.000819285 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 43 |
Fosmid unclonability p-value | 0.392007 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTTTGGA AAAAATATCG TAATTATATT ATTTTGGCAA TTATAGCCTT AATTCTCTTA TTTAGCCGAA GATTCGCCGT TATCGTGACT GATATTCAAT GGTTTCAAGC TCTAGATTTT TTTGAACTGT TTTGGAAACC TATTGCTATC CAATACCTTT TTTCGATAAT CGCCTTCATT TTAGGAGCAG TTTTCATATA TCTTAATTTA GGTCTTTTAA TAAATAATTT AATAACTCCT GAATATGAGA CTAAGTTTTA TGAGACGGGT TTAGCTGAAT ACGCTGATGT CATAACATCC ATGGGTAAAA TTGGAAGATT AATCATTTCC ATAATAATCA GTTTTATCTG GATTGGAAGT TTTGCTAATA TTTGGGAAAA CATCGTCTAT GTGATAGGAA GTCAAAGTAC CGGTATTGTT GACCCTATTT TTAATATAGA TGCGGTTTTT TATTTATTCC ATTTACCCTT TTTACAGAGA CTAGTAAGTC CAATTTTGGC TTTACTGATT TTAACAATTA TAGCTGTTTC AATAATTTAT TTTTCAAAAG GCCTGTTATC ATGGACGATA TTTCGTAAAT ATTCGCCATT ACAGGCTCCT GCTTTAAAGC ATCTCAATTA CCTTCTAAGT GGAATATTTT TAATACTGAC ATTCCGGTTT ATATTGTCCT TATTTAATCT TATGTTCTCT CCTCGAGGTG CAGTTTACGG TATTGGATTT ACTGATCTAT ACATATCAGC TCCCGTATAT ATTATTCTTG CATTATTGTC TATAGCCAGT TTGGTATTGA CATTAGGTAA TTTTAAATGG AAAAAATACC GACTATCTCT TTATTGTTTC GGAGGTATAA TTGCAGTTTC TATTCTCGGC GGTATAGTAG CTACTGCTGT TCAATCAATC GTTGTAGCCC CTAATGAACT TACTCGAGAG GGACCATTTA TCAATAATCA TTTAGAAATG ACACAAAAAG CTTACGGTAT TGATGATATT ACTGAAAGAA CCTGGGAAAT TGAAGATAAC GAAAATGGCA CTGAAAATAC TATTGAGAAT AACGTTAATG AAGATGAAGC TAATCAAATT GAAGAGGATC TGGATCAAGA AGAGGGTGTT CCTCAAGAAG AAGATGATGC TCCCCAAGAA GAGGGTTCTC CTCAGGAAGA AGGAGTAGCA GCTCCCGAAA TCATTCCAGA AGTTATTAAC AATGTCAGGT TACTGGATTA TCGTCCTCTG AGAGACGTCT ATCGGGAAGC TCAAGAATAT CGACGCTATT ATCATTTCAA TGATGTAGAT ATAGCTAGAT ATACTATTGA TGATCAATAT CACCAAGTTA TGTTGGCAGC GAGAGAGATG GATGTGGATC GTCTTCCAAC AGGTGCTCAA ACGGCAGTTA ACAGGCATTT AAAATATACA CACGGTTACG GGCTAGCCAT GAGCCCAGTC GGAGATTTGA CTCCAGATGG GCTTCCTCGT TATTTCTTCC AAGATATGCC CGTACAAGAT CATCTCGATA TGGGATTAGA GCGGCCGGAA CTTTATTTTG GCGAACTAAC CAATGATTTT GTGATTGTTA ACAGTGAAGA AACAGAATTT CATTATCCTG GTCAAGAAGA TGTAGAGTTA ATCTATGAAG GTGAATCAGG CATTAACATG CATTTTATGA ACCGATTGTT ATTTGCCCTG AGAAAATTCA ATAGTTTCAT ACTCTTTTCA GGGGAGTTTT CATCGGAAAG CCAAATCTTA TTCAACCGTA ATATCAAAGA ACGTGTAGAG CGAATAGCAC CTTTCTTAAG ATACGATCAA GATCCCTATC TTGCTGTAGC TGACGATAGA CTTTACTGGA TTATGGATGC CTATGTGGAA ACCAATCAGT TCCCTTACAG CCAACCCTTT GAAGATAGTA CTAATTATAT TCGAAATCCA GTTAAGGTAG TCATAGACGC TTACAGTGGT ACTGTGGATT TTTATTTAAT TGAAGACGAT GAACCCTTCA CTCAAGCACT GGATAGAGCA TTCCCTGACC TATTTTCCAA CCTGGATGAG TTATCTAAGG AATTGAGAGC CCAATTCCGC TATCCAGAAG ATTTATTTAG TGCCCAGGCC CATATACTAC AAAATTATCA TATGGAAAAT CCCGTGATTT TCTACAACAG GGAAGATGCC TGGGATATTC CTACTGAAAA CTACCAAAAT GAAACTATTA CCATGGAACC GTATTATGCC ACTTTAAATT TAGGTGAAGA AGAACGGCCC GAGTTTGTCC TAATGATTCC CTATACTCCT GTAGAGCGGA ATAATATGAT TTCTTGGTTA GGTGCTAGAA ATGATGGAGA AAATTACGGT GAATTGGTTC TGTTCAGATT CCCTTCAGGC GAACATATCT ATGGCCCTCA ACAAATTGAA TTCCGAATAG ATCAAGATCC CCAGATATCC CAACAGATCA GTTTATGGGA TACCCGAGGA TCAAGGGTAA TCCGTGGAAA TTTACTAGTA ATTCCTTTAG AAAACGGCAT CCTGTATATT GAACCTTTAT ACCTCCAAGC TGAAGCCAGT AGTTTTCCTG AAATGAGAAG AGTCTTAAGC TTTTGGCAAG GAGATCTAGT TATGGCCGAT ACCCTGGATG AAGCTTTGGC TATGCATGGA GTAGACCCTG AAGAATTAGA TTTGGAAGAT CCTGATGATA TTGAAGATAT TGAGGAGATT GAAGATTTAC CTGATACTGA TATCGCTGGG CTTGATCAAT TATCTCAAGA AGCTTTGGAT CTTTACCGTC AAGCTGATGA AGCTCTTCGC CAGGGTAATT GGACAGAATA CGGGGCCACT ATTGAAGAGT TAGAAGGAGT CTTAGAAGAA ATTCAGTTAA GAGTCGACGA AAACTTTTAG
|
Protein sequence | MFWKKYRNYI ILAIIALILL FSRRFAVIVT DIQWFQALDF FELFWKPIAI QYLFSIIAFI LGAVFIYLNL GLLINNLITP EYETKFYETG LAEYADVITS MGKIGRLIIS IIISFIWIGS FANIWENIVY VIGSQSTGIV DPIFNIDAVF YLFHLPFLQR LVSPILALLI LTIIAVSIIY FSKGLLSWTI FRKYSPLQAP ALKHLNYLLS GIFLILTFRF ILSLFNLMFS PRGAVYGIGF TDLYISAPVY IILALLSIAS LVLTLGNFKW KKYRLSLYCF GGIIAVSILG GIVATAVQSI VVAPNELTRE GPFINNHLEM TQKAYGIDDI TERTWEIEDN ENGTENTIEN NVNEDEANQI EEDLDQEEGV PQEEDDAPQE EGSPQEEGVA APEIIPEVIN NVRLLDYRPL RDVYREAQEY RRYYHFNDVD IARYTIDDQY HQVMLAAREM DVDRLPTGAQ TAVNRHLKYT HGYGLAMSPV GDLTPDGLPR YFFQDMPVQD HLDMGLERPE LYFGELTNDF VIVNSEETEF HYPGQEDVEL IYEGESGINM HFMNRLLFAL RKFNSFILFS GEFSSESQIL FNRNIKERVE RIAPFLRYDQ DPYLAVADDR LYWIMDAYVE TNQFPYSQPF EDSTNYIRNP VKVVIDAYSG TVDFYLIEDD EPFTQALDRA FPDLFSNLDE LSKELRAQFR YPEDLFSAQA HILQNYHMEN PVIFYNREDA WDIPTENYQN ETITMEPYYA TLNLGEEERP EFVLMIPYTP VERNNMISWL GARNDGENYG ELVLFRFPSG EHIYGPQQIE FRIDQDPQIS QQISLWDTRG SRVIRGNLLV IPLENGILYI EPLYLQAEAS SFPEMRRVLS FWQGDLVMAD TLDEALAMHG VDPEELDLED PDDIEDIEEI EDLPDTDIAG LDQLSQEALD LYRQADEALR QGNWTEYGAT IEELEGVLEE IQLRVDENF
|
| |