Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tneu_1847 |
Symbol | |
ID | 6164779 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermoproteus neutrophilus V24Sta |
Kingdom | Archaea |
Replicon accession | NC_010525 |
Strand | + |
Start bp | 1627513 |
End bp | 1630689 |
Gene Length | 3177 bp |
Protein Length | 1058 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 641669010 |
Product | molybdopterin oxidoreductase |
Protein accession | YP_001795210 |
Protein GI | 171186291 |
COG category | [C] Energy production and conversion |
COG ID | [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.758609 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAGTCT CTAGAAGAGA TGTTTTGAAA GCGGGAGCCA CCATTGGCTT GATCGGCGGC GTATCCGGCG TTCTGCTGAA GGCTGTGGCT GAACAGACTA AGGCCGAGGC GGCCTCTGCG GTAACGTCGG TGCCCTCCAT CTGCGGGATG TGTATGGCCC AGTGCGCGAT CTATATCGAC GTCGTAGACG GCAAGCCGGT GCGGATTAGG CCAAATACAA ACGCGCCGAC GAGCGCCAAG GGCATATGCG CCCGCGGCGT CTCCGGCACC TTCAACGCCT GGCTTAACCC AGACGCGGTG AAGAAGCCCA TGGCCAGGAA GGCCCTCGTG GACTGGGCCC AGGGGAAGAT AAGCTGGGAG GAGGCCAAGA GACAGCTTGT AACAAACCGC GGCAGATACG ACGACATGGT CGAGGTGGAT TGGAAAACCG CCATAGACAT AATCGCCAAG AAGCTCAAGG AGCTGGCCGA CAACAACGAG CGCCACGCCT TCACGTTCCT CTTCGGCGCC TGGGGGCCCG TCGCCAGCAT GAGGGCGGGC GTGCCGCTCA TGAGGTTCGC AGATACATAC GGCGGGGGCA TGATAACCTT CGACAACCCC TACTGCACCT ACCCGAGATA CCTAGGCCAC TGGCTCACCT GGGGCCACGG GCACCAAGCC CACGTCGCTT GTATAGACTA CGGCGAGGCC GAGGCTGTGC TTGTGGTGAG GAGAAACGTC ATCGGCGCTG GGGTTGTCAC TGAGACGTGG CGCTTTATGG AGGCCGTGAA ACGCGGGGCG AGGCTGGTGG TCCTAAGCCC GGTGTTTGAC GAAACCGCCT CCTACGCCGA CGTGTGGCTC CCGGTGAAGC CGGGGACAGA CCTCGCGGTG CTTCTGGCCT TCATAAAGTA CGTGCTTGAC AACGGCTACT ACATGGCCGA GTATCTGAGG CGGTTTACCA ACGCCCCCTT CCTCATAAAG CCGGACGGCC TCCCCCTGCT GGCCTCTGAG GTAGACTGGG GTAAATACGG CGTGAAAGAG CCGGCTTTCG CCTACGTCGT GTGGGACGAA GCCGCCGGCG GGCCGGCCCC CGACAACGCG GCGCAGAGGG CGGCTCTCTT CGGCGAGTAC GAGGTGGCGC TTAAAGACGG GAGCCCGGTC AGGGCGAAGA CCGCGTTGCA GATACTCAGG GAGTGGGTAA AGGCGAACCT CTCGGCGCTG GCGGAGAAAC ACGGCGTGAA GGACTACATG GAGGCCGCCG CCAGAGAGGC CGACGTCGAC GTAAACGACC TCAGAAGGGC GGCGGAGATC GTGGCTAAAT ACAGGGCTGT GTCTCCCATC GGCTGGCACG ACCCAAGATA CAGCAATACT CCACAGACGT GGAGAGCAGT CGGAGTGTTG ATGGCACTCC TCGGGAGAAT ACAGCAGCCG GGCGGCCTCT TCCTACTCAC ACACCTCATA ATGCCCTACG CAGACGTGTA TACGAAAGTA ATGAAATATA CAAAGAAGGA CATCCCCTAC AAGACCATCC GCGGCTTGAC CTTCGGCGAG TACGTCCCCG CGAACCTCCG CGGCATATAT GTCATCCCCA TAGCGCCCCC CCTCCCCGGC CCCAGCGATA GAGGCGCGCC GCCGGTCAAG TCGTTAACTG AGGTCTGGGG CGAGGAGGCG GAGAAGAAGG GCTACCTCTA CCCCTACGAC ACAGTGCAGG CGCTTTACGA GAGCGTGGTT CACGGCAAGC CGTTTAAGAT CAAGGTGGTG TTCATCACCG GCTCCAACCC CATTCCGCGG ATTGGAAACA GCAGACTTGT GGAGGAGATC TTTAGAAACT TGGAGCTCGT AATCGTCCAC GACATCCAGT TCAACGACAC AACGGCCTTC GCCGACGTAA TACTGCCGGA TCTGCCCTAT CTCGAGCGGC TTGATCTGGC GCTTCCCGGG CCCTTCTCGC CGTTTCCAGC CATATCTGTC AGATTCCCCT GGTACTACGA GGAGTACAAG AAGAAGCTGG CGGCCGGCGG GAAGCCGGGC GAGTTAGACA AGGCGTTTAG GTCTAGAAAT GGGCGGACGG CCTTAGAGGT GTTGGTCATG ATCGCCCGGA GACTTTCGGA GCTGGGCATC AAACCCCGCG ACAGAACTGA GTGGTCTCAG AACATGCCCG TGGGGATGAT CACCGAAGAG GGCATACTCC CCATACCCAA CCTGGAGAGG TTCATCAACG CGCAACTCCG TAGAGTTCGT ATCGTGGACG AGGCGGGGAA CGTGAGGGCG CCCACGGTGG AGGACATCTA CAAGATGGGT GGCTACATGG TGTTGGTGCC CACGGGCCGC GTTGAAGCTG TGAAAGACGA GCTCTGGAGC AATGCGCTGG GGAGAGACGT GGTGGTGAGG GTACACGTCT ACAGGCCTGT GAAATACAGC GTTGACCTCG AGGAGTGGCT CTGGAGGACT ATCTACTACA ACTCGCCGAT GGCGAGGGGG GAGGTGCCGT TGCCGACGCC CAGCGGCAGG GTGGAGATAT ACAGCATAAA CCTGGCCTAC GACGTGAGGA GGGTCTTCGG CAAGCCCGCC ACTTCGATCG ACCCCTCCGA CCTAGAGGGT AAGAAGAGCG GCGTGGATCC GCTGTTTTCG CCAGTGCCGC TCTACGCGGG TATGGCTAGG CCGGACTACA TGTGGGCAAC CGGCCCGGCG ACGGAGGACG TGGAGATAAA CGGGCTGGTC CCGCCGGAGC CGCCCAAGAG ACTGCTGCTC GTATACCGCC ACGGGCCCTA CACCCACACC CACAGCAATA CTCAGAACAA CCTCCTGCTT GACACACTAA CCTCCAGTGA GCTGTTGTCC GCCTGGATAC ATCCAGACAC CGCGGCGGCC CTCGGCGTAA AAGACGGCGA TTGGATAGAG GTGAAGCCCG CGGCGCCCAA AGTGGCAAAA CAGCTGGAGT CGGTAGGCGT AAAGGAGGCG CCCACGGCCC GGTTTAGGGT GAGAGTTACG CCTATGGTGA GGCGGGACAT CATCGCCATC TACCACTACT GGCTTGTGCC AAGGGGTAGG CTAAGGGTCA AGGCATGGAA GCTGGCCGAC GTTAGGGCTG GCTACAGCGA CGACAACTAC CTAGGCCCGA TGTTGGCCGG GAAGCTCGGC ACGCCTGGCG CCATGGGTAA CACCGTTGTG GAAGTGAGCA AGGTGGGCGG GCTATGA
|
Protein sequence | MEVSRRDVLK AGATIGLIGG VSGVLLKAVA EQTKAEAASA VTSVPSICGM CMAQCAIYID VVDGKPVRIR PNTNAPTSAK GICARGVSGT FNAWLNPDAV KKPMARKALV DWAQGKISWE EAKRQLVTNR GRYDDMVEVD WKTAIDIIAK KLKELADNNE RHAFTFLFGA WGPVASMRAG VPLMRFADTY GGGMITFDNP YCTYPRYLGH WLTWGHGHQA HVACIDYGEA EAVLVVRRNV IGAGVVTETW RFMEAVKRGA RLVVLSPVFD ETASYADVWL PVKPGTDLAV LLAFIKYVLD NGYYMAEYLR RFTNAPFLIK PDGLPLLASE VDWGKYGVKE PAFAYVVWDE AAGGPAPDNA AQRAALFGEY EVALKDGSPV RAKTALQILR EWVKANLSAL AEKHGVKDYM EAAAREADVD VNDLRRAAEI VAKYRAVSPI GWHDPRYSNT PQTWRAVGVL MALLGRIQQP GGLFLLTHLI MPYADVYTKV MKYTKKDIPY KTIRGLTFGE YVPANLRGIY VIPIAPPLPG PSDRGAPPVK SLTEVWGEEA EKKGYLYPYD TVQALYESVV HGKPFKIKVV FITGSNPIPR IGNSRLVEEI FRNLELVIVH DIQFNDTTAF ADVILPDLPY LERLDLALPG PFSPFPAISV RFPWYYEEYK KKLAAGGKPG ELDKAFRSRN GRTALEVLVM IARRLSELGI KPRDRTEWSQ NMPVGMITEE GILPIPNLER FINAQLRRVR IVDEAGNVRA PTVEDIYKMG GYMVLVPTGR VEAVKDELWS NALGRDVVVR VHVYRPVKYS VDLEEWLWRT IYYNSPMARG EVPLPTPSGR VEIYSINLAY DVRRVFGKPA TSIDPSDLEG KKSGVDPLFS PVPLYAGMAR PDYMWATGPA TEDVEINGLV PPEPPKRLLL VYRHGPYTHT HSNTQNNLLL DTLTSSELLS AWIHPDTAAA LGVKDGDWIE VKPAAPKVAK QLESVGVKEA PTARFRVRVT PMVRRDIIAI YHYWLVPRGR LRVKAWKLAD VRAGYSDDNY LGPMLAGKLG TPGAMGNTVV EVSKVGGL
|
| |