Gene Tneu_1847 details


Gene Information

Locus tag: Tneu_1847
Symbol:
ID: 6164779
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermoproteus neutrophilus V24Sta
Kingdom: Archaea
Replicon accession: NC_010525
Strand:
Start bp: 1627513
End bp: 1630689
Gene Length: 3177 bp
Protein Length: 1058 aa
Translation table: 11
GC content: 61%
IMG OID: 641669010
Product: molybdopterin oxidoreductase
Protein accession: YP_001795210
Protein GI: 171186291
COG category: [C] Energy production and conversion
COG ID: [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing
TIGRFAM ID:
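
The start, end, gene length, and protein length values listed above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; both assumptions are consistent with the listed values, but neither is stated on this page. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 1627513
END_BP = 1630689


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count of a CDS, assuming one terminal stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints "3177 1058"
```

Both results agree with the Gene Length (3177 bp) and Protein Length (1058 aa) fields.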


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value: 0.758609
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAAGTCT CTAGAAGAGA TGTTTTGAAA GCGGGAGCCA CCATTGGCTT GATCGGCGGC 
GTATCCGGCG TTCTGCTGAA GGCTGTGGCT GAACAGACTA AGGCCGAGGC GGCCTCTGCG
GTAACGTCGG TGCCCTCCAT CTGCGGGATG TGTATGGCCC AGTGCGCGAT CTATATCGAC
GTCGTAGACG GCAAGCCGGT GCGGATTAGG CCAAATACAA ACGCGCCGAC GAGCGCCAAG
GGCATATGCG CCCGCGGCGT CTCCGGCACC TTCAACGCCT GGCTTAACCC AGACGCGGTG
AAGAAGCCCA TGGCCAGGAA GGCCCTCGTG GACTGGGCCC AGGGGAAGAT AAGCTGGGAG
GAGGCCAAGA GACAGCTTGT AACAAACCGC GGCAGATACG ACGACATGGT CGAGGTGGAT
TGGAAAACCG CCATAGACAT AATCGCCAAG AAGCTCAAGG AGCTGGCCGA CAACAACGAG
CGCCACGCCT TCACGTTCCT CTTCGGCGCC TGGGGGCCCG TCGCCAGCAT GAGGGCGGGC
GTGCCGCTCA TGAGGTTCGC AGATACATAC GGCGGGGGCA TGATAACCTT CGACAACCCC
TACTGCACCT ACCCGAGATA CCTAGGCCAC TGGCTCACCT GGGGCCACGG GCACCAAGCC
CACGTCGCTT GTATAGACTA CGGCGAGGCC GAGGCTGTGC TTGTGGTGAG GAGAAACGTC
ATCGGCGCTG GGGTTGTCAC TGAGACGTGG CGCTTTATGG AGGCCGTGAA ACGCGGGGCG
AGGCTGGTGG TCCTAAGCCC GGTGTTTGAC GAAACCGCCT CCTACGCCGA CGTGTGGCTC
CCGGTGAAGC CGGGGACAGA CCTCGCGGTG CTTCTGGCCT TCATAAAGTA CGTGCTTGAC
AACGGCTACT ACATGGCCGA GTATCTGAGG CGGTTTACCA ACGCCCCCTT CCTCATAAAG
CCGGACGGCC TCCCCCTGCT GGCCTCTGAG GTAGACTGGG GTAAATACGG CGTGAAAGAG
CCGGCTTTCG CCTACGTCGT GTGGGACGAA GCCGCCGGCG GGCCGGCCCC CGACAACGCG
GCGCAGAGGG CGGCTCTCTT CGGCGAGTAC GAGGTGGCGC TTAAAGACGG GAGCCCGGTC
AGGGCGAAGA CCGCGTTGCA GATACTCAGG GAGTGGGTAA AGGCGAACCT CTCGGCGCTG
GCGGAGAAAC ACGGCGTGAA GGACTACATG GAGGCCGCCG CCAGAGAGGC CGACGTCGAC
GTAAACGACC TCAGAAGGGC GGCGGAGATC GTGGCTAAAT ACAGGGCTGT GTCTCCCATC
GGCTGGCACG ACCCAAGATA CAGCAATACT CCACAGACGT GGAGAGCAGT CGGAGTGTTG
ATGGCACTCC TCGGGAGAAT ACAGCAGCCG GGCGGCCTCT TCCTACTCAC ACACCTCATA
ATGCCCTACG CAGACGTGTA TACGAAAGTA ATGAAATATA CAAAGAAGGA CATCCCCTAC
AAGACCATCC GCGGCTTGAC CTTCGGCGAG TACGTCCCCG CGAACCTCCG CGGCATATAT
GTCATCCCCA TAGCGCCCCC CCTCCCCGGC CCCAGCGATA GAGGCGCGCC GCCGGTCAAG
TCGTTAACTG AGGTCTGGGG CGAGGAGGCG GAGAAGAAGG GCTACCTCTA CCCCTACGAC
ACAGTGCAGG CGCTTTACGA GAGCGTGGTT CACGGCAAGC CGTTTAAGAT CAAGGTGGTG
TTCATCACCG GCTCCAACCC CATTCCGCGG ATTGGAAACA GCAGACTTGT GGAGGAGATC
TTTAGAAACT TGGAGCTCGT AATCGTCCAC GACATCCAGT TCAACGACAC AACGGCCTTC
GCCGACGTAA TACTGCCGGA TCTGCCCTAT CTCGAGCGGC TTGATCTGGC GCTTCCCGGG
CCCTTCTCGC CGTTTCCAGC CATATCTGTC AGATTCCCCT GGTACTACGA GGAGTACAAG
AAGAAGCTGG CGGCCGGCGG GAAGCCGGGC GAGTTAGACA AGGCGTTTAG GTCTAGAAAT
GGGCGGACGG CCTTAGAGGT GTTGGTCATG ATCGCCCGGA GACTTTCGGA GCTGGGCATC
AAACCCCGCG ACAGAACTGA GTGGTCTCAG AACATGCCCG TGGGGATGAT CACCGAAGAG
GGCATACTCC CCATACCCAA CCTGGAGAGG TTCATCAACG CGCAACTCCG TAGAGTTCGT
ATCGTGGACG AGGCGGGGAA CGTGAGGGCG CCCACGGTGG AGGACATCTA CAAGATGGGT
GGCTACATGG TGTTGGTGCC CACGGGCCGC GTTGAAGCTG TGAAAGACGA GCTCTGGAGC
AATGCGCTGG GGAGAGACGT GGTGGTGAGG GTACACGTCT ACAGGCCTGT GAAATACAGC
GTTGACCTCG AGGAGTGGCT CTGGAGGACT ATCTACTACA ACTCGCCGAT GGCGAGGGGG
GAGGTGCCGT TGCCGACGCC CAGCGGCAGG GTGGAGATAT ACAGCATAAA CCTGGCCTAC
GACGTGAGGA GGGTCTTCGG CAAGCCCGCC ACTTCGATCG ACCCCTCCGA CCTAGAGGGT
AAGAAGAGCG GCGTGGATCC GCTGTTTTCG CCAGTGCCGC TCTACGCGGG TATGGCTAGG
CCGGACTACA TGTGGGCAAC CGGCCCGGCG ACGGAGGACG TGGAGATAAA CGGGCTGGTC
CCGCCGGAGC CGCCCAAGAG ACTGCTGCTC GTATACCGCC ACGGGCCCTA CACCCACACC
CACAGCAATA CTCAGAACAA CCTCCTGCTT GACACACTAA CCTCCAGTGA GCTGTTGTCC
GCCTGGATAC ATCCAGACAC CGCGGCGGCC CTCGGCGTAA AAGACGGCGA TTGGATAGAG
GTGAAGCCCG CGGCGCCCAA AGTGGCAAAA CAGCTGGAGT CGGTAGGCGT AAAGGAGGCG
CCCACGGCCC GGTTTAGGGT GAGAGTTACG CCTATGGTGA GGCGGGACAT CATCGCCATC
TACCACTACT GGCTTGTGCC AAGGGGTAGG CTAAGGGTCA AGGCATGGAA GCTGGCCGAC
GTTAGGGCTG GCTACAGCGA CGACAACTAC CTAGGCCCGA TGTTGGCCGG GAAGCTCGGC
ACGCCTGGCG CCATGGGTAA CACCGTTGTG GAAGTGAGCA AGGTGGGCGG GCTATGA
 
Protein sequence
MEVSRRDVLK AGATIGLIGG VSGVLLKAVA EQTKAEAASA VTSVPSICGM CMAQCAIYID 
VVDGKPVRIR PNTNAPTSAK GICARGVSGT FNAWLNPDAV KKPMARKALV DWAQGKISWE
EAKRQLVTNR GRYDDMVEVD WKTAIDIIAK KLKELADNNE RHAFTFLFGA WGPVASMRAG
VPLMRFADTY GGGMITFDNP YCTYPRYLGH WLTWGHGHQA HVACIDYGEA EAVLVVRRNV
IGAGVVTETW RFMEAVKRGA RLVVLSPVFD ETASYADVWL PVKPGTDLAV LLAFIKYVLD
NGYYMAEYLR RFTNAPFLIK PDGLPLLASE VDWGKYGVKE PAFAYVVWDE AAGGPAPDNA
AQRAALFGEY EVALKDGSPV RAKTALQILR EWVKANLSAL AEKHGVKDYM EAAAREADVD
VNDLRRAAEI VAKYRAVSPI GWHDPRYSNT PQTWRAVGVL MALLGRIQQP GGLFLLTHLI
MPYADVYTKV MKYTKKDIPY KTIRGLTFGE YVPANLRGIY VIPIAPPLPG PSDRGAPPVK
SLTEVWGEEA EKKGYLYPYD TVQALYESVV HGKPFKIKVV FITGSNPIPR IGNSRLVEEI
FRNLELVIVH DIQFNDTTAF ADVILPDLPY LERLDLALPG PFSPFPAISV RFPWYYEEYK
KKLAAGGKPG ELDKAFRSRN GRTALEVLVM IARRLSELGI KPRDRTEWSQ NMPVGMITEE
GILPIPNLER FINAQLRRVR IVDEAGNVRA PTVEDIYKMG GYMVLVPTGR VEAVKDELWS
NALGRDVVVR VHVYRPVKYS VDLEEWLWRT IYYNSPMARG EVPLPTPSGR VEIYSINLAY
DVRRVFGKPA TSIDPSDLEG KKSGVDPLFS PVPLYAGMAR PDYMWATGPA TEDVEINGLV
PPEPPKRLLL VYRHGPYTHT HSNTQNNLLL DTLTSSELLS AWIHPDTAAA LGVKDGDWIE
VKPAAPKVAK QLESVGVKEA PTARFRVRVT PMVRRDIIAI YHYWLVPRGR LRVKAWKLAD
VRAGYSDDNY LGPMLAGKLG TPGAMGNTVV EVSKVGGL
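
As a consistency check between the two sequences, the first 60 bases of the gene sequence can be translated and compared with the first 20 residues of the protein sequence. This is a minimal sketch, not the pipeline used to produce this page: it builds the codon table by hand, relying on the fact that NCBI translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the tables differ in permitted start codons). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; third position
# varies fastest). Table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1; whitespace is ignored, '*' marks a stop."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First line of the gene sequence, copied with its 10-base spacing.
first_block = (
    "ATGGAAGTCT CTAGAAGAGA TGTTTTGAAA GCGGGAGCCA CCATTGGCTT GATCGGCGGC"
)
print(translate(first_block))  # prints "MEVSRRDVLKAGATIGLIGG"
```

The output matches the first two 10-residue groups of the protein sequence (MEVSRRDVLK AGATIGLIGG). Translating the full gene sequence the same way would end in `*`, since the final codon is TGA.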