Gene Tneu_0843 details

Gene Information

Locus tag: Tneu_0843
Symbol:
ID: 6164375
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermoproteus neutrophilus V24Sta
Kingdom: Archaea
Replicon accession: NC_010525
Strand:
Start bp: 755598
End bp: 757511
Gene Length: 1914 bp
Protein Length: 637 aa
Translation table: 11
GC content: 65%
IMG OID: 641667999
Product: putative molybdopterin biosynthesis protein MoeA/LysR substrate binding-domain-containing protein
Protein accession: YP_001794226
Protein GI: 171185307
COG category: [H] Coenzyme transport and metabolism
              [P] Inorganic ion transport and metabolism
COG ID: [COG0303] Molybdopterin biosynthesis enzyme
        [COG1910] Periplasmic molybdate-binding protein/domain
TIGRFAM ID: [TIGR00177] molybdenum cofactor synthesis domain
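
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state the convention) and that the gene length includes one untranslated stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 755598
end_bp = 757511
gene_length_bp = 1914
protein_length_aa = 637

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with a single stop codon that is not translated.
coding_codons = gene_length_bp // 3 - 1

print(span_bp == gene_length_bp)
print(coding_codons == protein_length_aa)
```

Under these assumptions both comparisons hold for this record, so the listed coordinates, gene length, and protein length agree with one another.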


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.505015
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGGGTAA TCTTCCACAA CCTCGTCACG CTGGAGGAGG CTTCGTCTGC ACTGCTCAAG 
TTCGCCAAGC CTCTTGGCTC TGAGGAGGTG GACATAGCTG AGGCCTACGG CCGCGTCCTC
TCAAGAGATG TGGTGGCGCA GGTGGACGTG CCCCCCTTCG ATAGATCCAC CGTGGATGGC
TATGCGGTGG TGGCCGAGTC CACCTACGGC GCCTCGGAGC TCACGCCGGT CGAGCTCAAG
GTGGTGGGCC GGGTGGAGGC GGGCGGTTGG CCCAACGTCG AGGTGAGGCA GGGCGAGGCC
GTCGAGGTGG CCACCGGCGC CCCGCTCCCC CGGGGCGCAA ACGCGGTGGT TATGGTGGAG
TATACGCAGG AGAGAGACGG CGTTGTGAAG ATCTTCCGAT CCGTCGCGCC GGGGGAGAAC
GTCATGTCCG CCGGGTCGGA TATATCTGCG GGGGAGGTGG CCCTGAGGAG GTGCACCAGA
CTAACCGCCA GGGAGATCGG CGTATTGGCC GCCCTCGGGG TAAAGACGGT CCAGGTTTTG
AGGAGACCCC GGGTGGCCAT CATCTCGACT GGCAACGAGC TGGCGCCGCC GGGCGCTCCG
CTGGGGGTTG GGAAGCTCTA CGACGTCAAC AGCTATGCGC TCGCGGCGTC TGTGGCGGAG
GCCGGGGGCG TGCCGGAGCT CGTGGGAATT GTGAGAGACG ACGCTGGGGA GTACAGAAGG
GCCTTGGAGG CCGCCCTCTC TTCGTCAGAC GTCGTGTTGA TAAGCGGCGG GACATCGGCT
GGGGTAGCCG ATTTGACGTA CAGGGTCCTA GGGGAGATGG GGGAGGTGCT GTTCCACGGC
ATAATGGTTA AGCCGGGGAA GCCCACCCTG GCCGCCGCGG TGGGCGGAAA GATCGTAGTC
GGCCTCCCGG GCTACCCCTC CTCCGCCTTG ATGATATTCC ACACGGTGGT GAGGCCCTTC
CTACTGAAGC TACAGTGCCT AGAGCCCGAC CCCCCTGCCG CCGTGAAAGC CTCCTTGGCT
GTTGGAGTCG AGGGGGCGAA GGGGCGGCGT GCCCTCTACC CCGTCGTGCT AGTGGGGAGG
GGAGGGAGCT ACAGGGCGTA TCCCCTATAC GCAGAGTCCG GCGCTATATC TGTGTTGGCG
AGGGCGGACG GATACGTCGT GGTGCCGGAG AACGTGGAGT TCTTGGCCGA GGGGGAGGAG
GTGGAGGTGA GGCTTTTCGA GAAATATAGG CCGGCGGAGT TCTACTTCAT CGGCAGCCAC
GACCCCCATC TCGACGCGGC GCTGGCTAGG CGCAACATAA AGGCCGTCTA CGTCGGGTCG
ATGGGGGGTC TCATGGCGGT TAAACGCGGC GAGGCCGATA TGGCAGGGGT GCACGTGTTC
GACCCGGGGA GCGGGCTCTA CAACACGCCG TTTGTGGAAA AGCTCGATAT AAGAGACGTC
GCTGTGGTGG GGCTTTACGA GCGGGAGCAG GGGCTGATGG TGCAGAGGGG GAACCCCAAG
GGCGTGAGGG GGGTTGAGGA CCTCCTCAGG CCCGACGTCG TGTTCGTAAA CAGGCCTAGG
GGCACGGGGA CCAGAGCGCT TTTGGACATG CTCCTGGGGG AGGCGGCGAG GAGGCTCGGG
CTTACCCTTG AGGAGGCGGC GGCGAAGATA AGGGGCTACA CCCACGAGGT TAAAACCCAC
ACGGCCGTGG CGGCCGCGGT AGCCCAGGGG AGAGCCGACG TTGGAGTCGG CGTGAGGTAC
GCGGCTGAGC TGTACGGCCT TGAGTTCATA CCGCTGGGCT GGGAGCAGTA CGACTTGGTG
GTGAAGAGGG GGGCTTTGGA GAAGGCGCTT GAGATCGCGG CTGAGCTGCT CAGCGATCTG
CCGCGGGGGT ACAGGAGGTA CGAGTGGTCG GGGAGGGTTA AGCTTCTTCG ATAG
 
Protein sequence
MRVIFHNLVT LEEASSALLK FAKPLGSEEV DIAEAYGRVL SRDVVAQVDV PPFDRSTVDG 
YAVVAESTYG ASELTPVELK VVGRVEAGGW PNVEVRQGEA VEVATGAPLP RGANAVVMVE
YTQERDGVVK IFRSVAPGEN VMSAGSDISA GEVALRRCTR LTAREIGVLA ALGVKTVQVL
RRPRVAIIST GNELAPPGAP LGVGKLYDVN SYALAASVAE AGGVPELVGI VRDDAGEYRR
ALEAALSSSD VVLISGGTSA GVADLTYRVL GEMGEVLFHG IMVKPGKPTL AAAVGGKIVV
GLPGYPSSAL MIFHTVVRPF LLKLQCLEPD PPAAVKASLA VGVEGAKGRR ALYPVVLVGR
GGSYRAYPLY AESGAISVLA RADGYVVVPE NVEFLAEGEE VEVRLFEKYR PAEFYFIGSH
DPHLDAALAR RNIKAVYVGS MGGLMAVKRG EADMAGVHVF DPGSGLYNTP FVEKLDIRDV
AVVGLYEREQ GLMVQRGNPK GVRGVEDLLR PDVVFVNRPR GTGTRALLDM LLGEAARRLG
LTLEEAAAKI RGYTHEVKTH TAVAAAVAQG RADVGVGVRY AAELYGLEFI PLGWEQYDLV
VKRGALEKAL EIAAELLSDL PRGYRRYEWS GRVKLLR
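
The sequences above are printed in blocks of ten characters. The following is a minimal sketch for turning such a listing into one contiguous string and computing basic statistics, assuming whitespace is the only separator. `join_blocks` and `gc_fraction` are hypothetical helper names, and the input shown is only the first line of the gene sequence, not the whole gene.

```python
def join_blocks(listing: str) -> str:
    """Remove all whitespace from a block-formatted sequence listing."""
    return "".join(listing.split())


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence only (illustrative input).
first_line = "ATGAGGGTAA TCTTCCACAA CCTCGTCACG CTGGAGGAGG CTTCGTCTGC ACTGCTCAAG"
seq = join_blocks(first_line)

print(len(seq))
print(seq.startswith("ATG"))
print(round(gc_fraction(seq), 2))
```

Applied to the full gene sequence, the resulting length and GC fraction can be compared with the Gene Length and GC content fields of the record.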