Gene Tpen_1053 details


Gene Information

Locus tag: Tpen_1053
Symbol:
ID: 4600796
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermofilum pendens Hrk 5
Kingdom: Archaea
Replicon accession: NC_008698
Strand:
Start bp: 991094
End bp: 992287
Gene Length: 1194 bp
Protein Length: 397 aa
Translation table: 11
GC content: 67%
IMG OID: 639773831
Product: MoeA domain-containing protein
Protein accession: YP_920456
Protein GI: 119719961
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0303] Molybdopterin biosynthesis enzyme
TIGRFAM ID: [TIGR00177] molybdenum cofactor synthesis domain
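
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the CDS ends in a single stop codon; both are assumptions, although they agree with the reported values. The function names are illustrative and are not part of any database API.

```python
# Coordinates copied from the record above.
START_BP = 991094
END_BP = 992287


def gene_length(start: int, end: int) -> int:
    """Length in bp of a closed, 1-based interval [start, end] (assumed convention)."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


if __name__ == "__main__":
    n = gene_length(START_BP, END_BP)
    print(n, protein_length(n))  # prints "1194 397"
```

Both results match the Gene Length (1194 bp) and Protein Length (397 aa) fields of the record.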


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.304827
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGCACAGCG TCGGCAGGGT TCTCGGGGAG GTTTCCGGCC TCCTCTCCCG GCCGCCCGCA 
GAGAGCGTGC CGGTCCTCGA CTCTCTCGGT AGGTACTCCG CCGAGACCGT AGTGTCGTCC
TTCAAGCTCC CCCCTGCCCC GAAGAGCGTG GTAGACGGCT ACGCCGTCAG AGCTGAAGAC
GTGGAGCCCG CGTCTCCGGG CGCCCCGGTT ACCCTGAGGT TGCTGGAAGG AGTCCTGAGG
CCCGGCTCAA CCGAGGGGTT CGAGTTGCCG AGGGGCTCCG CGGTAAGGGT TGAGACGGGC
GCTCTTCTAC CGGTGGGCGC GGACGCCGTC GTGCCCGTCG AGGACGCCTT GGAGGAGGAC
GGCAGGGTCC ACCTGTTCAG GAGGGTCGCG AGGTACGAGA ACGTCTCCCT GCCGGGCGAG
GAGTACGAGG AGGGAGTCCC CATAGTTAGG GTGGGCGACC GCATCCAGCC GCACCACCTC
TCGGCCCTCG TGCTCGAGGG GAGGAGCCAC GTGAACGTGT TCAGAGTCGA GGCGAGCATC
CTCAACGTGG GCGACGAGAT AGTGGGGGGC ACGTACTTCA GGCCGTTCAC GCACTTCCTC
GTAGCCTCCT GGCTGAGGAG CCTGGGCTTC AGGGTGACCG ACGTCTCCGT GGCCCCCGAC
TCCCCCGAGG CAGTGGCGGA GTGGGCGGGG AGCAGGGGTG AGTGGCTCGT CGTGATCCTA
GGCGGGACCT CGATGGGTGG GCACGACTTC ACCGTTAAGG CGCTCGAATC CCTAGGGCCC
GAGTACATCG TGCACGGGCT CGCGCTTCAA CCGGGCAAAA CGGCTTGCGT AGCCGTGAAG
GGCGGCCGCC TCTACCTCGC AGCTAGCGGG CTCCCCGTGG CAGCCCTCTC CACGCTCGAG
GTCTTCCTGA GGCCCCTCCT CAGACGCGTA GGCCTGAAGG TCCCGCTACT CCCGAGGGTG
AAGGCGAGGC TAACGAGGAG GATCACCGTC AAGGCCGGCG TGGTCGGGTT CGCCAGGGTC
AGGGTGTACA GGGAGGGAGG CACCCTTCTA GCTGAGCCTG TCATGCTGGG CGGCTCCGGG
GCGCTTGCGA GCCTTTTGAG GGGCAACGGC TACGTGATCG TGCCGGAGGG CCTAGAGGGC
TACGACGAGG GAGAAGAGGT CGAGGTACAC CTCTACGGGG AGGTCGAGGA GTGA
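
The gene sequence above is printed in space-separated blocks of ten bases, so the spacing has to be removed before any computation. A minimal sketch of a GC-content calculation follows, assuming the record's GC content field is the fraction of G and C bases over the full gene sequence; that definition is an assumption, and `gc_content` is an illustrative helper name.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a sequence, ignoring whitespace and case."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


if __name__ == "__main__":
    # First row of the gene sequence, copied verbatim with its block spacing.
    first_row = "TTGCACAGCG TCGGCAGGGT TCTCGGGGAG GTTTCCGGCC TCCTCTCCCG GCCGCCCGCA"
    print(f"{gc_content(first_row):.0%}")
```

Passing the full 1194 bp sequence instead of the first row gives the value to compare with the GC content field.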
 
Protein sequence
MHSVGRVLGE VSGLLSRPPA ESVPVLDSLG RYSAETVVSS FKLPPAPKSV VDGYAVRAED 
VEPASPGAPV TLRLLEGVLR PGSTEGFELP RGSAVRVETG ALLPVGADAV VPVEDALEED
GRVHLFRRVA RYENVSLPGE EYEEGVPIVR VGDRIQPHHL SALVLEGRSH VNVFRVEASI
LNVGDEIVGG TYFRPFTHFL VASWLRSLGF RVTDVSVAPD SPEAVAEWAG SRGEWLVVIL
GGTSMGGHDF TVKALESLGP EYIVHGLALQ PGKTACVAVK GGRLYLAASG LPVAALSTLE
VFLRPLLRRV GLKVPLLPRV KARLTRRITV KAGVVGFARV RVYREGGTLL AEPVMLGGSG
ALASLLRGNG YVIVPEGLEG YDEGEEVEVH LYGEVEE
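
The gene starts with TTG while the protein starts with M. Under translation table 11, TTG is an accepted alternative start codon, and an initiating codon is read as methionine. The sketch below illustrates this with a small hand-built codon table; the amino-acid string and start-codon set are written out from the NCBI definition of table 11 as understood here, and `translate_cds` is an illustrative helper rather than a library function.

```python
# Codon table for NCBI translation table 11, in the conventional TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Start codons permitted by table 11 (assumed from the NCBI definition).
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}

CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate_cds(seq: str) -> str:
    """Translate a CDS, reading the start codon as M and stopping at the first stop."""
    dna = "".join(seq.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    if not codons:
        return ""
    if codons[0] not in START_CODONS:
        raise ValueError(f"{codons[0]} is not a table 11 start codon")
    protein = ["M"]  # the initiating codon is translated as methionine
    for codon in codons[1:]:
        residue = CODON_TABLE[codon]  # raises KeyError for non-ACGT codons
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the gene sequence above.
    print(translate_cds("TTGCACAGCG TCGGCAGGGT TCTCGGGGAG"))  # prints "MHSVGRVLGE"
```

The output matches the first block of the protein sequence, MHSVGRVLGE.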