Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_1159 |
Symbol | |
ID | 4602161 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 1098599 |
End bp | 1099894 |
Gene Length | 1296 bp |
Protein Length | 431 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 639773935 |
Product | dihydroorotase, multifunctional complex type |
Protein accession | YP_920560 |
Protein GI | 119720065 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0044] Dihydroorotase and related cyclic amidohydrolases |
TIGRFAM ID | [TIGR00857] dihydroorotase, multifunctional complex type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.120119 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGTAGAG CGTTCGACCT CGTGGTCACC GGGAAGGCCT ACATCGGCGG CAGGGTAGTC GAGGCGTCCA TAGGCGTTGA GGACGGCAGG ATAGCGGCTG TGTCCTCGCC GGCTCTGGCG GGGAGCGCCG AGGAGAGAAT CGAGCTGGGC AAAGGCTACC TCGTGCTCCC GGGCATGGTG GACATACACG TGCACATGCG GGAGCCGGGG CAGGAGTACA AGGAGGACTG GCGCACCGGG TCGCGGGCCG CCGTCAAGGG GGGAGTGACC TTCGTGGCAG ACATGCCTAA CAACAAGCCT CCAGCTAACA CCTGCGAGAG GCTCGCCGAG AAGCTCAGAA GGGCGGGGGA GAAGTCCCTC GTAGACTTCG CGTTCTATGC GGGCTTCTCG GAGAACCCCG AGGAGTTGCT ACGGTGCCCG GAGCTCTTCG TGGGCGGGAA GCTCTACCCA GAGGAGCTCT TTTCGCGGTC AGCCGCTTAC TTCGCGCGCC TCATGGCGAA GCTCGGGAAG CCGCTCGTAG TGCACGCGGA GGATCCGAGC ATGTTCCGCG AGTCCAGGGG CTTGCCCCAC AGCTACGCGA GGCCCCCCGA GGCCGAGCTG AGCGGAGTGA GGAGGGCCCT GAGGCTCTGC GGGGAGGCCG GGGCATGGGT ACATGTAACG CACGTGAGCA CTGGCGCCGC GGTGACGGAG CTCCTCTCGG CGAAGCTCTC GGGTCTCAAG GCCACGTTCG ACGTTACGCC CCACCACGCA CTCCTCACCG ACTCCCTCTA CTCGACTACT CTCAGCAGGA TCGCCAAGGT GAACCCGCCG CTGAGAGGCG AGGCGGACAG GTCGGCGGTC TACTCCGCGC TGGCGCGGGG GCTCCCGGAC GCGCTCGTAA CGGACCACGC GCCCCACTCG CCGGAGGAGA AGGCCTCGGA AGACCCACCC CCGGGCTTCC CGGGGCTCGA GCTCGCTCTC CACCTGCTAC TCAGCGAGGT GCTCGCCGGC AGGCTACCGC TCGGCGTTAT CGACCTCTAC AGCTCCAGGC CGGCGTCCCT CCTCGGCGTC GAAAAGGGGG CTATAGCCGT GGGGATGGAC GCTGACCTCG TGGTCGTTAA GAGGGAGGAG TGGGTTGTGA GGGGCGACGA GATGGTGTCG AAGGCCAGGT ACACCCCCTT CGAGGGCTGG CGCCTCTCGA CGAAGACGCA CGCAGTCTTC GTGCGCGGCA GGATGGTCTA CGCCGAGGGA GAGTTCTTCG AAGACGCGAG GGGCTCTCTG CGCGCCCCCC GGGGGGCTCC GGAGCGGTGG TGGTGA
|
Protein sequence | MGRAFDLVVT GKAYIGGRVV EASIGVEDGR IAAVSSPALA GSAEERIELG KGYLVLPGMV DIHVHMREPG QEYKEDWRTG SRAAVKGGVT FVADMPNNKP PANTCERLAE KLRRAGEKSL VDFAFYAGFS ENPEELLRCP ELFVGGKLYP EELFSRSAAY FARLMAKLGK PLVVHAEDPS MFRESRGLPH SYARPPEAEL SGVRRALRLC GEAGAWVHVT HVSTGAAVTE LLSAKLSGLK ATFDVTPHHA LLTDSLYSTT LSRIAKVNPP LRGEADRSAV YSALARGLPD ALVTDHAPHS PEEKASEDPP PGFPGLELAL HLLLSEVLAG RLPLGVIDLY SSRPASLLGV EKGAIAVGMD ADLVVVKREE WVVRGDEMVS KARYTPFEGW RLSTKTHAVF VRGRMVYAEG EFFEDARGSL RAPRGAPERW W
|
| |