Gene Tpen_1159 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_1159 
Symbol 
ID4602161 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp1098599 
End bp1099894 
Gene Length1296 bp 
Protein Length431 aa 
Translation table11 
GC content67% 
IMG OID639773935 
Productdihydroorotase, multifunctional complex type 
Protein accessionYP_920560 
Protein GI119720065 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0044] Dihydroorotase and related cyclic amidohydrolases 
TIGRFAM ID[TIGR00857] dihydroorotase, multifunctional complex type 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.120119 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGTAGAG CGTTCGACCT CGTGGTCACC GGGAAGGCCT ACATCGGCGG CAGGGTAGTC 
GAGGCGTCCA TAGGCGTTGA GGACGGCAGG ATAGCGGCTG TGTCCTCGCC GGCTCTGGCG
GGGAGCGCCG AGGAGAGAAT CGAGCTGGGC AAAGGCTACC TCGTGCTCCC GGGCATGGTG
GACATACACG TGCACATGCG GGAGCCGGGG CAGGAGTACA AGGAGGACTG GCGCACCGGG
TCGCGGGCCG CCGTCAAGGG GGGAGTGACC TTCGTGGCAG ACATGCCTAA CAACAAGCCT
CCAGCTAACA CCTGCGAGAG GCTCGCCGAG AAGCTCAGAA GGGCGGGGGA GAAGTCCCTC
GTAGACTTCG CGTTCTATGC GGGCTTCTCG GAGAACCCCG AGGAGTTGCT ACGGTGCCCG
GAGCTCTTCG TGGGCGGGAA GCTCTACCCA GAGGAGCTCT TTTCGCGGTC AGCCGCTTAC
TTCGCGCGCC TCATGGCGAA GCTCGGGAAG CCGCTCGTAG TGCACGCGGA GGATCCGAGC
ATGTTCCGCG AGTCCAGGGG CTTGCCCCAC AGCTACGCGA GGCCCCCCGA GGCCGAGCTG
AGCGGAGTGA GGAGGGCCCT GAGGCTCTGC GGGGAGGCCG GGGCATGGGT ACATGTAACG
CACGTGAGCA CTGGCGCCGC GGTGACGGAG CTCCTCTCGG CGAAGCTCTC GGGTCTCAAG
GCCACGTTCG ACGTTACGCC CCACCACGCA CTCCTCACCG ACTCCCTCTA CTCGACTACT
CTCAGCAGGA TCGCCAAGGT GAACCCGCCG CTGAGAGGCG AGGCGGACAG GTCGGCGGTC
TACTCCGCGC TGGCGCGGGG GCTCCCGGAC GCGCTCGTAA CGGACCACGC GCCCCACTCG
CCGGAGGAGA AGGCCTCGGA AGACCCACCC CCGGGCTTCC CGGGGCTCGA GCTCGCTCTC
CACCTGCTAC TCAGCGAGGT GCTCGCCGGC AGGCTACCGC TCGGCGTTAT CGACCTCTAC
AGCTCCAGGC CGGCGTCCCT CCTCGGCGTC GAAAAGGGGG CTATAGCCGT GGGGATGGAC
GCTGACCTCG TGGTCGTTAA GAGGGAGGAG TGGGTTGTGA GGGGCGACGA GATGGTGTCG
AAGGCCAGGT ACACCCCCTT CGAGGGCTGG CGCCTCTCGA CGAAGACGCA CGCAGTCTTC
GTGCGCGGCA GGATGGTCTA CGCCGAGGGA GAGTTCTTCG AAGACGCGAG GGGCTCTCTG
CGCGCCCCCC GGGGGGCTCC GGAGCGGTGG TGGTGA
 
Protein sequence
MGRAFDLVVT GKAYIGGRVV EASIGVEDGR IAAVSSPALA GSAEERIELG KGYLVLPGMV 
DIHVHMREPG QEYKEDWRTG SRAAVKGGVT FVADMPNNKP PANTCERLAE KLRRAGEKSL
VDFAFYAGFS ENPEELLRCP ELFVGGKLYP EELFSRSAAY FARLMAKLGK PLVVHAEDPS
MFRESRGLPH SYARPPEAEL SGVRRALRLC GEAGAWVHVT HVSTGAAVTE LLSAKLSGLK
ATFDVTPHHA LLTDSLYSTT LSRIAKVNPP LRGEADRSAV YSALARGLPD ALVTDHAPHS
PEEKASEDPP PGFPGLELAL HLLLSEVLAG RLPLGVIDLY SSRPASLLGV EKGAIAVGMD
ADLVVVKREE WVVRGDEMVS KARYTPFEGW RLSTKTHAVF VRGRMVYAEG EFFEDARGSL
RAPRGAPERW W