Gene Htur_3951 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHtur_3951 
Symbol 
ID8744579 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaloterrigena turkmenica DSM 5511 
KingdomArchaea 
Replicon accessionNC_013744 
Strand
Start bp211817 
End bp213154 
Gene Length1338 bp 
Protein Length445 aa 
Translation table11 
GC content66% 
IMG OID646514532 
Productamidohydrolase 
Protein accessionYP_003405479 
Protein GI284167201 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0044] Dihydroorotase and related cyclic amidohydrolases 
TIGRFAM ID[TIGR00857] dihydroorotase, multifunctional complex type 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.825589 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGGATC GTCTCATCGA GGGCGCGCGT ATCGTCAGCG CGACCGGCGT TCGGGAAGGA 
GCTATCGCGA TCGCCGACGG CCAGATACGG GCGGTCGGCC CGGACGTTGC GAGCGAACAC
GGGGACGCAA CCGACGTGAT CGACGCCGCG GGGATGGTCG CCCTTCCGGG CGTCGTAGAC
GTCCACAATC ACCTGCACGA TCCGGAACTG TTTCCAGAGG GCATCGACTT CGCGTCACAG
ACGGCGAGCG CCGCGGCCGG CGGGGTGACG ACGGTCGTCG AACTCCCGAC GCAGACCCCG
ATCACCACGC CGGAAGCGTT CCGAAAGAAA CGAGAGGCGT GTGCCGACCT CGCTCACGTC
GACTTCGGAC TGGTCGCGGG GAACGTCGAA GCGCCGGACG TTGACGTCGA GAGAATCATG
GCGGAGGGAA CGCGCGACTT CAAGACGTTC ACGGCCGAGC CGTACCGCGC TTCCGACGGC
GCGATCGTGT CCCTGATGGA GGACGTCGGA AACACCGGCG GGAAAGTCCG CGTTCACTGC
GAGACGCAGG GTATTCTCGA CCACGCCCGA GAGTCGATCG ACGGGAACAC GCCCGACGTG
TACATGGATT CCCGCCCCCT CGAGGCGGAG CTCGACGCTA TCAACCGCAT GGGATGGTTC
GCCGAGTACG CCGACTGTCC CCTGCACGTC GTCCACGTTT CCAGCGGGAG CGGGGCCCGC
GAGGGCGGGC GCTTCAAATC CCGGGCGAAC GTTCCGGTGA CGCTCGAGAC CTGTCCGCAT
TACCTCGCGT TTTCGAAGGA CGACGTCGAG GAGAAGGGGC CGTTCCTGAA AGTCAATCCG
AGTCTCAAAT CACCCGCTGA GGTCGACCGC CTCTGGGACG CGGTCCGGGA CGGGACGATC
GATCTCGTCG CCAGCGAGCA CTTCCCTACC TATCGGGACG ACCGGGAGCG CGGCTGGGAG
AACATCTGGG AGCCGTACGC CGGGCTGCCG AGCATCGAAA CGATGCTCGA ATTCCTTGTC
AGCGCCGGCG TTCACGAGGA CCGGCTCTCC TGGACGCGGC TTCGCGAACT CGTCTGCTCG
CGGCCGGCGC GCGAGGCCGG CATCTACCCG TGCAAGGGCT CGCTTCGGGA AGGGACCGAC
GCGGACGTCG TGCTCGTCCG CGAAGAGAAG TTCACGGTTT CGGCCGACGA CCTTCAGTAC
GTCGGCGGCT GGACGCCATA CGAAGGTCGG GAGTGGAGCG CGCGCGTCGA CACGGTTATC
GCGGACGGTG ATATTATCGC CCGCGATCAC GAAATCGACT CCTCGCCCGG TCGAGGAACG
TTTCTCGCGC GGCCGTAG
 
Protein sequence
MTDRLIEGAR IVSATGVREG AIAIADGQIR AVGPDVASEH GDATDVIDAA GMVALPGVVD 
VHNHLHDPEL FPEGIDFASQ TASAAAGGVT TVVELPTQTP ITTPEAFRKK REACADLAHV
DFGLVAGNVE APDVDVERIM AEGTRDFKTF TAEPYRASDG AIVSLMEDVG NTGGKVRVHC
ETQGILDHAR ESIDGNTPDV YMDSRPLEAE LDAINRMGWF AEYADCPLHV VHVSSGSGAR
EGGRFKSRAN VPVTLETCPH YLAFSKDDVE EKGPFLKVNP SLKSPAEVDR LWDAVRDGTI
DLVASEHFPT YRDDRERGWE NIWEPYAGLP SIETMLEFLV SAGVHEDRLS WTRLRELVCS
RPAREAGIYP CKGSLREGTD ADVVLVREEK FTVSADDLQY VGGWTPYEGR EWSARVDTVI
ADGDIIARDH EIDSSPGRGT FLARP