Gene Hmuk_1771 details

Gene Information

Locus tag: Hmuk_1771
Symbol: trpD
ID: 8411296
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halomicrobium mukohataei DSM 12286
Kingdom: Archaea
Replicon accession: NC_013202
Strand:
Start (bp): 1692879
End (bp): 1693877
Gene length: 999 bp
Protein length: 332 aa
Translation table: 11
GC content: 69%
IMG OID: 645020100
Product: anthranilate phosphoribosyltransferase
Protein accession: YP_003177592
Protein GI: 257387819
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0547] Anthranilate phosphoribosyltransferase
TIGRFAM ID: [TIGR01245] anthranilate phosphoribosyltransferase
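
The length fields above agree with the coordinates if the start and end positions are read as 1-based and inclusive, and if the CDS ends with a single stop codon. Both readings are assumptions; the record does not state them. A minimal Python sketch of that arithmetic, with illustrative function names:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS that ends with one stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(1692879, 1693877)
print(length, protein_length(length))
```

With the values from this record the sketch prints 999 332, matching the gene length and protein length fields.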


Plasmid Coverage information

Number of covering plasmid clones: 32
Plasmid unclonability p-value: 0.983535
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Number of covering fosmid clones: 24
Fosmid unclonability p-value: 0.303854
Fosmid hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAGGAAT ACATCGAACG CGTGACCGAG GGAGAGGATC TGACACAGGA CGAGGCTCGC 
GAGGCAGCGA CGGCGGTCTT CGAGGACGCC ACGGAGGCCC AGATCGGGGC ACTGCTGGCG
GGTCTGCGAG CGAAAGGGGA GACCGAAGCG GAGATCGCCG GCTTCGCCGA GGGGATGCGC
GACGCCGCGC GGACGATCGA CCCCGACCGG CGGCCGCTGG TCGACACCTG TGGCACCGGC
GGCGACGACT ACGACACGAT CAACGTCTCG ACGACGAGCA CGATGGTCGC CGCGGGTGCC
GGCGTCCCCA TCGCCAAGCA CGGTAACTAC TCAGTCTCCT CCTCGTCGGG GAGCGCGGAC
GTGCTGGAGG TGGCCGGCGT CGACGTGGAG GCCGAACCGC CACAGGTCGA GCAGGCCATC
GAGGACGACG GGATCGGGTT CATGCTCGCG CCCGTCTTCC ACCCGGCGAT GAAGGCCGTC
ATCGGCCCGC GCAAGGAACT CGGCATGCGG ACCATCTTCA ACATCCTCGG ACCGCTGACC
AACCCCGCCG GTGCGGACGC GCAGGTGCTC GGCGTCTACG ATCCGGCCCT CGTGTCGACG
ATCGCGGAGG CACTGGCTCG GATGGACGTC GAGCGAGCGA TGGTCGTCCA CGGATCGGGC
CTCGACGAGA TCGCGATCCA CGGAGAGACC GTCGTCGCAG AGGTCACCGG TTCCGAGATC
GAGGAGTACA CGCTCGTCCC GGAGGACATC GGACTGACGA CGGCCGACAT CGAAGACGTG
GCCGGCGGCA CGCCCGAAGA AAACGCCGAG GACCTGCGAG GGATCGTCGA GGGGACCGTC
ACCGGACCGA AACAGGACAT CATCCTCGCG AACGCGGGCG CGGCGATCTA CGTCGCCGGT
GAAGCCGACA GCCACGAGGC CGGCGTCGAG GCGGCTCGCG AGGCGATCGA GTCCGGCGAC
GCCGCCCGGA AGTTCGACGA GCTCAGAGGC GAGGCATGA
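
A GC content figure like the one listed under Gene Information can be recomputed from a sequence printed in this layout. The sketch below is one hedged way to do it in Python: it assumes the sequence uses only plain A, C, G, T characters, and the helper name `gc_content` is illustrative. The sample is just the first line of the gene sequence, so its result is not the whole-gene value.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C bases; whitespace between blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First line of the gene sequence above, used only as a short sample.
sample = "ATGCAGGAAT ACATCGAACG CGTGACCGAG GGAGAGGATC TGACACAGGA CGAGGCTCGC"
print(f"{gc_content(sample):.0%}")
```

Passing the full 999 bp sequence instead of the sample gives the figure to compare against the GC content field.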
 
Protein sequence
MQEYIERVTE GEDLTQDEAR EAATAVFEDA TEAQIGALLA GLRAKGETEA EIAGFAEGMR 
DAARTIDPDR RPLVDTCGTG GDDYDTINVS TTSTMVAAGA GVPIAKHGNY SVSSSSGSAD
VLEVAGVDVE AEPPQVEQAI EDDGIGFMLA PVFHPAMKAV IGPRKELGMR TIFNILGPLT
NPAGADAQVL GVYDPALVST IAEALARMDV ERAMVVHGSG LDEIAIHGET VVAEVTGSEI
EEYTLVPEDI GLTTADIEDV AGGTPEENAE DLRGIVEGTV TGPKQDIILA NAGAAIYVAG
EADSHEAGVE AAREAIESGD AARKFDELRG EA
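
The protein sequence can be checked against the gene sequence by translating it codon by codon. The Python sketch below assumes the codon-to-residue assignments of the standard genetic code, which translation table 11 shares (table 11 differs in which codons may act as starts). It does not handle alternative start codons, and that does not matter here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in TCAG order (first, second, third base); '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# Last line of the gene sequence; it is in frame because 960 bases precede it.
tail = "GCCGCCCGGA AGTTCGACGA GCTCAGAGGC GAGGCATGA"
print(translate(tail))
```

For this fragment the sketch prints AARKFDELRGEA, which matches the end of the protein sequence above.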