Gene Hmuk_1769 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHmuk_1769 
Symbol 
ID8411294 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalomicrobium mukohataei DSM 12286 
KingdomArchaea 
Replicon accessionNC_013202 
Strand
Start bp1690610 
End bp1692229 
Gene Length1620 bp 
Protein Length539 aa 
Translation table11 
GC content70% 
IMG OID645020098 
Productanthranilate synthase component I 
Protein accessionYP_003177590 
Protein GI257387817 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID[TIGR01820] anthranilate synthase component I, archaeal clade 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.498968 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCCTCG ATATCGACCG CGAGACGTTC GTCGACCGAG CGACCGCCGA CGGACCGGTC 
TCCGTGCGAG CGGTCGCGGA GCTGTCGGCC GACGTGGAGC CGCTGGCGGC CTACGCCGCG
CTGACCGGCC GGACCGCGGA CGGTGCCCCC GGCAAAGGCG GCCACACCTT CCTGCTCGAA
AGCGCGGAGA AGGTCGCCTC CAGCGACCCA GACGGCGCGT TCGCACCGGC GACCGACGAC
CGCCACGCCC GCTACTCGTT CGTCGGCTAC GACGCCGACG CGGTGATCAC GGTCGACGAC
GAGACCAGCG TCGAGGTCTT CGACGACCGG ATCGCCGACC TCGTGACGAC CGACGGGGGC
GACGTGTTAG ACGATCTCCG GGCGGCCATG CCGGACGTGG AGCTGCGGGG CTTCCCGGAC
CACGATCGCC AACACCTCGA CGGCGGCCTC GTCGGCTTCC TGGCCTACGA CGCCGTCTAC
GACCTCTGGC TCGACGAGGT CGGACTCGAC CGGCCGGACT CTCGCTTCCC CGACGCACAG
TTCGCGCTGA CGACGACGAC GCTGCGCTTC GACCACGTCG AGGAGACGGC CGCGCTCGTG
TTCACTCCGA TCGTCCGACC CGGTGAGGAT CCCCGCGCAC GCTACGACGC GCTCCTCGAC
GAGGTCGAGC GGGTCGAGGC CGTCCTCGAC GGAGCCACGG ACCTCGAGAC CGGCAGCTTC
GTCCCCGCCC ACGAAGAGGC CGGGCCGCGC GACGAGTACG AAGACGCCGT CGAGCACGCC
AAGGAGTCCG TTCTCAGCGG CGACATCTAC CAGGGCGTCA TCTCGCGCAA GCGGGAACTG
TACGGTGAGA TGGACCCGCT GGCGCTGTAC GAGTCGCTGC GGGCGGTCAA CCCGTCGCCG
TACATGTACC TGCTCGACTA CGACGGGCTG AGCATCGTCG GCGCGAGCCC CGAGACGCTC
GTCTCCGTGG CCGACGACCG GATCGTCTCG AATCCGATCG CGGGGACCTG TCCGCGCGGG
ACGAGTCCGG TCGAGGACCG CCGCCTCGCG GGCGAGATGC TCGCCGACGA CAAGGAACGG
GCCGAGCACA CGATGCTGGT CGACCTCGCG CGCAACGACG TGCGTCGGGT CGCAGAGCCC
GGGAGCGTCC GCGTCGAGGA GTTCATGAAC GTCCTCAAGT ACAGCCACGT CCAGCACATC
GAATCGACCG TGACCGGCAC GCTCGACGAG GGAGCGGACG CCTTCGACGC CGCGCGCGCC
ACGTTTCCGG CGGGGACGCT CTCGGGTGCG CCCAAGATCC GGGCCATGGA GATCATCGAC
GAACTGGAGC GGTCGCCACG CGGGGTCTAC GGCGGCGGCG TCGGTTACTT CTCCTGGACC
GGCGACACGG ACTTCGCGAT CGTGATCCGG TCGGCGACGG TCGAGGACTG TGATACCCCA
GCAGGCGTCG ATGGCGACCG GACCCAGCGC ACGACCGTCC AGGCCGGTGC GGGAATCGTC
GCCGACTCCG TTCCCGGGTC GGAGTACGAG GAGACCGAGA ACAAGATGGG CGGCGTGCTA
GACGCCGTAG AGCGCATCCG AGCACCCGCG GCCGACGCGA ACGAGGAGGT GTCCCGATGA
 
Protein sequence
MTLDIDRETF VDRATADGPV SVRAVAELSA DVEPLAAYAA LTGRTADGAP GKGGHTFLLE 
SAEKVASSDP DGAFAPATDD RHARYSFVGY DADAVITVDD ETSVEVFDDR IADLVTTDGG
DVLDDLRAAM PDVELRGFPD HDRQHLDGGL VGFLAYDAVY DLWLDEVGLD RPDSRFPDAQ
FALTTTTLRF DHVEETAALV FTPIVRPGED PRARYDALLD EVERVEAVLD GATDLETGSF
VPAHEEAGPR DEYEDAVEHA KESVLSGDIY QGVISRKREL YGEMDPLALY ESLRAVNPSP
YMYLLDYDGL SIVGASPETL VSVADDRIVS NPIAGTCPRG TSPVEDRRLA GEMLADDKER
AEHTMLVDLA RNDVRRVAEP GSVRVEEFMN VLKYSHVQHI ESTVTGTLDE GADAFDAARA
TFPAGTLSGA PKIRAMEIID ELERSPRGVY GGGVGYFSWT GDTDFAIVIR SATVEDCDTP
AGVDGDRTQR TTVQAGAGIV ADSVPGSEYE ETENKMGGVL DAVERIRAPA ADANEEVSR