Gene Hmuk_2640 details


Gene Information

Locus tag: Hmuk_2640
Symbol:
ID: 8412190
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halomicrobium mukohataei DSM 12286
Kingdom: Archaea
Replicon accession: NC_013202
Strand:
Start bp: 2530689
End bp: 2531936
Gene Length: 1248 bp
Protein Length: 415 aa
Translation table: 11
GC content: 67%
IMG OID: 645020985
Product: chorismate synthase
Protein accession: YP_003178453
Protein GI: 257388680
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0082] Chorismate synthase
TIGRFAM ID: [TIGR00033] chorismate synthase
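
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the database record: it assumes the Start bp and End bp values are 1-based and inclusive, and that the gene length includes one terminal stop codon. The function names `span_length` and `coding_residues` are hypothetical helpers introduced only for illustration.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2530689
END_BP = 2531936
GENE_LENGTH_BP = 1248
PROTEIN_LENGTH_AA = 415


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_residues(gene_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming its length includes one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for this record under the stated assumptions.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_residues(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the span is 2531936 - 2530689 + 1 = 1248 bp, and 1248 / 3 - 1 = 415 residues, matching the listed gene and protein lengths.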


Plasmid Coverage information

Num covering plasmid clones: 35
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 35
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACGGGA ACGAGTTCGG TCGGCTCTTT CGGCTGACCA CCTTCGGCGA GAGCCACGGG 
GATGCGATGG GTTGTACGGT TTCAGGTGTG CCGGCGGGCG TCGAACTGTC CGAGGAAGCG
ATTCAGGAAG ATCTCGACCG GCGCAAGCCC GGTCAGTCGA TGATCACGAC CTCGCGGGGC
GAGCCCGACA AGGTGTCGAT CAAGTCCGGA CTGCAGGACG GCTACACGAC GGGAACGCCG
ATCGGCATGG TCATCCAGAA CAAAGACGCC CGATCGGGGA AGTACGAGCC CTTCATCACG
GCACCGCGGC CCTCTCACGG CGACTACACC TACTCGGCGA AGTTCGGCAC GCGCAACTGG
GGCGGTGGCG GCCGCTCGTC GGCCCGCGAG ACGGTCAACT GGGTCGCTGC CGGCGGCGTC
GCCAAGCAGG TCCTCGCACA GTCTGACTAC GACGTGCAGA TCAAGGCTCA CGTCTGCCAG
ATCGGCGACG TGGTTGCCGA CGACGTGACC TGGGAGGAGA TGCTCGAACA CAGCGAGGAC
AACGAAGTCC GCTGTGGCGA TCCCGACGCC GCCGAGGAGA TGCGCGACCT CGCGGACGAG
TACCAGAAGG AGGGCGACTC GATCGGCGGC GCGATCTACT TCGAGTGTCG CGGCGTTCCG
CGGGGCCTCG GTGCGCCGCG GTTCGATTCG ATACCCGCAC GCCTCGGGCA GGCGATGTAC
TCCATCCCCG CAGTCACGGA CTTCGAGCTG GGGATCGGGC GCGATGCTCG GACGGCCACC
GGGACCGACT ACACCGAAGA CTGGGAGTTC GGCGAGAGCG AGGCGACAGC CTCGGAAAAC
GCGAGCGGCG ACGAGCCGCG AGCGAGAGGC GACCCGAAGC CAGTCGGCAA CGACCACGGC
GGCATCCAGG GCGGGATCAC GACCGGCGAC CCGATCTACG GCGAGGTCAC CTGGCACGCG
CCGGTCTCGT TCCCGAAGAC CCAGGAGACC GTCGACTGGG AGACCGGCGA GAGAAAGGAG
ATAACGGTGA CGGGGCGACA CGACCCCGTC CTCCCGCCGC GGGCGGTCCC CGTCGTCGAA
GCGATGCTGT ACTGTACGGT GCTGGACTTC ATGCTGCTCG GTGGCCGGAT CAACCCGGAC
CGGCTCGACG ACCGGCCCGG CGAGTACGAC ACCGACTACC ACCCGTCGAG CCCGCGGAAC
GATCCCGAAG ACGCCGACAC GCACGCGACG ACCGTCGACG AGGACTGA
 
Protein sequence
MNGNEFGRLF RLTTFGESHG DAMGCTVSGV PAGVELSEEA IQEDLDRRKP GQSMITTSRG 
EPDKVSIKSG LQDGYTTGTP IGMVIQNKDA RSGKYEPFIT APRPSHGDYT YSAKFGTRNW
GGGGRSSARE TVNWVAAGGV AKQVLAQSDY DVQIKAHVCQ IGDVVADDVT WEEMLEHSED
NEVRCGDPDA AEEMRDLADE YQKEGDSIGG AIYFECRGVP RGLGAPRFDS IPARLGQAMY
SIPAVTDFEL GIGRDARTAT GTDYTEDWEF GESEATASEN ASGDEPRARG DPKPVGNDHG
GIQGGITTGD PIYGEVTWHA PVSFPKTQET VDWETGERKE ITVTGRHDPV LPPRAVPVVE
AMLYCTVLDF MLLGGRINPD RLDDRPGEYD TDYHPSSPRN DPEDADTHAT TVDED