Gene Hmuk_3038 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHmuk_3038 
Symbol 
ID8412591 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalomicrobium mukohataei DSM 12286 
KingdomArchaea 
Replicon accessionNC_013202 
Strand
Start bp2925857 
End bp2927152 
Gene Length1296 bp 
Protein Length431 aa 
Translation table11 
GC content67% 
IMG OID645021385 
Producttryptophan synthase subunit beta 
Protein accessionYP_003178850 
Protein GI257389077 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0133] Tryptophan synthase beta chain 
TIGRFAM ID[TIGR00263] tryptophan synthase, beta subunit 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.25263 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.0776736 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTAGCG AATCCAAATT CGGCGACTAC GGCGGACAGT TCGTACCCGA GGCGTTGATG 
CCGGCCATCG AGGAGCTGAC CGACGCCTAC GAGCGGTACG TGATCGACAA CGAGGACGGC
TTCATGGACG AGTTCCGTCG GCGGCTGGCG GACTTCGGCG GCCGACCCCT GCCCTTACAG
CGGGCCGACC AGCTCTCGGC GCGCTACGAC ACCGAGGTGT ACCTCAAGCG AGAGGACCTG
CTCCACGGCG GGGCTCACAA GCTCAACAAC GCGCTCGGGC AGGTCCTCCT GGCGAAGTAC
ATGGGCAAGG AGCGTATCAT CGCGGAGACC GGTGCCGGGC AACACGGCAC GGCGACGGCG
ATGGCCGCGG CCCACCTCGA CATGCCCTGC GAGATCTACA TGGGCGAGAC CGACATCGCC
CGCCAGCGGC CCAACGTCTT CCGGATGCGG ATCAACGACG CCGAGGTGAA TCCGGTGACG
ACCGGCCGCG GGACGCTCAA AGAGGCGATC AGCGAGACGA TGCGGGACTG GGCGACCACC
GTCGAGACGA CCCACTACGT CATCGGGTCG ATCGTCGGCC CGGCTCCGTT CCCGGCGATG
GTCCGGGACT TCCAGTCGGT CATCAGCGAG GAGGCCCGCG AGCAGATCCA GGAGCAGACC
GGCGGCCTCC CGGACGCCGT CCTGGCCTGT GCGGGTGGCG GCTCGAACAC GATGGGCGCG
TTCCATCACT TCGTTCCGGA CGACGACGTG GACCTCTACG CCGTCGAGGC CGGCGGCTCC
TCGCTGTCGG TCGACGAGGA GGAAGGCGTC GCGCCCAACT CGGCGTCGCT GTCGACCGGC
GACGAGGGGG TACTCCACGG AGCGCGCACG AAGCTCCTTC AGGACTCTGA CGGCCAGATC
ATGGAGTCAC ACTCCGTCTC CGCGGGGCTC GACTACGCGG GCGTCGGTCC GGAACTGGCC
CACCTCGTCG ACGAGGGGCG CGTGACGCCG GTCAACGTCG ACGACGACAC CGCACTTGAA
GCGTTCCACC GGCTCTCCCA GGACGAGGGG ATCATCCCGG CGCTGGAGAC GGCCCACGCG
TTCGGCTATC TGGAGCGCGT GGCGGGACCC GCCACGGACG ATCGAGCGGC GGAGCCGCGA
GATGAGCACC ACGACGAACT GGGCGACACC GTGATCGTCA ACGTCTCTGG CCGGGGGGAC
AAGGACCTCG AAACCGTCAT CGAAGAGACC CAGAAGCGCG ATCTGGACAT CGCGCCGGAC
ATGTCGATCT TCAACGAGAT CGGGGGCGGG ATCTAG
 
Protein sequence
MSSESKFGDY GGQFVPEALM PAIEELTDAY ERYVIDNEDG FMDEFRRRLA DFGGRPLPLQ 
RADQLSARYD TEVYLKREDL LHGGAHKLNN ALGQVLLAKY MGKERIIAET GAGQHGTATA
MAAAHLDMPC EIYMGETDIA RQRPNVFRMR INDAEVNPVT TGRGTLKEAI SETMRDWATT
VETTHYVIGS IVGPAPFPAM VRDFQSVISE EAREQIQEQT GGLPDAVLAC AGGGSNTMGA
FHHFVPDDDV DLYAVEAGGS SLSVDEEEGV APNSASLSTG DEGVLHGART KLLQDSDGQI
MESHSVSAGL DYAGVGPELA HLVDEGRVTP VNVDDDTALE AFHRLSQDEG IIPALETAHA
FGYLERVAGP ATDDRAAEPR DEHHDELGDT VIVNVSGRGD KDLETVIEET QKRDLDIAPD
MSIFNEIGGG I