Gene Hmuk_2075 details


Gene Information

Locus tag: Hmuk_2075
Symbol:
ID: 8411611
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halomicrobium mukohataei DSM 12286
Kingdom: Archaea
Replicon accession: NC_013202
Strand:
Start bp: 1985609
End bp: 1986628
Gene Length: 1020 bp
Protein Length: 339 aa
Translation table: 11
GC content: 69%
IMG OID: 645020414
Product: Succinylglutamate desuccinylase/aspartoacylase
Protein accession: YP_003177895
Protein GI: 257388122
COG category: [R] General function prediction only
COG ID: [COG3608] Predicted deacylase
TIGRFAM ID:
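
The reported gene and protein lengths can be cross-checked against the start and end coordinates. The short Python sketch below does this under two assumptions that the record itself does not state: that the coordinates are 1-based and inclusive, and that the gene length includes the terminal stop codon.

```python
# Values copied from the Gene Information fields above.
start_bp = 1985609
end_bp = 1986628

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the gene length counts the stop codon, which is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under these assumptions the result is 1020 bp and 339 aa, which agrees with the Gene Length and Protein Length fields.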


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value: 0.763346
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACGAAG CGCGGCCGTT CACCTACGAC GGCGGACGGG TGGACCCCGG CGAGACCGAG 
AACGTCAGAT ACACGGTCAG CCAGACGTAT CTCGGCGATC CCGTCCGGAT GCCGGTCACG
ATCGTCAACG GCGAGGGCGC GGGGCCGAGG CTGTTTCTCG CGGCGGCGGC CCACGGCGAC
GAACTCAACG GCGTCGAGGT CGTCCGCGAG GTCGCCCACG AGTGGGATCT CGCCGACCTC
TGCGGGACGC TGGTCTGCTG TCCCGTCCTG AACGTTCCGG GGTTCATCGC CCAGCAGCGG
TACCTCCCGG TCTACGATCG GGACCTGAAC CGCTCGTTTC CGGGCCGAGA CGACTCGACG
GGCGCAAAGC GGATCGCCAA CGAGATCTTC CGGAACTTCG TTGCACCCTG TGACTACGGG
CTCGACTTCC ACACCTCCAC GCGCGGACGG ACCAACATGC TCCACGTGCG GGCCGACATG
GCGGACGAGC GGGTCGCGCG ACTGGCGAAG TCGTTCGCGA CGAACGTCGT CATCGCCTCG
GACGGTCCCG GGGGAACGCT GCGACGCGAG GCGACCGACG CGGGCGTCCC GACCGTCACC
GTCGAGATGG GGGAAGCCCA CCGCTTCCAG CGGGACCTGA TCGACCGAGC GCTCGTCGGC
GTCCGCTCGG TCCTGGCGGA GTTCGGCCTC CTGCCCAGCG CGTCCGTCCA CTGGCCGGGC
TGGCGGACGA TCATCGACGG GAGCGACGAG AAGACGTGGA TCCGGGCCGA CGCTGGGGGC
CTCGTCGACA TGCACTTCGA CCGCGGTGCC TTCGTCGAGG CGGGCGACCC GATCTGTACG
ATCGCCAACC CCTTCAAGAC CGATCGGACG GTCGTCGAAG CACCGTTCAC CGGACTGCTG
GTCGGGGTGC TGGAGAACCC GCTCGTCTAC CCCGGCAACC CGCTCTGTCA CCTCGTGAAA
CTCGACGACC GGACCCAGCG CGCGCTGGAG CGCAACGGCG GGATCGACGA TCGCGCGTGA
 
Protein sequence
MDEARPFTYD GGRVDPGETE NVRYTVSQTY LGDPVRMPVT IVNGEGAGPR LFLAAAAHGD 
ELNGVEVVRE VAHEWDLADL CGTLVCCPVL NVPGFIAQQR YLPVYDRDLN RSFPGRDDST
GAKRIANEIF RNFVAPCDYG LDFHTSTRGR TNMLHVRADM ADERVARLAK SFATNVVIAS
DGPGGTLRRE ATDAGVPTVT VEMGEAHRFQ RDLIDRALVG VRSVLAEFGL LPSASVHWPG
WRTIIDGSDE KTWIRADAGG LVDMHFDRGA FVEAGDPICT IANPFKTDRT VVEAPFTGLL
VGVLENPLVY PGNPLCHLVK LDDRTQRALE RNGGIDDRA
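
Both sequences above are displayed in space-separated blocks of ten residues, so they need to be normalized before any analysis. The following Python sketch is one possible way to do that, and to recompute the GC fraction and the translation. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical and not part of the source database. The codon table is written out for the standard genetic code, on the assumption that translation table 11 assigns the same amino acids to codons and differs only in which start codons are permitted.

```python
# Codon table in TCAG order: index = 16*first + 4*second + third.
# Assumption: translation table 11 shares these amino acid assignments
# with the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases (0.0 for an empty sequence)."""
    s = clean(seq)
    if not s:
        return 0.0
    return sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate from the first base, stopping before the first stop codon.

    A trailing incomplete codon is ignored.
    """
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        codon = s[i:i + 3]
        index = (16 * BASES.index(codon[0])
                 + 4 * BASES.index(codon[1])
                 + BASES.index(codon[2]))
        amino_acid = AMINO_ACIDS[index]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First two blocks of the gene sequence shown above.
print(translate("ATGGACGAAG CGCGGCCGTT"))  # MDEARP
```

The output for the first two blocks matches the first six residues of the protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` would allow the GC content and Protein Length fields to be checked in the same way.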