Gene Hmuk_3029 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHmuk_3029 
Symbol 
ID8412582 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalomicrobium mukohataei DSM 12286 
KingdomArchaea 
Replicon accessionNC_013202 
Strand
Start bp2916492 
End bp2918333 
Gene Length1842 bp 
Protein Length613 aa 
Translation table11 
GC content68% 
IMG OID645021376 
Productglycosyl transferase family 2 
Protein accessionYP_003178841 
Protein GI257389068 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0463] Glycosyltransferases involved in cell wall biogenesis 
TIGRFAM ID[TIGR00374] conserved hypothetical protein 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.586376 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.0326144 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGACG TGGAAGTCAG CGTCGTCCTG CCGGCCTACA ACGAGGCCGC GACCATCGAG 
GAGACGGTCG AGACGACGCT GTCGACGCTT GCAGCCTTCC TGCCAGCGGG GAGCTTCGAA
GTGATCGTCG CGGAGGACGG CTGTGAGGAC CGCACACCGG AGATCGCGAC CCGGATGGCC
GACGCCGACG AGCGCGTACG CCACGTCCAC TCCGACGAGC GACTCGGACG AGGCGGGGCG
CTGTCCTACG CTTTCCGGCA GGCGGAGGGC GAGACGCTGG TGTACTTCGA TACGGATCTG
GCGACGGACA TGCGCCACCT CGAAGAGCTG GTCGAGTCGG TCCGTTCGGG CGAGTACGAC
GTGGCCACGG GATCGCGCTG GCTCCCCGAG AACCGGGCCG ATCGCCCCGC GAAACGAGGG
GTGCCGAGTC TCGGCTACAA CACCCTCGTG CGGCTGTTCC TGCGATCGGA TCTGCAGGAC
CACCAGTGTG GCTTCAAAGC GTTCGATCGG GCGGCGGCGC TCGATCTCCT CGACGAAGTC
GAAGACGAAC ACTGGTTCTG GGACACGGAG CTGCTGGTCC GCGCCCAGCG CGAGGGCTAC
CGCGTCAAGG AGTTCCCGGT CGACTGGACG CCGAAGGGCG ACTCGAAGGT CGATCTCGTG
CGGGACGTGT TCGGGATGGG GAGCCAGATC GTCCGAACAT GGTGGCAGCT GTCGGTGAGC
CCACGGATCA CGCGGAAGGT GAGCATGACT GCCGGGTCGC TGCTGGTGAT CGCCGCGCTC
GTGCTCGCGG TGACGGTCGT CTTCGATCCG GCGGCCGTCC TGGACGCGAT TAGCGGGGCG
GACGGCGTCG TCGTCGCGTT CTCTGGCGTG GTGTACCTGC TGTCGTGGCC GCTGCGCGGA
CAGCGCTATC GGGACATCCT CGCTCGGCTG GGCCACGACA GCGACACGTG GTTCCTCACG
GGCGCGATCT TCATCAGTCA GACGGGGAAC CTCGTCTTTC CCGCCAGACT CGGCGACGGC
GTCCGGGCGT ACGTCGTCAA GGCCCGCCGG CAGATCCCGT ATCCGACCGG GTTCGCCTCG
CTGGCCGTCG AACGCGTCTT CGACCTGCTC GCGATCACGG TCCTGGCCGG GAGCGTCCTC
GTCGGTCTCG TCGTCACCGG CGGGACCGAT CAGGTCGCCC AGGCGATCGC GGCCGACGTA
CCCCCGGTGA CGATCGGAGA CGACACGCTC GATCCCGCCG CGGCGGCGCG GACGGCGCTC
CGGGTCGCCG CCGTCGTCGG AGCAGCAGCG ATTGCCGGCG TCGCCGTGAT CGTCGTCAGC
GCCCGCCGGG ACAGCGACCT CGTCCACCGT GCGGTCACCG CGCTCAGCAA CGACTCCTAC
GCCCAGTACG TCTCCGGGAT CGTCGAGCAG TTCGTCGGCG ACGTCGAGAC CGTCGTCGCC
GACCGGGGGG CCTTCCTCCG GGTCGGTGTC GGCAGCCTCG TCATCTGGAT CGTCGACGTG
TTGACGGCGG TCGTCGTCTT CGCTGCCTTC CCCAGCATCG AGCTCTCGCC GTCGCTGGTG
GCAGCCGCGT TCTTCGCGGT CAGCGTCGGT AACCTCGCGA AGATCCTCCC GCTGTCGCCG
GGCGGGATCG GCCTCTACGA GGGTGCCTTT ACCCTCATCG TCGTCGGGCT GACGACCGTT
ACGGGGCCGG TCGCACTCGC GATCTCGATC GTCGATCACG CCGTCAAAAA CGCCGTCACG
ATCGTCGGCG GCCTGGGATC GATGGCCTGG CTCAACGTCT CGCTGACGAC CGCGGTCGAA
GAGTCCCAGC AGTCCGGCGA AATCGAGCCG GAGGCCGACT AG
 
Protein sequence
MSDVEVSVVL PAYNEAATIE ETVETTLSTL AAFLPAGSFE VIVAEDGCED RTPEIATRMA 
DADERVRHVH SDERLGRGGA LSYAFRQAEG ETLVYFDTDL ATDMRHLEEL VESVRSGEYD
VATGSRWLPE NRADRPAKRG VPSLGYNTLV RLFLRSDLQD HQCGFKAFDR AAALDLLDEV
EDEHWFWDTE LLVRAQREGY RVKEFPVDWT PKGDSKVDLV RDVFGMGSQI VRTWWQLSVS
PRITRKVSMT AGSLLVIAAL VLAVTVVFDP AAVLDAISGA DGVVVAFSGV VYLLSWPLRG
QRYRDILARL GHDSDTWFLT GAIFISQTGN LVFPARLGDG VRAYVVKARR QIPYPTGFAS
LAVERVFDLL AITVLAGSVL VGLVVTGGTD QVAQAIAADV PPVTIGDDTL DPAAAARTAL
RVAAVVGAAA IAGVAVIVVS ARRDSDLVHR AVTALSNDSY AQYVSGIVEQ FVGDVETVVA
DRGAFLRVGV GSLVIWIVDV LTAVVVFAAF PSIELSPSLV AAAFFAVSVG NLAKILPLSP
GGIGLYEGAF TLIVVGLTTV TGPVALAISI VDHAVKNAVT IVGGLGSMAW LNVSLTTAVE
ESQQSGEIEP EAD