Gene Hmuk_2020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHmuk_2020 
Symbol 
ID8411551 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalomicrobium mukohataei DSM 12286 
KingdomArchaea 
Replicon accessionNC_013202 
Strand
Start bp1923262 
End bp1925187 
Gene Length1926 bp 
Protein Length641 aa 
Translation table11 
GC content69% 
IMG OID645020354 
Producthypothetical protein 
Protein accessionYP_003177840 
Protein GI257388067 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.356882 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTAGCG CGACGCCCCG CGAGCGACTC CCGGCGGAGC CGGACCTCCT CGTGGGGCTC 
GCCGGCCTCG CGGTCGCGGT CGTCCTCTTC CCGCTTCGAT TCCTCTCCGG ACAGCTGTTC
ATCCAGGTCC TCCCACCGGT GCTGGGCCTC GGGAGCCTGG TGTATCTCGT CGGACGCGTC
CGTGCTGGCG AGTCCGAGCG CGCGATCGAG CGCCGACGCG TCGGCGTCGA CGGGCGTGTC
GTCGCGGGCC TCACGGCGCT TGGGATCGGC GGGCTCGCGC TGTTTGGCGC GAGTGCCGGC
GGTCGGACGA CGGCGTTCCT GCTGGCCAGC GGGCTCGTCG GGACCGCGAT CCTCGCACAG
ATCCTGTTTC TCGACGACGA CGCACTCGAT CCCGGACTCG TGTTGGGCCA GTTGCTCGCG
CTGGCGCTCG TCGTCAGGTT CACCGCGTTG CTCTCGACGC CCGGACTGAT CGGCGTCGAC
AGCTGGACGC ACCTCACGGA CTACGCGGCG GCCATCCAGT CGACGGACTC GCTGTCGGCG
ATCGCCGACG TGAAGTACCG GACTGCGCCG CTCTTTCACG TCCTGGTGGT GATCGCGGCC
GACGCCATCG GTGTCGGCCT GCGAGCGGCC ACCTACGTCT CGATGGGGCT CGCGCTCCCG
CTGTCGACGC TGCTGGTGTA CGCGATCGGG ACGCTGCTGT TCGACCGCCG CTGGGCGTTG
CTGGCAGCGG GCCTGTTCGT GATGGCCGAT CACGTGATCC GGTGGGGCGT CCACATCATC
CCGACCAGCA TGGGGCTGGT CTTCTTCCTC GGAGCGCTGG CCGGAGCGAC GCGGCTCCTC
GCTGGCGACA CCAGACGCGC CACGTACGCG ATCGTCGTGG CGTTCGGGAT CGCGACCGCG
CTGACCCACC AGATCTCGGC GTTCATCCTG CTGGTGGTGC TGGGCGTCGG TGCCGTGGTG
GGGTCGCTCG GATCGGTGCT GCCCGGCGAC TTCGACCGCG CGGGGTCGCT GTGGCCGGTC
TTCGCGCTCG TCACCGCCTT CGTGGCAGCG CTCTGGTCGA TCACGCCGTA CCGAGACAGC
GTGTTCGCCC TGGAACTGCT CGACACCGTC GACCGGGCGA TCGCGACCTC CGTCGGCTTC
CTCAACCTCG CCGGGTCGGA TCCGGGCGGC GCGGGCGGTG CCAGCAGCGC TGCGGGCGTC
CCGATCGACG TCGCCTTCGC GGACGCGCTC GGCTTCTTCG CGCTGTTTTT CGCGGTCGTG
ATCGGGACCG TCGCCGTCTT CCGCCGACGC AACGCCACTC CGGCGACGGT GACCTACGCC
GCGGCCGCGG TGGTGCTGGC GACGTTTACC TTCGGACTCC CGCTGTTTGG CTTTACGACC
TTCCTGCCCG GCCGCTGGTA CGCGTTCATG TACGTCCCGA TGGCGCTGCT CGCGGCGCTT
GGCTGCCGGT TCGCGGTGCG CCGGCTCTCG CCACGGATGG CGATGACCGG CCTGCTCGTG
TTCGCGCTGG TGGTCCCCGG TGCGATGGCG ATCAACCACA AGGGGACCCC GGACAGCCCG
GTCTTCGACC AGGAGTACTC GACCTACGCC TACGACGAGA CGGAGCTGTC GGCCGTCGAG
ACGGTCGGAG CGACGCGGCC CGAAGTGGCC GATCCGGTCT ACACCGACCA CCCCTATCGG
ACGGTGTTCG AGCGCAGCGG TGCGACGCCG GCCAACATGC TGGCCGTCGA GGACGGCGAG
ATCTACCACG ACACCGTCGT CTACCGGAAG TACCAGTCGA CCGGCGCGCC GGTGCTCCTC
GTCGACAACG AGTCGCGGAC GCGCAGGGTC GCACCCTCGG AGGTGTGTCG CGAGGATATG
CACCGGCTCT ACGCGAACAG CAACGTCACC GTCTGTACCG GTATCGACGG GATAGACGGA
GCATAA
 
Protein sequence
MASATPRERL PAEPDLLVGL AGLAVAVVLF PLRFLSGQLF IQVLPPVLGL GSLVYLVGRV 
RAGESERAIE RRRVGVDGRV VAGLTALGIG GLALFGASAG GRTTAFLLAS GLVGTAILAQ
ILFLDDDALD PGLVLGQLLA LALVVRFTAL LSTPGLIGVD SWTHLTDYAA AIQSTDSLSA
IADVKYRTAP LFHVLVVIAA DAIGVGLRAA TYVSMGLALP LSTLLVYAIG TLLFDRRWAL
LAAGLFVMAD HVIRWGVHII PTSMGLVFFL GALAGATRLL AGDTRRATYA IVVAFGIATA
LTHQISAFIL LVVLGVGAVV GSLGSVLPGD FDRAGSLWPV FALVTAFVAA LWSITPYRDS
VFALELLDTV DRAIATSVGF LNLAGSDPGG AGGASSAAGV PIDVAFADAL GFFALFFAVV
IGTVAVFRRR NATPATVTYA AAAVVLATFT FGLPLFGFTT FLPGRWYAFM YVPMALLAAL
GCRFAVRRLS PRMAMTGLLV FALVVPGAMA INHKGTPDSP VFDQEYSTYA YDETELSAVE
TVGATRPEVA DPVYTDHPYR TVFERSGATP ANMLAVEDGE IYHDTVVYRK YQSTGAPVLL
VDNESRTRRV APSEVCREDM HRLYANSNVT VCTGIDGIDG A