Gene Hmuk_2147 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHmuk_2147 
Symbol 
ID8411686 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalomicrobium mukohataei DSM 12286 
KingdomArchaea 
Replicon accessionNC_013202 
Strand
Start bp2056479 
End bp2058518 
Gene Length2040 bp 
Protein Length679 aa 
Translation table11 
GC content63% 
IMG OID645020489 
Producttype II secretion system protein 
Protein accessionYP_003177967 
Protein GI257388194 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG1955] Archaeal flagella assembly protein J 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.0236165 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCTCG ACACGCGCGG CCAGCGACAG CTCTCGGGTG GCGCGCTCGG GGACACGTTC 
TATCCCCTCT TCCAGTGGCT GTTCAACGAG GACGGCGACT TCGTCAGGAA CGTCGAGAAG
AAGCTCGCAC AGGCCCGGAT GGCCGACAAC GTCGAGATGT TCCTCGCACG CGCGCTGGCG
ATCGGCGTCA TCTCGGGACT GGCGCTGTGG CTCGTGGGCA CGCTGATCGG CTACCTCGGC
GTCACCTTCC TGCTGGGCGG CACTGGAGCC GAGGCACCGA CGTTCATCGG GATCCCACTG
CCAGAGGGGG TCTCGGCCGC GCTCGACGTG ATCAAGATCC CGGCGCTGAT CCTGGTCACC
GGGCTGGTGT TCGGTGTCAT CGGCTTCGGT ATCGGCTTCG GATCGCTGGT CTCGATCCCG
TACTTCCGGG CCAACGCACG CGAGCGGGAG GTGAACGTCC TGCTGTCGGA CTCGATCTCG
TTCATGTACG CGCTGTCGGT CGGGGGGCTC AACCAGTTGG AGATTCTCCA GGCGATGGCC
AAGGCCGACG ACACCTACGG GGAGTGTGCC AAGGAGTTCC AGTCGATCGT CTTGGAGACG
GAGTACTTCG ATACTGACTA CCGGACCGCG ATCCGCAATC AGGCACTCGA GACGCCGTCG
GACGAGCTCT CGCAGTTCCT GACCGACATG CTCTCGATCA TCAACTCCGG CGGGGACATG
ACCTCCTTCC TCGAAGACCA GAAGGACAAG CACATGCGGA CCGCAAAGCA GGAACAGCAG
AAGATGCTCG ACACCCTCGA ACTGTTCGGG GAGATGTACA TGACCCTGTC GCTGTTTCCG
CTCTTGCTCA TCATCATCCT CGTCATCATG TCGATGATGG GCGACGCGCA GAACCGGCTC
CTCTATGGCA CGGTCTACGG GCTGATCCCG CTGACCGGTG CCGGCTTCCT CGTGCTCGTC
TCGACGGTGA CCCGAGACGA GGTCGGCGAC GGCTACTTGC GGCCCGACGG CAAGGACGAC
GACTTCGTCG TCGACGACGG GCTGGGCTTT CTGAACCTCG GACTGGTCGA GAACTACACC
GGCCAGTACA CGATCTTCGA CCGCATCAAG AGCCGCGAAG GGACCTACGA GTTCATGCAG
GTGCTCAAGC GCCCGGACCT GTTCTTCCGC GATCACCCGC TGTTCGTGCT CGGCGTGACG
GTCCCCGTGA CGATCGTCGC GTTGCTGCTG GTCGTCGTGT TCGATCTCGC ACCGATGAGC
CTCGACGGGA TGATCGCTCG ACCAGTGCTC GGGACGTTCT TCTGGGTGTA CGTGCCACTG
TACATCAACC TGCTCCCGCT AGCGATCTTC TACGAGTGGA ACGTCCGCTC GCGCAAGACG
ATCATCGGCA GTCTCTCGGA GAACCTCAGG AAACTCGCCA GCGCGAACGA CACGGGCATG
ACGCTGCTGG AGTCCGTACA GGTGGTCTCG ACGACATCGG GAGGCAAGCT CTCGGAGGAG
TTCGAGATCA TGCACGCGAA GGTCAACTAC GGCACCAGCC TCAAAGACGC CCTCAGGGAG
TTCAACAACA AGTACCACGT CCCGCGACTC GCCCGGACGG TCAAGCTCAT CAGCGAGGCA
CAGGAAGCGT CCAGCCAGAT CCAGAACGTG CTCTCGACGG CGGCACAGGC CTCGGAGAAT
CAGGACGACA TCGACCGCGA GCGGATCGCC CGGACCCGAA TGCAGGTCGT CATCATCCTC
ATGACGTACC TGACGCTGCT CGGCGTGATG GCACTGCTGA AGACTCAGTT CCTCGACGTG
ATGGCGGGAC TGTCCGAGAG CGCGGCCGGT GCGGGGGGAA GTGGCGCGAC CGGGCAGAGC
TTCGGCGGCA ACGTCGACAC GGACCTGCTC TCGTTGCTGT TCTTCCACGC CGTCACGCTG
CAGGCGCTCC TCTCGTCGTT CATCGCGGGG TACATCCGGG ACGTGAACAT CATCTCGGGA
GTGAAGTTCG CGGTGATCCT CCCGACGATC GCGCTCATCA CCTGGATCGC GGTCGGATAG
 
Protein sequence
MSLDTRGQRQ LSGGALGDTF YPLFQWLFNE DGDFVRNVEK KLAQARMADN VEMFLARALA 
IGVISGLALW LVGTLIGYLG VTFLLGGTGA EAPTFIGIPL PEGVSAALDV IKIPALILVT
GLVFGVIGFG IGFGSLVSIP YFRANARERE VNVLLSDSIS FMYALSVGGL NQLEILQAMA
KADDTYGECA KEFQSIVLET EYFDTDYRTA IRNQALETPS DELSQFLTDM LSIINSGGDM
TSFLEDQKDK HMRTAKQEQQ KMLDTLELFG EMYMTLSLFP LLLIIILVIM SMMGDAQNRL
LYGTVYGLIP LTGAGFLVLV STVTRDEVGD GYLRPDGKDD DFVVDDGLGF LNLGLVENYT
GQYTIFDRIK SREGTYEFMQ VLKRPDLFFR DHPLFVLGVT VPVTIVALLL VVVFDLAPMS
LDGMIARPVL GTFFWVYVPL YINLLPLAIF YEWNVRSRKT IIGSLSENLR KLASANDTGM
TLLESVQVVS TTSGGKLSEE FEIMHAKVNY GTSLKDALRE FNNKYHVPRL ARTVKLISEA
QEASSQIQNV LSTAAQASEN QDDIDRERIA RTRMQVVIIL MTYLTLLGVM ALLKTQFLDV
MAGLSESAAG AGGSGATGQS FGGNVDTDLL SLLFFHAVTL QALLSSFIAG YIRDVNIISG
VKFAVILPTI ALITWIAVG