Gene Hmuk_0207 details

Gene Information

Locus tag: Hmuk_0207
Symbol:
ID: 8409705
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halomicrobium mukohataei DSM 12286
Kingdom: Archaea
Replicon accession: NC_013202
Strand:
Start bp: 203552
End bp: 204679
Gene Length: 1128 bp
Protein Length: 375 aa
Translation table: 11
GC content: 62%
IMG OID: 645018532
Product: ABC transporter related
Protein accession: YP_003176051
Protein GI: 257386278
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3839] ABC-type sugar transport systems, ATPase components
TIGRFAM ID:
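
The start, end, and length fields above agree with one another if the coordinates are read as 1-based and inclusive, and if the coding sequence ends in a single stop codon. Both points are assumptions, since the record does not state its conventions. A minimal Python sketch of that consistency check, with hypothetical function names:

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length_bp(203552, 204679)
print(length)                     # 1128
print(protein_length_aa(length))  # 375
```

Both values match the Gene Length and Protein Length fields of this record.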


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value: 0.91298
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 36
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTACGC TTCGCGTGGA CTCTCTGCGC AAAGAGTTCG ACAACGGTCG TATCGTCGCC 
GTCAACGACC TCTCTCTGGA CGTGGCAGAC GGAGAGTTCG TCACGGTCGT CGGCCCGTCG
GGATGTGGCA AGTCGACGAC CCTACGGATG CTGGCTGGCC TCGAACAGCC GACTGACGGC
AAGATCTTCA TCGACGGCGA GGACATCACA GACGTGCATG CGCGCAGCCG AGACGTCGCG
ATGGTGTTCC AGAACTACGC GCTGTACCCG CACAAGACGG TCCGCCAGAA CATGGCATTC
GGCCTCCGGA TGAGCACGGA TCTCTCGTCG GACCAGCGCG AGGAGAAGGT CCGTGAGACA
GCCGCGATGA TGGATATCGA GGAACTACTG GACGACAAGC CCAATGCGCT CTCGGGGGGA
CAGAAACAGC GCGTCGCGCT CGGCCGTGCC ATCGTGCGCG AGCCCGACGT GTTCCTGTTC
GACGAACCAC TCAGCAACCT CGACGCGAAG CTCCGGACGA CGATGCGAAC GGAGATTCAG
CGACTCCAGG ACGAACTGGG AACGACCTCG ATCTACGTCA CACACGATCA GGAAGAAGCG
ATGACGATGG GTGACCGGAT CGCAATCCTC GACAACGGAA TCCTCCAGCA GGTGGGATCA
CCAAAGCACG TCTACCAGAA CCCGGTCAAC GAGTTCGTCG GGACGTTCGT GGGCTCCCCG
GCGATGAACA TGCTCGACGT GAGCGTCAGT ACGGGCGACA GCGTATGCTT GACTAACGGC
GATCGCTTTG CCTATTCTCT CGACGGGCCC GTTGCACAGG CGGTCGCAGA CGCCGAGGTC
GACAGCGCTC GGCTCGGGAT CCGTCCGGAA GACGTCGCCG TCAGCCGAGA GCCCGCAGAG
AGCGACATCC GGGCGGTCGT CGAGGTCGTC GAACCGATCG GAAGCGACAA CTACCTCTAT
CTCGATCTGG GCGAGTCGTT CATCGCGCGC GTCGCCGCCG ATATCGAACC GACACGGGGC
GACACTGTCG GCGTTCAGTT CGACGAGTCC GATATCCATC TGTTCGATAC GTATGGGTTC
TCGATTCTCT CCGAGAAAGA GACCGAACGA TCCCCCGTCA CGGCCTGA
 
Protein sequence
MATLRVDSLR KEFDNGRIVA VNDLSLDVAD GEFVTVVGPS GCGKSTTLRM LAGLEQPTDG 
KIFIDGEDIT DVHARSRDVA MVFQNYALYP HKTVRQNMAF GLRMSTDLSS DQREEKVRET
AAMMDIEELL DDKPNALSGG QKQRVALGRA IVREPDVFLF DEPLSNLDAK LRTTMRTEIQ
RLQDELGTTS IYVTHDQEEA MTMGDRIAIL DNGILQQVGS PKHVYQNPVN EFVGTFVGSP
AMNMLDVSVS TGDSVCLTNG DRFAYSLDGP VAQAVADAEV DSARLGIRPE DVAVSREPAE
SDIRAVVEVV EPIGSDNYLY LDLGESFIAR VAADIEPTRG DTVGVQFDES DIHLFDTYGF
SILSEKETER SPVTA
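
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any downstream use. The sketch below shows one way to do that and to compute a GC fraction for comparison with the GC content field. The function names are hypothetical, and the example input is only the first two blocks of the gene sequence, not the full record.

```python
def normalize_sequence(text: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two blocks of the gene sequence from this record.
fragment = normalize_sequence("ATGGCTACGC TTCGCGTGGA")
print(fragment)               # ATGGCTACGCTTCGCGTGGA
print(len(fragment))          # 20
print(gc_fraction(fragment))  # 0.6
```

Passing the full gene sequence through the same two functions gives a value that can be compared against the reported GC content.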