Gene Hmuk_3313 details


Gene Information

Locus tag: Hmuk_3313
Symbol:
ID: 8409391
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halomicrobium mukohataei DSM 12286
Kingdom: Archaea
Replicon accession: NC_013201
Strand:
Start bp: 119893
End bp: 120981
Gene Length: 1089 bp
Protein Length: 362 aa
Translation table: 11
GC content: 64%
IMG OID: 645018245
Product: periplasmic solute binding protein
Protein accession: YP_003175766
Protein GI: 257372992
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0803] ABC-type metal ion transport system, periplasmic component/surface adhesin
TIGRFAM ID:
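
The coordinates and lengths in the Gene Information table can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length includes a single terminal stop codon; neither convention is stated on this page, although both agree with the listed values. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 119893
END_BP = 120981
GENE_LENGTH_BP = 1089
PROTEIN_LENGTH_AA = 362


def span_length(start, end):
    """Length of a coordinate interval, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_codons(length_bp):
    """Number of amino-acid codons, assuming one trailing stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)       # True
print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)    # True
```

Under these assumptions the 1089 bp span corresponds to 363 codons, that is 362 amino acids plus the stop codon.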


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACGACA CACCGCAGTC TCGAACGCGG GACAGATTCT CACGCAGGAG AGCCATTACT 
GCCGGTGCAG GACTTCTCGC CACCGGCCTC GCCGGATGCA CGAACAGCGT CGGTAGCAGC
GATACCACGC CCGCGAGCGA CGCGGGCGGC GCGAACGGAG ACGGCCCCAC TGTCGCAGTC
GCCTCCTTTT TCAGCTTCTA CGACTTCGCA CGGAACGTCG TCGACGGGAC ACCGCTTCGA
GTGAAAAACC TCGTCCCGAC CGGACTGCAC GGTCACGGGT GGGAGCCGAA TGCGAGCGTC
ACGAAAGAGA TCGTCGAAGC CGACGCGTTT CTCCACGTCG GTCCGGGGTT CCAGCCGTGG
GCCGACCGCG CGATTCAGAC GCTCGAAGAC GACGCCGTCG ACACACAGTT GATCAACGCC
CGTGAGGGCG TCGAAATGGT CGATCTCGCC GCGACGCTGG ACCCCGAGGA AGAGGGGGTC
GGAAAGCAGC AAGGGAAGGA CCCACACTTC TGGCTCGATC CCGACCGCGC GAAGAAATCG
GTAGACAACA TCGCCGACGG GCTCGCGAAA CTCGCGCCCG ACCAAGCCAA CACTCTCCGA
ACGAACGCCG AGACGTACAA ATCCGACACC CTCGAACGGA TCGACCGGGA CTACCGGGCC
ATCTTCGATG CCGCCGACCG AAACGTCGTG CAGCTCGCGG CGCACAACGC CTTCCAGTAC
ATCGGCGTCA AATACGACGC CGAGATGGTC CCCCTCGTTA CGAACCTCGC AGCCAGCGGT
GACGTCAAGC CCTCGGACAT CACCGAGGCG AAGGCGGTCA TCGAGCGAAA CGACATCGAC
TACATCGCAA ACGGCGTCTT CGAGTCACGG AAGCCGGCGA AGCAACTGCT CGACGAAACG
CGAGTCGCCG GCTATCTCCC CGTCACCCCC TACGCGGGGG TCCGGGAAGA CTGGGTCGAG
AACGACTGGG GCTACGAGGA GATCGCCTAC AACATCAACA TGCCCACGTT CGAGGTCGTC
CTCGGCAACA AACGACCCGA GGAAGCCGGA CCCGACGGCT GGGCCGACGA GTGGCTGAAC
TTCGAGTGA
 
Protein sequence
MDDTPQSRTR DRFSRRRAIT AGAGLLATGL AGCTNSVGSS DTTPASDAGG ANGDGPTVAV 
ASFFSFYDFA RNVVDGTPLR VKNLVPTGLH GHGWEPNASV TKEIVEADAF LHVGPGFQPW
ADRAIQTLED DAVDTQLINA REGVEMVDLA ATLDPEEEGV GKQQGKDPHF WLDPDRAKKS
VDNIADGLAK LAPDQANTLR TNAETYKSDT LERIDRDYRA IFDAADRNVV QLAAHNAFQY
IGVKYDAEMV PLVTNLAASG DVKPSDITEA KAVIERNDID YIANGVFESR KPAKQLLDET
RVAGYLPVTP YAGVREDWVE NDWGYEEIAY NINMPTFEVV LGNKRPEEAG PDGWADEWLN
FE
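
The gene and protein sequences above are printed in blocks of ten characters. A minimal sketch of how the blocked text could be cleaned, translated, and checked for GC content is given below. It uses only the Python standard library; the helper names (`clean`, `gc_percent`, `translate`) are hypothetical and not part of any database tool. The codon table is the standard one, on the assumption that translation table 11 assigns the same amino acids to sense codons and differs only in permitted start codons.

```python
# Standard codon table built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(blocked):
    """Remove the spaces and line breaks used for block formatting."""
    return "".join(blocked.split()).upper()


def gc_percent(dna):
    """Percentage of G and C bases in a non-empty DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna):
    """Translate in frame 1 and stop at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last lines of the gene sequence listed above.
first_line = "ATGGACGACA CACCGCAGTC TCGAACGCGG GACAGATTCT CACGCAGGAG AGCCATTACT"
last_line = "TTCGAGTGA"

print(translate(clean(first_line)))  # MDDTPQSRTRDRFSRRRAIT
print(translate(clean(last_line)))   # FE
```

The two printed fragments match the start and end of the protein sequence listed above. Passing the full cleaned gene sequence to `gc_percent` would give a value to compare against the GC content reported in the Gene Information section.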