Gene Hmuk_2419 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHmuk_2419 
Symbol 
ID8411963 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalomicrobium mukohataei DSM 12286 
KingdomArchaea 
Replicon accessionNC_013202 
Strand
Start bp2318656 
End bp2320218 
Gene Length1563 bp 
Protein Length520 aa 
Translation table11 
GC content71% 
IMG OID645020762 
Producthypothetical protein 
Protein accessionYP_003178236 
Protein GI257388463 
COG category[S] Function unknown 
COG ID[COG3390] Uncharacterized protein conserved in archaea 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones39 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.226401 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCACCG ACGAGGATCA GGGCGGCGCG GGCCGGCGCG AGGTCGCCTA TCGCCTCTTT 
GCCGCCGAGT TCGACGACGC GGACTTCTCG TACTCCGAGA GCGACGAGGA GCGTGCGCCG
AACTACGTCG TCACGCCGAC GGGCGCACGC GTGAACAGAG TGTTTCTGGT GGGCGTGCTG
ACGGAGGTCG AGACGGTCAG CGAGGACTAC CTCCGGGCCC GCGTCGTCGA CCCCTCGGGT
CCCTTCGTCG TCTACGCCGG CCAGTACCAG CCCGAGGCGC TTGCCTTCCT GGAGAGTGCC
GATCCGCCGA CGTTCGTGGC GGTCACGGGC AAGGCTCGGA CCTACCAGCC CGACGACAGC
GATCAGGTGT ACACCTCCGT CCGGCCCGAG TCGATCAGCG AGGTCGACGC GGCGACGCGG
GACCGCTGGG TCGTCCAGAC GGCAGAGCAG ACGATCGACC GCGTGGCACG CGCCGCGGAG
GGCAAACACG CTGGCCTGAC CGGCGAGGAC CTGCGGGCGG CGCTCGTCGA CCACGGCGTC
GACGAGGGAG TGGCCGCTGG CATTCCGATC GCACTGGAAC GCTACGGAAC GACCGGCGAC
TACCTCGCGG CAGTGCGGGA CCTCGCGACC GACGCCGCCC GCGTCGTCGC CGGCCAGCGA
GACGAGGTCG AGCCACTGAC CGTCCGACCG GGACAGGGCA GCGACGACCA ACTCGCGGGG
CTCGTCGAGC GACCGATCGA GGTGTCGACC GACGACGAAG CGGCCGGAAC GGTCGCCGAG
GAATCGGTCG AAACAGACCG ACCGGACGAG ACGGCCGACG ACCAGCGTGC GGGAACCGCG
GAAGCGACGA CGGGCCAGCA GTCGAGCGGT TCGGACGCCC CCTCGATCGA CGACGAGTCC
GCGGACGAGA CGACCGAGAG CGTCGACGAC GTGGCCGAGG AAGAACCGTC CGGAGCCGGC
TCGCCGGACG AGACGGCAGA AGCGTCGCCG GACGAGATCG ACACGACGAC GGGCGACGAG
CCCACCGACG AGTCAGCGTC AGACCTCGGG ACGGCCGCCG ATTCGTCTGC GACGGCGGAG
GCGGACTCGG TGGACGACGA TCTCGGGGAG TTCGACCCCG AGTTCGAACT CGACGAGAGC
GAACGCGAGG AGATCGAGTC CGAGTACGGC ACGGAGTTCC AGAGCGGGAC CGAGGTCGAC
GAGCCCGGCG CGGCCGACAT CGACACGCCC GATCCGGAGG CGGTCGACGC CGAGAGCGCC
ACGGCAGCAG ACGCGGAGCC GTCGTCCGCC GCCGATCCGA GCGAGGGCGA AGTCGGGGCC
GCCCGAAGCG CCGACTCGCC AGACGGGGGC GAGGACGACG CGGCCGACGT GGACCTCGAA
GACGCGGTGA TGGACGTGAT GGACGATCTG GACGACGGGG ACGGCGCAGC CCGGACGGCG
ATCGTCGACG CCGTCGTCGA CCGGACCGGC GGCGATCCCA ACGCCGTCGA GGACGCCATC
CAGGACGCGC TGATGGGCGG GCGCTGCTAC GAGCCCGACG ACGGCCTGTT GAAACCGATC
TGA
 
Protein sequence
MSTDEDQGGA GRREVAYRLF AAEFDDADFS YSESDEERAP NYVVTPTGAR VNRVFLVGVL 
TEVETVSEDY LRARVVDPSG PFVVYAGQYQ PEALAFLESA DPPTFVAVTG KARTYQPDDS
DQVYTSVRPE SISEVDAATR DRWVVQTAEQ TIDRVARAAE GKHAGLTGED LRAALVDHGV
DEGVAAGIPI ALERYGTTGD YLAAVRDLAT DAARVVAGQR DEVEPLTVRP GQGSDDQLAG
LVERPIEVST DDEAAGTVAE ESVETDRPDE TADDQRAGTA EATTGQQSSG SDAPSIDDES
ADETTESVDD VAEEEPSGAG SPDETAEASP DEIDTTTGDE PTDESASDLG TAADSSATAE
ADSVDDDLGE FDPEFELDES EREEIESEYG TEFQSGTEVD EPGAADIDTP DPEAVDAESA
TAADAEPSSA ADPSEGEVGA ARSADSPDGG EDDAADVDLE DAVMDVMDDL DDGDGAARTA
IVDAVVDRTG GDPNAVEDAI QDALMGGRCY EPDDGLLKPI