Gene Hmuk_0828 details


Gene Information

Locus tag: Hmuk_0828
Symbol:
ID: 8410342
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halomicrobium mukohataei DSM 12286
Kingdom: Archaea
Replicon accession: NC_013202
Strand:
Start bp: 799406
End bp: 800503
Gene Length: 1098 bp
Protein Length: 365 aa
Translation table: 11
GC content: 66%
IMG OID: 645019163
Product: peptidase M29 aminopeptidase II
Protein accession: YP_003176666
Protein GI: 257386893
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2309] Leucyl aminopeptidase (aminopeptidase T)
TIGRFAM ID:
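
The coordinate and length fields above are mutually consistent, which a short check can confirm. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon; the record does not state either convention, and the function names `gene_length` and `protein_length` are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Hmuk_0828.
length = gene_length(799406, 800503)
print(length, protein_length(length))  # prints "1098 365"
```

Both results match the reported Gene Length (1098 bp) and Protein Length (365 aa).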


Plasmid Coverage information

Num covering plasmid clones: 36
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 40
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACCCAC GTGTCCGCGA ACACGCAGAG ATCGTCGCAG ACCACTCCGT CGAGCTGCAG 
GCCGGTGACG ACGTCGTCAT CGACGCCCAT CCCGACGCGG CGGACCTCGT GACGGCGCTC
CACGAGGTGA TCGCCGACCG CGGCGCGAAC CCACTCACCG TCCAGGACCG CCTCGGTGCT
CGCTTCCGAC GCGCGTATCT GCGCAACCAC GACGGCGACT TCGAGACGCC GGCACACGTC
CAGGCGCTGT ACGACGAGAT GGACGTGTAC ATCGCCATCC GCGGCGGCGG CAACGCCACC
GAGACCAGCG ACGTCGACCC CGAGACGACC GCGGCCTACC AGCAGGCCCA GCAACCGCTG
CTCGACGAAC GCCTCTCGAA GCGGTGGTGT CTCACCCAGT ACCCCGCCCA GACCAACGCC
CAGCTGGCCC AGCTCAGCAC GGAGGGCTAC GAGAACTTCG TCTGGGACGC GGTCAACAAG
GACTGGGACG CCGTCCGCGA ACACCAGTCC CAGATGGTCG ACATCCTCGA CCCCGCCGAC
GAGGTCCGGA TCGTCTCGGG CGACACCACC GACGTGACGA TGAGCGTCGC CGGCAACCCG
ACGCTCAACG ACTACGGCGA GCGCAACCTC CCCGGCGGCG AGGTCTTTAC CGCCCCCGTC
GCCGACAGCG TCGAGGGCGA GGTCCTGTTC GACAAGCCCC TGTACCATCA GGGCCGAGAA
GTGACGGACG CATACCTCAC GTTCGAGGAC GGCGAGGTCG TCGACCACAG CGCGTCGAAA
AACGAGGACG TGCTGACGGA AGTGCTCGAC ACCGACGCGG GCGCGCGCCG ACTCGGCGAA
CTCGGGATCG GGATGAACCG CGACATCGAC CAGTTCACCT ACAACATGCT GTTCGACGAG
AAGATGGGCG ACACCGTCCA CATGGCCGTC GGCCGCGCGT ACGACGACAC CGTCGGCGAA
GACAACGAGC AAAACGACAG CGCCGTCCAC GTCGACATGA TCGTGGACAT GAGCGAGGAC
TCGTACATCG AGGTGGACGG CGAGCGCGTA CAGGAGGACG GGACGTTCGT GTTCGAGGAC
AACGAAATCG AGCAGTAG
 
Protein sequence
MDPRVREHAE IVADHSVELQ AGDDVVIDAH PDAADLVTAL HEVIADRGAN PLTVQDRLGA 
RFRRAYLRNH DGDFETPAHV QALYDEMDVY IAIRGGGNAT ETSDVDPETT AAYQQAQQPL
LDERLSKRWC LTQYPAQTNA QLAQLSTEGY ENFVWDAVNK DWDAVREHQS QMVDILDPAD
EVRIVSGDTT DVTMSVAGNP TLNDYGERNL PGGEVFTAPV ADSVEGEVLF DKPLYHQGRE
VTDAYLTFED GEVVDHSASK NEDVLTEVLD TDAGARRLGE LGIGMNRDID QFTYNMLFDE
KMGDTVHMAV GRAYDDTVGE DNEQNDSAVH VDMIVDMSED SYIEVDGERV QEDGTFVFED
NEIEQ
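
The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any downstream use. The following is a minimal sketch of that step, with a GC-fraction helper and a basic CDS sanity check. All function names are hypothetical. The start-codon check accepts only ATG, which is a simplification: translation table 11 also permits alternative start codons.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """Rough check: length divisible by 3, ATG start, standard stop codon.

    Simplification: alternative start codons (e.g. GTG, TTG) are rejected.
    """
    return (
        len(seq) % 3 == 0
        and seq.startswith("ATG")
        and seq[-3:] in ("TAA", "TAG", "TGA")
    )


# Usage with the first and last blocks of the gene sequence shown above.
first_blocks = clean_sequence("ATGGACCCAC GTGTCCGCGA")
last_blocks = clean_sequence("AACGAAATCG AGCAGTAG")
print(first_blocks.startswith("ATG"), last_blocks.endswith("TAG"))
```

Applying `clean_sequence` to the full gene sequence and passing the result to `gc_fraction` gives a value that can be compared with the GC content field of the record.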