Gene Hlac_1664 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_1664 
Symbol 
ID7400421 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp1685033 
End bp1686472 
Gene Length1440 bp 
Protein Length479 aa 
Translation table11 
GC content74% 
IMG OID643708733 
Productpeptidase M28 
Protein accessionYP_002566319 
Protein GI222480082 
COG category[R] General function prediction only 
COG ID[COG2234] Predicted aminopeptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.617767 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACTCGG AGCCGAACGA GACGAACGCG GCGGTCGATC CGGCTGCCGT CGAGCGCGTC 
CGCGAGCGAC GCACCGAACT CGCACCGGCG CTCGGCCGGA CGTGGACCGA CGGTGACCCG
TGGCGCTTCC TCACCGACCT CACCGCGATC GGGAGCCGGA TGGCCGGTAG CGAGGGCGAG
CGCCGGGCCG CCGAGATCGT CGCCGACGCG TTCGAGCGGG CGGGGCTCTC CGCGGTCGAG
ACGCGCCCGT TCGAGATGGC GGCGTGGGAG CGCGGGAGCG CGACGCTCCG CGTGACGGCG
CCCGGACGCG ACGGCGCGGC GGCGACCCGC GAGTTCGAGG CGCTCGCGCT GCCGTACTCG
CCGGGCGGGA GTGTCACTGG GGAGCTCGTG GACGTGGGGT ACGGCACTCC CGCCGAGATC
GACGAGCGGG AGGTTGAGGG CCGGATCGCA GTCGCGTCGA CGACGACCCC GGAGGGCGGT
CGGTTCGTCC ACCGGATGGA GAAGTTCGGG TACGCGCTCG ACGCGGGCGC GGTCGGCTTC
GTCTTCGTCA ACCACCTCGA CGGCCAGCTT CCCCCCACCG GATCCCTGAC CTTCGGCGAG
GAGGCCGAGG CCGTCGCCGT CGGCGTCTCG AAGGAGACCG GCGCGTGGCT CCGGGAGTAC
GCGGCCGGAG GGGACGGCGG GGTCGCCGCC GAATCGAGCC CCGCTGCGCA GGCCGAGCTG
TCGGTGACGG CGACGACCGA GCCGGGCGAG AGCCGGAACG TAGTCGGTCA CGCGGGACCG
GACACCGACG AGCGGCTCCT CCTGCTCGCG CACTACGACG CCCACGACAT CGCGGAGGGC
GCGCTCGATA ACGGTTGCGG GATCGCGACC GTCGCGACCG CCGCGGGAAT CCTGACCGAG
GCGGACCTCC CGCTCGGCGT CGACGTGGTC GCGGTCGGGG CGGAGGAGGT GGGGCTCCTC
GGTTCGGAGC AGTTGGCAGA GCGGCTCGAC CTCGACCGGG TGAAGGGAGT GATCAACGTC
GACGGCGCGG GGCGGTTCCG CGACCTCGTG GCGCTGGCGC ACGCCTCCGA GACGGCTGCG
TCGGTCGCCG AGGCGGTGTC GACGGCGACG AACCAGCCGA TCGCTGTGGA CGCGGAGCCG
CACCCGTTCT CCGACCAGTG GCCGTTCGTC CGGCGCGGGG TGCCGGCGAT CCAGCTACAC
AGCGACTCCG GCGATCGGGG ACGCGGCTGG GGACACACCC ACGCCGACAC CCGCGACAAG
GTCGACGACC GAAATGTTCG GGAACACGCG ATGCTCATCG CCCTGCTCGT CGCCGAGTTC
GCAGCCCCCG AGCGCGACGC GCCCCGCCTC GACCGCGACG ACCTGATCGC GGCGTTCCGG
GACGCCGACT TCGAGACGGG CATGCGCGCG GCCGACCTCT GGCCGGCCGG CTGGGAGTAG
 
Protein sequence
MHSEPNETNA AVDPAAVERV RERRTELAPA LGRTWTDGDP WRFLTDLTAI GSRMAGSEGE 
RRAAEIVADA FERAGLSAVE TRPFEMAAWE RGSATLRVTA PGRDGAAATR EFEALALPYS
PGGSVTGELV DVGYGTPAEI DEREVEGRIA VASTTTPEGG RFVHRMEKFG YALDAGAVGF
VFVNHLDGQL PPTGSLTFGE EAEAVAVGVS KETGAWLREY AAGGDGGVAA ESSPAAQAEL
SVTATTEPGE SRNVVGHAGP DTDERLLLLA HYDAHDIAEG ALDNGCGIAT VATAAGILTE
ADLPLGVDVV AVGAEEVGLL GSEQLAERLD LDRVKGVINV DGAGRFRDLV ALAHASETAA
SVAEAVSTAT NQPIAVDAEP HPFSDQWPFV RRGVPAIQLH SDSGDRGRGW GHTHADTRDK
VDDRNVREHA MLIALLVAEF AAPERDAPRL DRDDLIAAFR DADFETGMRA ADLWPAGWE