Gene Hlac_2686 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_2686 
Symbol 
ID7400893 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp2675097 
End bp2676896 
Gene Length1800 bp 
Protein Length599 aa 
Translation table11 
GC content66% 
IMG OID643709760 
Productoligoendopeptidase F 
Protein accessionYP_002567327 
Protein GI222481090 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1164] Oligoendopeptidase F 
TIGRFAM ID[TIGR00181] oligoendopeptidase F 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTTCGG TTCCCGAGCG AGCGGAGATC GACGAGGCGT ACAAGTGGGA CCTCCAGGGC 
ATCTACGCGG ACGACGAGGA GTGGGAGTCC GCGTACGAGG CGGTGTCGGA GCGGATCGAG
GAGTTGGAGG CGTACGAAGG GCGCGTCGCC GACGACGCCG CGACCCTCCT CGAACTGCTG
GAGTTGCGCG AGGCGATCTT CCGCGATCTC GAGCAGGTGA TGACGTACGC CCAGCTCCGC
AGCGCCGAGG ACACCCGGAA CCAAGAGTAT CAGGCGATGT CCGCGAAGGC GTCCTCGCTC
GGCTCCGAGG CGTCGAGCAC GATCTCCTAT CTCGAACCGG AGCTCCAGTC GCTTACCGAG
GCGGACGTGG CGGCGTTCGT CGACGACGAG CCCGCGCTCG CCGAGTACGA GCACTACCTC
GACGACGTGT TACGGATGAA AGCCCACACG CGTTCGGCGG AGGTCGAGGA GGTGCTCGCG
GACCTGTCAG AGGTTACCGA CGCGCCGAGC GAGATCTACT CGATGCTGAC GAACGCCGAC
ATGACCTACG GCGTCGTCGA GAACCCCGAC GGCGAGGAGG TCGAGATCAC GCAGGCGAAC
TTCACAAAGC TCCAGAAGAA CCCGGACCGC AAGTTCCGCG AGCGCGTCCA CGAGACGTTC
TACGACGAGT GGGCCGGCGT CCGCAACACG GTCGGTACGT CGCTTAAAAA GGCCGTCCGA
GAGCACGCGA CGAGCGCCGA GATCCGCGGC TACGACTCCG CGCGTCAGGC CGCGCTCGAC
GGCTCGAACG TCCCGGTCGA AGTGTACGAC ACCCTCGTCG ACACGGTCGA CGACAACCTC
GACGTGCTTC ATCGGCATGC CGAGCTGAAG GCGGACGCGC TCGGCGTCGA CCAGTTGGAA
AGCCACGACC TGTACATGTC ACTGACGGGC GATCAGGGGC CGGACGTGGC GTACGAGCAG
GCCCGCGAGT GGGTGATCGA GGCGGTCGCG CCGCTGGGCG AGGCGTACCA AGAGCGGATG
GCCGAGGGGC TCGACTCGCG GTGGGTCGAC GTGTACGAGA ACCGCGGGAA GCGCTCGGGC
GCGTTCTCGT CGGGTACGTA CGACACCCAG CCGTACATTC TGATGAACTA CCAGGACGAT
ATCGCCTCGA TGTTCACGCT GGCCCACGAG CTTGGCCACT CGATGCACTC GGAGCTGTCC
GGCGACACCC AGCCGTGGCA CGATGCGAGC TACGACATCT TCGTCGCCGA GATCGCCTCT
ACCGTCAACG AGACCCTGCT CACCCACCAC CTGCTCGACA CGGTCGAGGA CGACGAGCTG
CGGACCCACG TGCTCGACGA GTATCTCGAA CGCTTCCGTT CGACCCTGTT CCGCCAGACG
ATGTTCGCGG CCTTCGAACA GCGGATTCAC GAGCGCGTCG AGGCCGACGA CGCGCTCACG
CCCGACGCGT TCGACGAGAT CTACGCCGAC CTCAAAGGCG ACTACTACGC GCCCGCCGAA
CTGACCGGCG GCGTCGAACG AGAGTGGGAG CGGATTCCGC ACTTTTACTA CAACTTCTAC
GTGTACCAGT ACGCGACCGG CATCTCCGCG GCGGCCGCGA TCGTCGAGCG CGTCCTCGAC
GAGGGCGACG AGGCCGCCGC CGACTACCGC GAGATGCTGC GGGCGGGCGG TTCGGATTAC
CCCCTCGACG TCGTCGAACT CGCGGGGATC GACATGGCAT CGCCCGAACC GATCGAGTCG
GCCGTCGGGA TCTACGACGA GTACCTCGAC GAGATTGCGG CGCTTCTGGA CGTGGAGTAG
 
Protein sequence
MSSVPERAEI DEAYKWDLQG IYADDEEWES AYEAVSERIE ELEAYEGRVA DDAATLLELL 
ELREAIFRDL EQVMTYAQLR SAEDTRNQEY QAMSAKASSL GSEASSTISY LEPELQSLTE
ADVAAFVDDE PALAEYEHYL DDVLRMKAHT RSAEVEEVLA DLSEVTDAPS EIYSMLTNAD
MTYGVVENPD GEEVEITQAN FTKLQKNPDR KFRERVHETF YDEWAGVRNT VGTSLKKAVR
EHATSAEIRG YDSARQAALD GSNVPVEVYD TLVDTVDDNL DVLHRHAELK ADALGVDQLE
SHDLYMSLTG DQGPDVAYEQ AREWVIEAVA PLGEAYQERM AEGLDSRWVD VYENRGKRSG
AFSSGTYDTQ PYILMNYQDD IASMFTLAHE LGHSMHSELS GDTQPWHDAS YDIFVAEIAS
TVNETLLTHH LLDTVEDDEL RTHVLDEYLE RFRSTLFRQT MFAAFEQRIH ERVEADDALT
PDAFDEIYAD LKGDYYAPAE LTGGVEREWE RIPHFYYNFY VYQYATGISA AAAIVERVLD
EGDEAAADYR EMLRAGGSDY PLDVVELAGI DMASPEPIES AVGIYDEYLD EIAALLDVE