Gene Hlac_1688 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_1688 
Symbol 
ID7400445 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp1707744 
End bp1709645 
Gene Length1902 bp 
Protein Length633 aa 
Translation table11 
GC content69% 
IMG OID643708757 
Productpeptidase S9 prolyl oligopeptidase active site domain protein 
Protein accessionYP_002566343 
Protein GI222480106 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.311184 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGACA CTGACGACGC CGACGTGCTC CGCGAACTCG CCTCTCTCCC GACGATCGCG 
AGCCCGCGGG TGTCCCCCGA CGGCGAGACG GTAGCCCTCT ACTACGACGT GACCGGCCGA
AACGAGCTCC ACCTCCTCGA TCCGCGCGAC GGGAACCGAG AGCAGCTGAG CGACGGCGAG
GTCCCGCGCT CGGTCCGCGC CGGGTTCGAG TGGGATCCGT CGGGCGACCG GCTCTTTTAC
CATCGGGACG AGGACGGCGA CGAACAACAC GACGTGTGGG CGATGTCGCT TGACGGCGAG
AGCGAGCCGA TCGTCGAGAT GGACGGCCAG CTCCGTCTCC ACAGCGTGAG CGAGGACGGC
GAGACGCTCC TGCTCGGCTC CAGTCGCGAC GGGCAGATGA ACCTCTATCG CCACGATCTG
CAGAGCGACG AGACGACGAA ACTCACCGAC TACGAGCGCG CCGTCGCCGC CGGCGAACTG
GCGCCCGACG GCGACCGGAT CGCGTACGCG ACCAACGAGA CCGACGCCTA CGAGAACCTC
GACGTGTACG TCGCCGACGC CGACGGGTCG AACCCACGGA ACCTCGATAT CGGCGACGTG
GGCGCGGAGG CGGCCCCGAT CGACTGGGGG CCGGACGGCG ACCGACTCCT CGTGAGCGAC
AACACCGAGG ATCTGAATCG CAGCGGGATC GTCGACCTGA GTGGGGACGT CTCCGGCGCC
GCCGACGTGA CCTGGTTCGG CGGCGACGAG TTCGAGGAGT CGCCGAGCCA CTTCCTGGAG
GCCGGCGACC AATTCGTCGC GAGCCGGACG CGCGGGGCCG TGACGGTGCC CGTAATCTAC
GACGTCGAGA CGGGTGAGGC GCGCGAGCTC GACTTCCCGG CCGGCGTCGC CAACGTGACT
GAGGGTCGAC TGGCCGACGA CCGCCTGCTG GCGTACCGGA CCACGTCGAG CCGGCGGCCG
GAGCTGGTCG CGTACGACCT CGCGAGCGAC GCGACGGAGA CGGTTCTCGA CGCCGAGTAC
GGCCCGTTCG CGCCCGACGA CTTCGTCGAG CCCGAGACGG TCTCGTTCGT CTCCGACGGC
GTTCCGGAGA CCCCGGCGCG GGCAGTCGAT CACGCCCCCT ACGAGGAGTT CGAGATCGAG
GGACTGCTGT TCGACTCCGG CCGCCGCCCC TCGCCGCTTA TCGTGAATCC GCACGGCGGC
CCACGACACC GCGACAGTCG GCAGTTCAGC TACCGGGTGC AGTTCCTGCT CGCGCGCGGC
TACTCGGTGC TGCAAGTGAA CTACCGCGGC TCCACCGGGC GCGGCCGCGA GTTCGTCGAG
GAGTTGTACG ACGACTGGGG CGGCGCCGAG CAGGGCGACG TGGCGACCGG CGTCGAGCAC
GTCCTCAACG AATACGATTG GCTCGACGAG GATCGCGTCG CCGTCTACGG CGGCTCCTAC
GGCGGCTACT CGGCAAACTG GCAGATGGTC CAGTACCCCG ACCTGTACGC CGCTGGGATC
GCGTGGGTCG GCGTGAGCGA TCTGTTCGAC ATGTACGAGA ACACGATGCC GCACTTCCGG
ACGGAGCTGA TGGTGAAGAA CCTCGGCGAG CCAGACGAGA ACGAGGCGCT CTACCGCGAG
CGCAGTCCCG TGACCCACGT CGAGAACCTC GACGCGCCCC TCCTGATCGT CCACGGCGTG
AACGATCCGC GGGTGCCGGT CTCGCAGGCC AGAATTCTTC GGGACGCGCT CGACGACGCC
GGCTTCGAGG AGGGCGTCGA CTACGAGTAC GAGGAGCTCG GCGAGGAGGG CCACGGTTCC
GGCGACATCG ACCAGAAGAT CCGGTCGCTG GAACTGCTCG ACGACTTCCT CGACCGCCGG
ATCGGCGCGG AGCGGACCGC GGTCGCCTCG CTGGACGACT AG
 
Protein sequence
MSDTDDADVL RELASLPTIA SPRVSPDGET VALYYDVTGR NELHLLDPRD GNREQLSDGE 
VPRSVRAGFE WDPSGDRLFY HRDEDGDEQH DVWAMSLDGE SEPIVEMDGQ LRLHSVSEDG
ETLLLGSSRD GQMNLYRHDL QSDETTKLTD YERAVAAGEL APDGDRIAYA TNETDAYENL
DVYVADADGS NPRNLDIGDV GAEAAPIDWG PDGDRLLVSD NTEDLNRSGI VDLSGDVSGA
ADVTWFGGDE FEESPSHFLE AGDQFVASRT RGAVTVPVIY DVETGEAREL DFPAGVANVT
EGRLADDRLL AYRTTSSRRP ELVAYDLASD ATETVLDAEY GPFAPDDFVE PETVSFVSDG
VPETPARAVD HAPYEEFEIE GLLFDSGRRP SPLIVNPHGG PRHRDSRQFS YRVQFLLARG
YSVLQVNYRG STGRGREFVE ELYDDWGGAE QGDVATGVEH VLNEYDWLDE DRVAVYGGSY
GGYSANWQMV QYPDLYAAGI AWVGVSDLFD MYENTMPHFR TELMVKNLGE PDENEALYRE
RSPVTHVENL DAPLLIVHGV NDPRVPVSQA RILRDALDDA GFEEGVDYEY EELGEEGHGS
GDIDQKIRSL ELLDDFLDRR IGAERTAVAS LDD