Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hlac_1688 |
Symbol | |
ID | 7400445 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012029 |
Strand | - |
Start bp | 1707744 |
End bp | 1709645 |
Gene Length | 1902 bp |
Protein Length | 633 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 643708757 |
Product | peptidase S9 prolyl oligopeptidase active site domain protein |
Protein accession | YP_002566343 |
Protein GI | 222480106 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.311184 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTGACA CTGACGACGC CGACGTGCTC CGCGAACTCG CCTCTCTCCC GACGATCGCG AGCCCGCGGG TGTCCCCCGA CGGCGAGACG GTAGCCCTCT ACTACGACGT GACCGGCCGA AACGAGCTCC ACCTCCTCGA TCCGCGCGAC GGGAACCGAG AGCAGCTGAG CGACGGCGAG GTCCCGCGCT CGGTCCGCGC CGGGTTCGAG TGGGATCCGT CGGGCGACCG GCTCTTTTAC CATCGGGACG AGGACGGCGA CGAACAACAC GACGTGTGGG CGATGTCGCT TGACGGCGAG AGCGAGCCGA TCGTCGAGAT GGACGGCCAG CTCCGTCTCC ACAGCGTGAG CGAGGACGGC GAGACGCTCC TGCTCGGCTC CAGTCGCGAC GGGCAGATGA ACCTCTATCG CCACGATCTG CAGAGCGACG AGACGACGAA ACTCACCGAC TACGAGCGCG CCGTCGCCGC CGGCGAACTG GCGCCCGACG GCGACCGGAT CGCGTACGCG ACCAACGAGA CCGACGCCTA CGAGAACCTC GACGTGTACG TCGCCGACGC CGACGGGTCG AACCCACGGA ACCTCGATAT CGGCGACGTG GGCGCGGAGG CGGCCCCGAT CGACTGGGGG CCGGACGGCG ACCGACTCCT CGTGAGCGAC AACACCGAGG ATCTGAATCG CAGCGGGATC GTCGACCTGA GTGGGGACGT CTCCGGCGCC GCCGACGTGA CCTGGTTCGG CGGCGACGAG TTCGAGGAGT CGCCGAGCCA CTTCCTGGAG GCCGGCGACC AATTCGTCGC GAGCCGGACG CGCGGGGCCG TGACGGTGCC CGTAATCTAC GACGTCGAGA CGGGTGAGGC GCGCGAGCTC GACTTCCCGG CCGGCGTCGC CAACGTGACT GAGGGTCGAC TGGCCGACGA CCGCCTGCTG GCGTACCGGA CCACGTCGAG CCGGCGGCCG GAGCTGGTCG CGTACGACCT CGCGAGCGAC GCGACGGAGA CGGTTCTCGA CGCCGAGTAC GGCCCGTTCG CGCCCGACGA CTTCGTCGAG CCCGAGACGG TCTCGTTCGT CTCCGACGGC GTTCCGGAGA CCCCGGCGCG GGCAGTCGAT CACGCCCCCT ACGAGGAGTT CGAGATCGAG GGACTGCTGT TCGACTCCGG CCGCCGCCCC TCGCCGCTTA TCGTGAATCC GCACGGCGGC CCACGACACC GCGACAGTCG GCAGTTCAGC TACCGGGTGC AGTTCCTGCT CGCGCGCGGC TACTCGGTGC TGCAAGTGAA CTACCGCGGC TCCACCGGGC GCGGCCGCGA GTTCGTCGAG GAGTTGTACG ACGACTGGGG CGGCGCCGAG CAGGGCGACG TGGCGACCGG CGTCGAGCAC GTCCTCAACG AATACGATTG GCTCGACGAG GATCGCGTCG CCGTCTACGG CGGCTCCTAC GGCGGCTACT CGGCAAACTG GCAGATGGTC CAGTACCCCG ACCTGTACGC CGCTGGGATC GCGTGGGTCG GCGTGAGCGA TCTGTTCGAC ATGTACGAGA ACACGATGCC GCACTTCCGG ACGGAGCTGA TGGTGAAGAA CCTCGGCGAG CCAGACGAGA ACGAGGCGCT CTACCGCGAG CGCAGTCCCG TGACCCACGT CGAGAACCTC GACGCGCCCC TCCTGATCGT CCACGGCGTG AACGATCCGC GGGTGCCGGT CTCGCAGGCC AGAATTCTTC GGGACGCGCT CGACGACGCC GGCTTCGAGG AGGGCGTCGA CTACGAGTAC GAGGAGCTCG GCGAGGAGGG CCACGGTTCC GGCGACATCG ACCAGAAGAT CCGGTCGCTG GAACTGCTCG ACGACTTCCT CGACCGCCGG ATCGGCGCGG AGCGGACCGC GGTCGCCTCG CTGGACGACT AG
|
Protein sequence | MSDTDDADVL RELASLPTIA SPRVSPDGET VALYYDVTGR NELHLLDPRD GNREQLSDGE VPRSVRAGFE WDPSGDRLFY HRDEDGDEQH DVWAMSLDGE SEPIVEMDGQ LRLHSVSEDG ETLLLGSSRD GQMNLYRHDL QSDETTKLTD YERAVAAGEL APDGDRIAYA TNETDAYENL DVYVADADGS NPRNLDIGDV GAEAAPIDWG PDGDRLLVSD NTEDLNRSGI VDLSGDVSGA ADVTWFGGDE FEESPSHFLE AGDQFVASRT RGAVTVPVIY DVETGEAREL DFPAGVANVT EGRLADDRLL AYRTTSSRRP ELVAYDLASD ATETVLDAEY GPFAPDDFVE PETVSFVSDG VPETPARAVD HAPYEEFEIE GLLFDSGRRP SPLIVNPHGG PRHRDSRQFS YRVQFLLARG YSVLQVNYRG STGRGREFVE ELYDDWGGAE QGDVATGVEH VLNEYDWLDE DRVAVYGGSY GGYSANWQMV QYPDLYAAGI AWVGVSDLFD MYENTMPHFR TELMVKNLGE PDENEALYRE RSPVTHVENL DAPLLIVHGV NDPRVPVSQA RILRDALDDA GFEEGVDYEY EELGEEGHGS GDIDQKIRSL ELLDDFLDRR IGAERTAVAS LDD
|
| |