Gene Hlac_0002 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_0002 
Symbol 
ID7399445 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp2519 
End bp3616 
Gene Length1098 bp 
Protein Length365 aa 
Translation table11 
GC content64% 
IMG OID643707056 
ProductSignal peptidase I-like protein 
Protein accessionYP_002564678 
Protein GI222478441 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG0681] Signal peptidase I 
TIGRFAM ID[TIGR02228] signal peptidase I, archaeal type 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.756287 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACGAAG GGGACCGGTC GGACGGCGGC GACGAACCCG ACCGGAAGAC CGGATCGGCA 
GACAGCGATC CCGTCAACGA CGGCGATTTC AGCGGCGACG ACGGAAACGG CGGCGACCAG
TCGGTTTCCA AAGAGGATTG TGAGACGGGT CCCGGGGAAA GTTCTGAAAC AGGTTCTGCG
GACACCGAGA CACCGAGTGA CGAACGCGGC CGAGCCTCCG ACGATCGGAT CGAAACTGGT
CAGGCCGGAT CAAAACCAAG TCGAGGGAGG GGTGAAACGG GACGAGACCG AACCGAAACG
GACCGAGACT CGAATTCTTC CGGCAAAGGG GTCCTGTACC GGTTCCGTCA CAACAGAGAG
GGGCCGCTGA TGTGGATCCG GGAAATGCTC TCCAGCGTGG CCATCGTGCT TGTGATCGGC
CTGATCCTGT TCGGCGTCAG CGGCGTGTGG CCGCCGATGG TCGCGGTCGA GTCCGGGAGC
ATGGAGCCGA ACATTGAGGT TGGTGACCTC GTCTTCGTCA CGGAGCCGGG GCGACTTGCA
CCCGACGCCG CGGACAACGA CATCGGTGTC GTGACTCACG AGGTCGGGGA GACCGCTGAC
TACCAGACGT TCGGGTCCTA CGGCTCGGTG GTGATCTACC GACCACCGGG ACGGACGACT
TCGCCGATCA TCCATCGGGC GATGTTTCAT GTGGAGGAAG GCGAAAACTG GCACGACCGC
GCCGACGATC GGTACCACAA CGCCGCCGAC TGCGGGGAAC TCAACCACTG TCCCGCACCC
CACGACGGAT TCATCACGCT CGGCGACAAC AACGGCGAGT ACGATCAGGC GAACGGGCTC
GCCGCGCCGG TCAAGGCCGA CTGGGTGACC GGAGTCGCGC GGGTCCGTGT GCCGTACCTC
GGCTACGTGC GACTGATCAC GACCGGCCAG GCGGATCTGA GTGACGTGTT GGCGACGAGC
GTCGTGATGC AGACTGGAGG GGTCGGCGCC GACGCCGACG GAGTCAGTAG TGGAAGTGGA
TCTAGCGAGA AGATCACCGT TCCTGACGCG AAGCCCATCG TTTCGGGTGG AGAGGTAACC
GCGGAGGCCG TCGCTTAA
 
Protein sequence
MDEGDRSDGG DEPDRKTGSA DSDPVNDGDF SGDDGNGGDQ SVSKEDCETG PGESSETGSA 
DTETPSDERG RASDDRIETG QAGSKPSRGR GETGRDRTET DRDSNSSGKG VLYRFRHNRE
GPLMWIREML SSVAIVLVIG LILFGVSGVW PPMVAVESGS MEPNIEVGDL VFVTEPGRLA
PDAADNDIGV VTHEVGETAD YQTFGSYGSV VIYRPPGRTT SPIIHRAMFH VEEGENWHDR
ADDRYHNAAD CGELNHCPAP HDGFITLGDN NGEYDQANGL AAPVKADWVT GVARVRVPYL
GYVRLITTGQ ADLSDVLATS VVMQTGGVGA DADGVSSGSG SSEKITVPDA KPIVSGGEVT
AEAVA