Gene Hlac_0195 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_0195 
Symbol 
ID7402124 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp211042 
End bp212994 
Gene Length1953 bp 
Protein Length650 aa 
Translation table11 
GC content72% 
IMG OID643707258 
Producttype II secretion system protein E 
Protein accessionYP_002564870 
Protein GI222478633 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG4962] Flp pilus assembly protein, ATPase CpaF 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.617369 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCTTCG ATCTCCCTTC GCTGTCGCGG CTCGGTGGCG ACGACACGTC GCTGCGCTCG 
CTCCCGTGGC TCGACGGCGA CACAGATAGC TGTCAGTGCG ATCCATCGTT CAGAGAGCCA
GTCGGGACCG GTGTTGACGA CCGAGTGGTC CTTTCAGTGG ACGCCGACGA CTGCCCCGGG
CGCGGCGACC TCGCGGCGAG CCCCGCCTGC CTCGCGACGG TCGTGGAGAC GCTGACGGAG
CGCGACGCCG ACGTGATCCG GACGCACCAC GCGGGCCGAG AGCGAACCTA TGCCGGGCGG
GCCGCGGCGT GCCTGATCGC CGCCGGGCGG TTCCGAGAGC AGATCGAGTT CCACGAGACG
CGGCTCGCGG AGCGCGTGAC CCGAGAGCCT ATCGAGGCAG CGCGGGAGGC GAGCGGCCGC
GAGGGACCAC CGAAGCGGAT CGCCGCGGAG ACCGGACTGG CCGAGACCGT CGCTAGCGCT
GAGGAGACCG GCGACGTGTT GCGAGCGCAC GCCGGTCCGA CGATCGCGGC CACGCGCATC
GCGTCGGCGC CGCCGCCGGA TGCGGCGCTC GTCGACCGAT GGGAGATCGA TACCGGCGCG
ACCGTGCGAC TGTACGAGGG AGCGGGACCG CTTCGGACGT ACCACCTCAC GCCCCCCTCG
GCCCGGCTCG ACGACGAGGC CGTCGAGCGG CTCGCCGACG CCAAAGACCG ACTGCTCGAT
GACCCAACCG GCGGCGACAG GGCACCAGGG CGAGCGGTGC GGTCGATCGC GTCGGCCGGT
GACCCCGTCT CGACACTGGT CGACGTGCTT CGACGACACA CGCGCGGATA CGGCGCGTTC
GAGCACGTGT TCGCTGACGA CCGCGTGAGC GACGCGACGC TGTCGGCGCC CGTCGCCGAG
AATCCGCTTC GGGTGGTCGT CGACGGGGAG CGGTGTCGCA CCAACGTCAG GCTGCCGCCG
GAGGGGGCGG CGACGTTGGC GTCCCGCCTT CGGCGGACGA GCGGGCGAGG GTTCTCCCGA
GCAAGTCCCA CGCTCGACGC GACGATCGAG GCCGAAGCCG GTCGAGTTCG GGTCGCAGCG
ACGACGGCCC CCGCCAGCGA CGGCCTCTCG TTTACGTTCC GCCGCGGTGA CCCGGACGCG
TGGACCCTCG CGCGGCTCGT GGATGGAGGG ACGATCACGC CCGCGGCCGC TGGGCTCCTT
TCCGTGGCCG TCAAGCGAGG CGTGACTGGA CTCGTGGCCG GCGGCCGCGG CGCGGGGAAG
ACGACCGCGC TTGGGTCCCT CCTCTGGGAG CTGCCAGCCG AGACGCGATC GATATTGATC
GAGGATACGC CGGAGCTGCC GGCCGCCGCG GTGACTGCGG CCGGGCGCGA CACCCAGCGG
CTCCGAGTGG GTGACGGCGC AGAGCCGTCA CCGAGCGAGG CGGTGCGAAC TGCCCTCCGG
CTCGGCGGTG GTGCGATCGT CGTCGGCGAG GTCCGCGGCG AGGAGGCAGC GGCGTTGTAC
GAGGCGATGC GCGTCGGCGC GGCCGGCGAG GCGGTCCTCG GGACGATCCA CGGCGAGGAC
CCCGCCGCGG TCCGAGAGCG GGTCGTTACC GATCTCGGCG TCTCCCCATC CTCGTTCGCT
GCGACCGATC TGATCGTTGT GCTCGACGAT CATCGCGTCG AGACGATCGC CGAGGTCGTC
GGTCACGACG GCGAGGCCTC GTTCGAGCCG CTGTTCGAGC GAACCGGATC AGGGTTGGTC
TCTACCGGCC GGATCGATCG CGGGGAGAGC CGGCTCGTCG AGTCGCTCGC AGAGACCGAC
GAATCGTACG CATCCGTCCG CGACGCCGTC GATCGGCGGG GAGAGAGAAT CGGTGAGGCA
GCGAGAACCG GCCGGATCAC GCCGGAGCGC TACGTCGGCC GCGACGGCGA CGGCTGGCAG
AGGACGGACA GCCTCGGCGG TGGCAACCAA TGA
 
Protein sequence
MSFDLPSLSR LGGDDTSLRS LPWLDGDTDS CQCDPSFREP VGTGVDDRVV LSVDADDCPG 
RGDLAASPAC LATVVETLTE RDADVIRTHH AGRERTYAGR AAACLIAAGR FREQIEFHET
RLAERVTREP IEAAREASGR EGPPKRIAAE TGLAETVASA EETGDVLRAH AGPTIAATRI
ASAPPPDAAL VDRWEIDTGA TVRLYEGAGP LRTYHLTPPS ARLDDEAVER LADAKDRLLD
DPTGGDRAPG RAVRSIASAG DPVSTLVDVL RRHTRGYGAF EHVFADDRVS DATLSAPVAE
NPLRVVVDGE RCRTNVRLPP EGAATLASRL RRTSGRGFSR ASPTLDATIE AEAGRVRVAA
TTAPASDGLS FTFRRGDPDA WTLARLVDGG TITPAAAGLL SVAVKRGVTG LVAGGRGAGK
TTALGSLLWE LPAETRSILI EDTPELPAAA VTAAGRDTQR LRVGDGAEPS PSEAVRTALR
LGGGAIVVGE VRGEEAAALY EAMRVGAAGE AVLGTIHGED PAAVRERVVT DLGVSPSSFA
ATDLIVVLDD HRVETIAEVV GHDGEASFEP LFERTGSGLV STGRIDRGES RLVESLAETD
ESYASVRDAV DRRGERIGEA ARTGRITPER YVGRDGDGWQ RTDSLGGGNQ