Gene Information |
Locus tag | Hlac_0195 |
Symbol | |
ID | 7402124 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012029 |
Strand | - |
Start bp | 211042 |
End bp | 212994 |
Gene Length | 1953 bp |
Protein Length | 650 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 643707258 |
Product | type II secretion system protein E |
Protein accession | YP_002564870 |
Protein GI | 222478633 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG4962] Flp pilus assembly protein, ATPase CpaF |
TIGRFAM ID | |
|
|
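The coordinate and length fields in the Gene Information table can be cross-checked with simple arithmetic. The following is a minimal Python sketch, not part of the original record. It assumes that the start and end coordinates are 1-based and inclusive, and that the gene length includes the stop codon; the record does not state either convention. The function name `check_gene_lengths` is hypothetical.

```python
def check_gene_lengths(start_bp, end_bp, gene_len_bp, protein_len_aa):
    """Cross-check coordinates against the reported gene and protein lengths.

    Assumes 1-based inclusive coordinates and a gene length that
    includes the stop codon (both are assumptions, not stated in the record).
    Returns (span_bp, coding_codons) or raises ValueError on a mismatch.
    """
    span_bp = end_bp - start_bp + 1
    if span_bp != gene_len_bp:
        raise ValueError(f"span {span_bp} bp != reported {gene_len_bp} bp")
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    # One codon is the stop codon, which does not encode an amino acid.
    coding_codons = gene_len_bp // 3 - 1
    if coding_codons != protein_len_aa:
        raise ValueError(
            f"{coding_codons} coding codons != reported {protein_len_aa} aa"
        )
    return span_bp, coding_codons


# Values taken from the record for Hlac_0195.
print(check_gene_lengths(211042, 212994, 1953, 650))  # (1953, 650)
```

Under these assumptions, 212994 - 211042 + 1 = 1953 bp, and 1953 / 3 = 651 codons, of which 650 encode amino acids.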
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.617369 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCTTCG ATCTCCCTTC GCTGTCGCGG CTCGGTGGCG ACGACACGTC GCTGCGCTCG CTCCCGTGGC TCGACGGCGA CACAGATAGC TGTCAGTGCG ATCCATCGTT CAGAGAGCCA GTCGGGACCG GTGTTGACGA CCGAGTGGTC CTTTCAGTGG ACGCCGACGA CTGCCCCGGG CGCGGCGACC TCGCGGCGAG CCCCGCCTGC CTCGCGACGG TCGTGGAGAC GCTGACGGAG CGCGACGCCG ACGTGATCCG GACGCACCAC GCGGGCCGAG AGCGAACCTA TGCCGGGCGG GCCGCGGCGT GCCTGATCGC CGCCGGGCGG TTCCGAGAGC AGATCGAGTT CCACGAGACG CGGCTCGCGG AGCGCGTGAC CCGAGAGCCT ATCGAGGCAG CGCGGGAGGC GAGCGGCCGC GAGGGACCAC CGAAGCGGAT CGCCGCGGAG ACCGGACTGG CCGAGACCGT CGCTAGCGCT GAGGAGACCG GCGACGTGTT GCGAGCGCAC GCCGGTCCGA CGATCGCGGC CACGCGCATC GCGTCGGCGC CGCCGCCGGA TGCGGCGCTC GTCGACCGAT GGGAGATCGA TACCGGCGCG ACCGTGCGAC TGTACGAGGG AGCGGGACCG CTTCGGACGT ACCACCTCAC GCCCCCCTCG GCCCGGCTCG ACGACGAGGC CGTCGAGCGG CTCGCCGACG CCAAAGACCG ACTGCTCGAT GACCCAACCG GCGGCGACAG GGCACCAGGG CGAGCGGTGC GGTCGATCGC GTCGGCCGGT GACCCCGTCT CGACACTGGT CGACGTGCTT CGACGACACA CGCGCGGATA CGGCGCGTTC GAGCACGTGT TCGCTGACGA CCGCGTGAGC GACGCGACGC TGTCGGCGCC CGTCGCCGAG AATCCGCTTC GGGTGGTCGT CGACGGGGAG CGGTGTCGCA CCAACGTCAG GCTGCCGCCG GAGGGGGCGG CGACGTTGGC GTCCCGCCTT CGGCGGACGA GCGGGCGAGG GTTCTCCCGA GCAAGTCCCA CGCTCGACGC GACGATCGAG GCCGAAGCCG GTCGAGTTCG GGTCGCAGCG ACGACGGCCC CCGCCAGCGA CGGCCTCTCG TTTACGTTCC GCCGCGGTGA CCCGGACGCG TGGACCCTCG CGCGGCTCGT GGATGGAGGG ACGATCACGC CCGCGGCCGC TGGGCTCCTT TCCGTGGCCG TCAAGCGAGG CGTGACTGGA CTCGTGGCCG GCGGCCGCGG CGCGGGGAAG ACGACCGCGC TTGGGTCCCT CCTCTGGGAG CTGCCAGCCG AGACGCGATC GATATTGATC GAGGATACGC CGGAGCTGCC GGCCGCCGCG GTGACTGCGG CCGGGCGCGA CACCCAGCGG CTCCGAGTGG GTGACGGCGC AGAGCCGTCA CCGAGCGAGG CGGTGCGAAC TGCCCTCCGG CTCGGCGGTG GTGCGATCGT CGTCGGCGAG GTCCGCGGCG AGGAGGCAGC GGCGTTGTAC GAGGCGATGC GCGTCGGCGC GGCCGGCGAG GCGGTCCTCG GGACGATCCA CGGCGAGGAC CCCGCCGCGG TCCGAGAGCG GGTCGTTACC GATCTCGGCG TCTCCCCATC CTCGTTCGCT GCGACCGATC TGATCGTTGT GCTCGACGAT CATCGCGTCG AGACGATCGC CGAGGTCGTC GGTCACGACG GCGAGGCCTC GTTCGAGCCG CTGTTCGAGC GAACCGGATC AGGGTTGGTC TCTACCGGCC GGATCGATCG CGGGGAGAGC CGGCTCGTCG AGTCGCTCGC AGAGACCGAC 
GAATCGTACG CATCCGTCCG CGACGCCGTC GATCGGCGGG GAGAGAGAAT CGGTGAGGCA GCGAGAACCG GCCGGATCAC GCCGGAGCGC TACGTCGGCC GCGACGGCGA CGGCTGGCAG AGGACGGACA GCCTCGGCGG TGGCAACCAA TGA
|
Protein sequence | MSFDLPSLSR LGGDDTSLRS LPWLDGDTDS CQCDPSFREP VGTGVDDRVV LSVDADDCPG RGDLAASPAC LATVVETLTE RDADVIRTHH AGRERTYAGR AAACLIAAGR FREQIEFHET RLAERVTREP IEAAREASGR EGPPKRIAAE TGLAETVASA EETGDVLRAH AGPTIAATRI ASAPPPDAAL VDRWEIDTGA TVRLYEGAGP LRTYHLTPPS ARLDDEAVER LADAKDRLLD DPTGGDRAPG RAVRSIASAG DPVSTLVDVL RRHTRGYGAF EHVFADDRVS DATLSAPVAE NPLRVVVDGE RCRTNVRLPP EGAATLASRL RRTSGRGFSR ASPTLDATIE AEAGRVRVAA TTAPASDGLS FTFRRGDPDA WTLARLVDGG TITPAAAGLL SVAVKRGVTG LVAGGRGAGK TTALGSLLWE LPAETRSILI EDTPELPAAA VTAAGRDTQR LRVGDGAEPS PSEAVRTALR LGGGAIVVGE VRGEEAAALY EAMRVGAAGE AVLGTIHGED PAAVRERVVT DLGVSPSSFA ATDLIVVLDD HRVETIAEVV GHDGEASFEP LFERTGSGLV STGRIDRGES RLVESLAETD ESYASVRDAV DRRGERIGEA ARTGRITPER YVGRDGDGWQ RTDSLGGGNQ
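The gene and protein sequences above can be compared by translating codons directly. The following is a minimal Python sketch, not part of the original record. It assumes the gene sequence is listed 5' to 3' in the coding orientation (it begins with ATG and ends with TGA) and that the spaces between 10-base blocks are only formatting. Translation table 11 assigns the same amino acids to codons as the standard genetic code; it differs in the start codons it permits, which this sketch does not model. The names `translate` and `gc_fraction` are hypothetical.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G);
# translation table 11 uses the same codon-to-amino-acid mapping.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def clean(seq):
    """Remove whitespace (the record groups bases in blocks of 10) and uppercase."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


# First three 10-base blocks and the final bases of the gene sequence above.
head = "ATGAGCTTCG ATCTCCCTTC GCTGTCGCGG"
tail = "GGCAACCAA TGA"

print(translate(head))             # MSFDLPSLSR
print(translate(tail))             # GNQ
print(f"{gc_fraction(head):.0%}")  # 60%
```

The two translated fragments match the first ten and last three residues of the protein sequence. The 60% figure applies only to the 30-base fragment; running `gc_fraction` on the full gene sequence would be needed to compare against the reported GC content.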