Gene Information |
Locus tag | Pars_0031 |
Symbol | |
ID | 5054730 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | + |
Start bp | 24477 |
End bp | 25961 |
Gene Length | 1485 bp |
Protein Length | 494 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 640467611 |
Product | type II secretion system protein E |
Protein accession | YP_001152300 |
Protein GI | 145590298 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG0630] Type IV secretory pathway, VirB11 components, and related ATPases involved in archaeal flagella biosynthesis |
TIGRFAM ID | |
|
|
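The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that Protein Length excludes the terminal stop codon; the record does not state either convention, but both reproduce the reported 1485 bp and 494 aa. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record for Pars_0031.
start_bp, end_bp = 24477, 25961

length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1485 494
```

The same check applies to any unspliced CDS on either strand, since the length depends only on the coordinate span.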
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 55 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTTGACT TCCGGCTAGA GCCTTGCCAG GGAGTTGTGG TTGAGGAGTA TCAATCCGAG GGCGCAAGGG TACAGATATA CCAGTCGGAG GATGGGTACT GCTACAATGT GGCTTATGCC TTTCCATACA GCGAGAAGAT CGGTGAGTAT GCCTACAAGG TCGTCGAATA CCTGACAGCC AACCAGTTCA TACGTCCCAA CATAACAAGG GAGGAGTTGG GAAAGCTTAT CGAAATGGCC ATGACTGATA TAGGAGTGCC TAAACAGCTA CGCGCGGCTG TGCGGTACTA CGTCCAGCTC GAAGCAGCCG ACTATTCCTA TCTCACGCCC ATACTGTACG ACATAAGGCT CGAGAACATC AACATCAACG GCACGGAAAA TCCCATCTTT GTCGACCACA GGGACTACGG CTACAACATA AAGACTAACG TAATCCCGAC TAACAAGGAG ACGCTTATTA AAATCGTGGG CAGAGTGTAC GCAGAGACCG GGAGGCCGCT TAACGAGCAG TACCCCATCC AAGACACGTA CATTAGGCTG AGGAACGGCG CGTTGCTCCG CTTCGCCACC GCCATGTCTG GCCGCGTGGC AAGAAACCCG CCGTATGTGT CTGTACGTGT GCAACCGCCG TTCCCCATCT CGCCTACTGA GCTGATAAAG AGAAAAACCA TATCGCCTCT CGCCATGGCG TACCTCTGGT ACATGTTCGA GCACCACAAA TCGGTGATGG TTATCGGGGG CACAGGCACA GGCAAAACCA CCTTGCTTAA CGCGTTGATG GTACTACTGC CACATAAACG CCTGGCAATC GCAGAGGAGA CACCGGAAAT TAGAGTGCCG CCGAGCTTCC AGAACGTTGT CATGCTCTTC ACATCGCCAA TGTACGACTA CATGAAAAAC CTCCCCGGCT CAGAGTCGGC TATATACCTA ATTGACCTCG TGAAGTATCT CCTGAGGGCT AGACCTGACA TAATCGTAAT CGGCGAGAGC CGCGGCAGAG AGATCCACGA GCTTATACAA GGCGTCCTCA CAGGCCACGG CGGTGCTACC ACCTTCCACG CCGAGGACAT CATGGAGGTG TTTATGAGGC TGACAGGAGA GGCTATAGGC GTGTCCTCCG AACATCTCTC GGCTTTCCAC GTACTCGCAA CTATTAGAAG GTTCGACTTC GGCAGACGTG TCACTTCTAT TACAGAGGTT GTGTGGCTGA GGGCGTACCC CTACGCCGCG CCTGGAAAAG TAAAAATCAA AGATGAAGAA TTCGGGCTGA TAAACGTGGG CTGGTACGAC CCGCGGACAG ATACTGTGGA AATAGATCTC AGAAGATCGT ACTGGTTGCA AAAAATAGGG GGCTACGAGG AGATACTTGA AAGAGCAAAA TTCCTAACAG CATTGGTAGA GAGAGGTGTG ATCGATGCCG AGAAAGTGGC AGAAGCCGTA AGGGAGTACT ACAGAGAGAA GCACGCGCTG CTAAAGAAAG TCTAG
|
Protein sequence | MLDFRLEPCQ GVVVEEYQSE GARVQIYQSE DGYCYNVAYA FPYSEKIGEY AYKVVEYLTA NQFIRPNITR EELGKLIEMA MTDIGVPKQL RAAVRYYVQL EAADYSYLTP ILYDIRLENI NINGTENPIF VDHRDYGYNI KTNVIPTNKE TLIKIVGRVY AETGRPLNEQ YPIQDTYIRL RNGALLRFAT AMSGRVARNP PYVSVRVQPP FPISPTELIK RKTISPLAMA YLWYMFEHHK SVMVIGGTGT GKTTLLNALM VLLPHKRLAI AEETPEIRVP PSFQNVVMLF TSPMYDYMKN LPGSESAIYL IDLVKYLLRA RPDIIVIGES RGREIHELIQ GVLTGHGGAT TFHAEDIMEV FMRLTGEAIG VSSEHLSAFH VLATIRRFDF GRRVTSITEV VWLRAYPYAA PGKVKIKDEE FGLINVGWYD PRTDTVEIDL RRSYWLQKIG GYEEILERAK FLTALVERGV IDAEKVAEAV REYYREKHAL LKKV
|
| |
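The sequences above are printed in space-separated blocks of ten characters, so the spaces need to be stripped before any analysis. The following is a minimal sketch of that cleanup, plus a GC-fraction calculation and a stop-codon check. The stop codons listed are those of translation table 11; the helper names are hypothetical, and how the database rounds its reported GC content is not stated in the record. Only the first two and last two blocks of the gene sequence are used here, to keep the example short.

```python
# Stop codons under translation table 11.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the final three bases form a stop codon."""
    return seq[-3:] in STOP_CODONS


# First two blocks of the gene sequence in this record.
head = clean_sequence("ATGCTTGACT TCCGGCTAGA")
# Last two blocks of the gene sequence in this record.
tail = clean_sequence("CTAAAGAAAG TCTAG")

print(len(head), gc_fraction(head))  # 20 0.5
print(head.startswith("ATG"))        # True
print(ends_with_stop(tail))          # True
```

Passing the full Gene sequence field through `clean_sequence` and `gc_fraction` would give a value to compare against the GC content field.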