Gene Pars_0031 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_0031 
Symbol 
ID5054730 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp24477 
End bp25961 
Gene Length1485 bp 
Protein Length494 aa 
Translation table11 
GC content52% 
IMG OID640467611 
Producttype II secretion system protein E 
Protein accessionYP_001152300 
Protein GI145590298 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG0630] Type IV secretory pathway, VirB11 components, and related ATPases involved in archaeal flagella biosynthesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones55 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTTGACT TCCGGCTAGA GCCTTGCCAG GGAGTTGTGG TTGAGGAGTA TCAATCCGAG 
GGCGCAAGGG TACAGATATA CCAGTCGGAG GATGGGTACT GCTACAATGT GGCTTATGCC
TTTCCATACA GCGAGAAGAT CGGTGAGTAT GCCTACAAGG TCGTCGAATA CCTGACAGCC
AACCAGTTCA TACGTCCCAA CATAACAAGG GAGGAGTTGG GAAAGCTTAT CGAAATGGCC
ATGACTGATA TAGGAGTGCC TAAACAGCTA CGCGCGGCTG TGCGGTACTA CGTCCAGCTC
GAAGCAGCCG ACTATTCCTA TCTCACGCCC ATACTGTACG ACATAAGGCT CGAGAACATC
AACATCAACG GCACGGAAAA TCCCATCTTT GTCGACCACA GGGACTACGG CTACAACATA
AAGACTAACG TAATCCCGAC TAACAAGGAG ACGCTTATTA AAATCGTGGG CAGAGTGTAC
GCAGAGACCG GGAGGCCGCT TAACGAGCAG TACCCCATCC AAGACACGTA CATTAGGCTG
AGGAACGGCG CGTTGCTCCG CTTCGCCACC GCCATGTCTG GCCGCGTGGC AAGAAACCCG
CCGTATGTGT CTGTACGTGT GCAACCGCCG TTCCCCATCT CGCCTACTGA GCTGATAAAG
AGAAAAACCA TATCGCCTCT CGCCATGGCG TACCTCTGGT ACATGTTCGA GCACCACAAA
TCGGTGATGG TTATCGGGGG CACAGGCACA GGCAAAACCA CCTTGCTTAA CGCGTTGATG
GTACTACTGC CACATAAACG CCTGGCAATC GCAGAGGAGA CACCGGAAAT TAGAGTGCCG
CCGAGCTTCC AGAACGTTGT CATGCTCTTC ACATCGCCAA TGTACGACTA CATGAAAAAC
CTCCCCGGCT CAGAGTCGGC TATATACCTA ATTGACCTCG TGAAGTATCT CCTGAGGGCT
AGACCTGACA TAATCGTAAT CGGCGAGAGC CGCGGCAGAG AGATCCACGA GCTTATACAA
GGCGTCCTCA CAGGCCACGG CGGTGCTACC ACCTTCCACG CCGAGGACAT CATGGAGGTG
TTTATGAGGC TGACAGGAGA GGCTATAGGC GTGTCCTCCG AACATCTCTC GGCTTTCCAC
GTACTCGCAA CTATTAGAAG GTTCGACTTC GGCAGACGTG TCACTTCTAT TACAGAGGTT
GTGTGGCTGA GGGCGTACCC CTACGCCGCG CCTGGAAAAG TAAAAATCAA AGATGAAGAA
TTCGGGCTGA TAAACGTGGG CTGGTACGAC CCGCGGACAG ATACTGTGGA AATAGATCTC
AGAAGATCGT ACTGGTTGCA AAAAATAGGG GGCTACGAGG AGATACTTGA AAGAGCAAAA
TTCCTAACAG CATTGGTAGA GAGAGGTGTG ATCGATGCCG AGAAAGTGGC AGAAGCCGTA
AGGGAGTACT ACAGAGAGAA GCACGCGCTG CTAAAGAAAG TCTAG
 
Protein sequence
MLDFRLEPCQ GVVVEEYQSE GARVQIYQSE DGYCYNVAYA FPYSEKIGEY AYKVVEYLTA 
NQFIRPNITR EELGKLIEMA MTDIGVPKQL RAAVRYYVQL EAADYSYLTP ILYDIRLENI
NINGTENPIF VDHRDYGYNI KTNVIPTNKE TLIKIVGRVY AETGRPLNEQ YPIQDTYIRL
RNGALLRFAT AMSGRVARNP PYVSVRVQPP FPISPTELIK RKTISPLAMA YLWYMFEHHK
SVMVIGGTGT GKTTLLNALM VLLPHKRLAI AEETPEIRVP PSFQNVVMLF TSPMYDYMKN
LPGSESAIYL IDLVKYLLRA RPDIIVIGES RGREIHELIQ GVLTGHGGAT TFHAEDIMEV
FMRLTGEAIG VSSEHLSAFH VLATIRRFDF GRRVTSITEV VWLRAYPYAA PGKVKIKDEE
FGLINVGWYD PRTDTVEIDL RRSYWLQKIG GYEEILERAK FLTALVERGV IDAEKVAEAV
REYYREKHAL LKKV