Gene Pars_1781 details

Gene Information

Locus tag: Pars_1781
Symbol:
ID: 5055542
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 1602668
End bp: 1605001
Gene Length: 2334 bp
Protein Length: 777 aa
Translation table: 11
GC content: 54%
IMG OID: 640469326
Product: oligosaccharyl transferase, STT3 subunit
Protein accession: YP_001153984
Protein GI: 145591982
COG category: [R] General function prediction only
COG ID: [COG1287] Uncharacterized membrane protein, required for N-linked glycosylation
TIGRFAM ID:
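
The coordinates and lengths listed above are related by simple arithmetic, which the following Python sketch makes explicit. The function names are illustrative and not part of any database API. The sketch assumes 1-based inclusive coordinates and an unspliced CDS that ends in a single stop codon not counted in the protein length.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


length_bp = gene_length(1602668, 1605001)
print(length_bp)                  # 2334
print(protein_length(length_bp))  # 777
```

Both results agree with the Gene Length and Protein Length fields of this record.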


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.0356227
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATAGGA GGGAGAAGCT TCTGGTTACG GTTCCGGCCC TTATTGCCGC CTTCGCTATT 
GCGCTATACA TGAGGATGTA TAGAGTTTTT GCATGGGGAT GGTGGCTTGA CGAGTTTGAC
CCCTATATCC GCTACTATTT GGCCAAGTAC ACACTTGAGC ACGGGCCTGG CTGGTGGTGG
AGCGGCGCCC ACTTTACGCA GTTCTGGTAC CCCTTCGGCG TAGATTGGGG AAAAGTCCTT
CTGCCCGGTA CCTCGTTCTA TGGGCTTTTC ACGTACTTTC TCCTGAGTCC TTTCGGAGTT
GACCTATGGC ACGCGGTTAT CGCCGCGCCG GCTATCATGA ACGCCTTGGC TGTATTCTCC
ATGTTTTACC TTGGCTACAG GATAGGGGAG GAGCTGGGTA TTCCAAATGG CGGGCTGAGA
GTAGGCCTCG CCGCGTCGCT CCTTGTCGCC ATAATGCCGG CTTACATAGA GCGAGGCTCT
GCTACGTGGT TTGATGACGA GCCGATTAGC CTGTTCCTAA TACCCCTCGG CCTCGCGCTG
TTGATAGACG GTTTGAGGCG GCCGTGGTTG GGGGCGGCGG CGGGGGCCGC CTTGGGCTAC
ATCGCATGGA CTTGGGGAGC TCACTTCTAT ATCTGGAACT TAGTGGGCCT ATACGCCGTT
GTCCTCCCAA CTTACTACTT CCTAAAACTT GCCTTTGCGA GAAGCGAGAG GCCCTCAAAG
GTTGGAAAGC GGCAGCCGGC TCCCCAGCCG CTGAGCCTCC CCTTCAACAC AAGAAACTTC
TTCGTATCCT ATGCTCTGTT CTACGTGGTC TACGCCGCGT TCGTGGCTAC TATCCCCCGC
TACGGGGTCA GCAGTTTGCT GAGCGCATTT AATATCTTGC CCACCTTCGG CCTATTCGTC
GCTCTTGCCA CCTGGCTCTT GGACTCAAGG CTAGGCGTAA CCCGAACAAC GCAGATGCTG
AGAAGGTACG TATGGTTGCT CGTGGGCGCT GCGGCTCTGG GCATAGTCTT GTTGTTAGTG
GCCATAGCCA CTGGGCAAAT AGGCGGCAAG TTCTTGGCCA CTCTTCTCCC AGTTGGGCGC
TCAGCGATTG TAGCAAGCGT CGCCGAGCAC AGCACTACCC AATTTATCCA GGTCCTCATT
AGATACGGCC CCATCCTGCC CTTCATAGTA GTCTCTATTC CCCACCTCTT TGCCACTGGC
GGCCTAATGG CGTTGACGTA TTTAGTAACC GGGGGATACG CCTCGGCCAC TATGGTGCGT
CTTTTGGTGT TGCTGGCGCC GGTGGCAGCA GTGACGGCAG CTGTGGGAAT AGTTAAGTTA
GTAAATAATA GAAGGCTTGG CTATCTTGCC CTACTTGCAT CGGCTATCTC GCTTATCATA
CTTACAAACT CATCGTTTGG GATCGCCTCG CAACCAACCC AGATCGTCAC TTCAGCTGTA
GGCGCAGTAA GCGACGACTT TCTCGACGCG TTAATGTGGC TTAAGACTAG GCTACCTGCC
TACGAGCCTG TGGCGTCTTG GTGGGACTAC GGCTACTGGA TATCCATAAT TGGGAATAAG
ACAAGCCTGG CGGACAACTC TACTATCAAC GCCACCCAGA TCGGGATGAT CGGTCTCGGC
ATGATGGCGC CGCCGCAGAT CGGCACTAAA ATATTCGCCG ACAACTTCCA TACCAAGTAC
ATACTGGCCA TAATGCCCTA CGCCGTGTAT CCCATAAACC TGCAACAGTA TGGGACCGTC
CCTGTGCTCG TACACGAATA TCCCCCCGGC GGCGACTTTC TCAAATCGTA CTGGATGACG
AGGATAGCTC TTGAAAACGT GGCGGCGGCT AGGGCTATTT TAGGCGCGCC GGCTGGAGTC
TCCGCAGACG ACTTTATATA TACAAGGGTC ATGTCTTACC CCGGCCCCCC GCCGGCTAAC
TACGCCTACT TCACCATAGG CCAGAGCTAT TATCCTGTAC CCATTAGCCT AAACAGGACT
CTATACACAA TGCTGTTCTC CAAGGTGAGG TTGTTCCCAA TCTACGGCAA CGATTCACAG
TACCTTTCCT GGATTTTTGA GGGGGTACAC ACTGGTGTTG GTTTCACCTC GTTTAGAAGC
CTAAAATTAG TTGGAGGCAA CGTCTATCTC CCTGTAGATC TGACCCAGAA ACTCCAGGGC
ATCGCGACCT ACGTCATGCA AGGCTGGACA CAGGGCGGTG TCGACGTGGT TGTAGATCCG
CTGACCGGCA AGACGGAGCG CATACCCTAT ATCACAGTGG TGCCCCAGAA TCTGAGGCCT
GTCTACGTCT CGAAGCCCTA CGGCTGGGTT GTTATATACC AGGTATCGTC GTAA
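
The gene sequence is printed in space-separated blocks of ten bases, so the whitespace has to be removed before any composition is computed. The sketch below shows one way a GC fraction like the one reported for this gene could be calculated. The function name `gc_fraction` is an assumption for illustration, and only the first two blocks of the sequence are used as input.

```python
def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First two ten-base blocks of the gene sequence above.
print(gc_fraction("ATGGATAGGA GGGAGAAGCT"))  # 0.5
```

Passing the complete gene sequence to the same function gives the whole-gene value.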
 
Protein sequence
MDRREKLLVT VPALIAAFAI ALYMRMYRVF AWGWWLDEFD PYIRYYLAKY TLEHGPGWWW 
SGAHFTQFWY PFGVDWGKVL LPGTSFYGLF TYFLLSPFGV DLWHAVIAAP AIMNALAVFS
MFYLGYRIGE ELGIPNGGLR VGLAASLLVA IMPAYIERGS ATWFDDEPIS LFLIPLGLAL
LIDGLRRPWL GAAAGAALGY IAWTWGAHFY IWNLVGLYAV VLPTYYFLKL AFARSERPSK
VGKRQPAPQP LSLPFNTRNF FVSYALFYVV YAAFVATIPR YGVSSLLSAF NILPTFGLFV
ALATWLLDSR LGVTRTTQML RRYVWLLVGA AALGIVLLLV AIATGQIGGK FLATLLPVGR
SAIVASVAEH STTQFIQVLI RYGPILPFIV VSIPHLFATG GLMALTYLVT GGYASATMVR
LLVLLAPVAA VTAAVGIVKL VNNRRLGYLA LLASAISLII LTNSSFGIAS QPTQIVTSAV
GAVSDDFLDA LMWLKTRLPA YEPVASWWDY GYWISIIGNK TSLADNSTIN ATQIGMIGLG
MMAPPQIGTK IFADNFHTKY ILAIMPYAVY PINLQQYGTV PVLVHEYPPG GDFLKSYWMT
RIALENVAAA RAILGAPAGV SADDFIYTRV MSYPGPPPAN YAYFTIGQSY YPVPISLNRT
LYTMLFSKVR LFPIYGNDSQ YLSWIFEGVH TGVGFTSFRS LKLVGGNVYL PVDLTQKLQG
IATYVMQGWT QGGVDVVVDP LTGKTERIPY ITVVPQNLRP VYVSKPYGWV VIYQVSS
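
The protein sequence can be cross-checked against the gene sequence by translating the latter. The following Python sketch is a minimal illustration, not the method used by the database. It assumes the standard codon assignments, which translation table 11 shares for all codons after the start codon, and the names `CODON_TABLE`, `clean_sequence`, and `translate` are hypothetical helpers.

```python
BASES = "TCAG"
# Amino acids in NCBI order (first, second, third base each cycling T, C, A, G).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to print ten-letter blocks."""
    return "".join(text.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean_sequence(dna)
    residues = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        residues.append(residue)
    return "".join(residues)


# First 30 bases of the gene sequence give the first ten residues.
print(translate("ATGGATAGGA GGGAGAAGCT TCTGGTTACG"))  # MDRREKLLVT
# Last 24 bases, including the TAA stop codon, give the last seven residues.
print(translate("GTTATATACC AGGTATCGTC GTAA"))        # VIYQVSS
```

Both fragments match the start and end of the protein sequence listed above. Note that this sketch does not handle alternative start codons, which table 11 permits but which do not occur here because the gene begins with ATG.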