Gene Pars_0480 details


Gene Information

Locus tag: Pars_0480
Symbol:
ID: 5055937
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 422904
End bp: 425954
Gene Length: 3051 bp
Protein Length: 1016 aa
Translation table: 11
GC content: 60%
IMG OID: 640468045
Product: peptidase S41
Protein accession: YP_001152730
Protein GI: 145590728
COG category: [S] Function unknown
COG ID: [COG4946] Uncharacterized protein related to the periplasmic component of the Tol biopolymer transport system
TIGRFAM ID:
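
The Start bp, End bp, Gene Length, and Protein Length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the gene length includes the stop codon; neither assumption is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 422904
end_bp = 425954
gene_length_bp = 3051
protein_length_aa = 1016

# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted as a residue.
codons = span_bp // 3
expected_protein_aa = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("span is a whole number of codons:", span_bp % 3 == 0)
print("protein length matches:", expected_protein_aa == protein_length_aa)
```

Under these assumptions the three fields are mutually consistent: 3051 bp is 1017 codons, which is 1016 residues plus one stop codon.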


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.359611
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAGGCT ATTACCTATT CCCCGACATT CACGGCGACG AAATCGTATT CGTCACAGAA 
GACGACTTGT GGCGATACAG AGGCGGGGCT GGCCAGAGGC TCACCTCAGA CTTCGGCGTG
GTTATAAGGC CGAAGTTCTC GCCTGACGGG AAGTGGATAG CCTTTACGAG GCTACAACAG
ACTGACCAGG GCACGACGTC GGACGTCTAT GTAATACCTG CGGAGGGGGG CGAGCCGAGG
CGGCTGACCT ACTTCGGCAC TCCCTTCACG AGGGTGGTCG GTTGGACGCC TGACGGCCGG
GTGTTGGCCT ACAGCGACTA CAAGACGCCG TTTCCCCAGT GGCGGGAGCT ATATGCAATA
GCCCTAGACG GGACGTACGA GAGGCTAAAC CTGGGCCCAG CCACGGCCCT GGTCTACGGC
GACGGCGGCG TAGTTGTCCT CGGCAGGAAC AACTACGAGC TCCCCCACTG GAAGAGGTAC
AGAGGCGGCG CGAGGGGAGT TTTGTGGATT AGCAGAGACG GCGGCAAGAC CTTCGCCAAG
TTCCTAGACC TCCCCGGGAA CATCACCTCG CCGATGATAG TGGGCGGCCG GGTGTTCTTC
GTCTCCGACC ACGAGGGGGT GGGCAACCTC TACTCAGTTG ATTTGTCGGG AGGCGACTTG
AGGCGCCACA CCAACTTCAC CGACTTCTAC GTGAGGAACG CCAGCTCCGA CGGTCGGCGC
ATTGTCTTCC AAGTCGCCGG CGATATCTGG CTGTACGACC CAGCCGCCGA CAAGCTGGAG
AGGCTGGACA TAGACCTCCC CCTCTCCCGC AAGGCGAAGA TGGCAAAATT CGTAGACCCC
CTCAAATACC TGGAGTACTT CGCCCTGGCG TCTGGGGAGA GACTGGCCGT AATAACGAGA
GGTCAGGCCT TCCTTGTGCC CAGCTGGGAG GGGGCCGTGG TTCAGCTCGG CGAGAGGGGC
GGAGGGGTGA GGTACAAGCA CGTCGCCACC GACGGGGAGA AAATCGCAGT GGCCACCTAC
GACGGCGCTG TGGAGGTCTA CTCCATAGAC GGCAGGATTG TCAAGAGGAT AGAGCCCGGG
GTAGGCCTCG TGGAGGCCCT GGCGGTGAAG GGCTCTAAAA TCGCCTTGGC GAACCACCGG
GGGGAGCTCT GGATCTTGGA CTTAGAGAGC GGCGCCGCGT CGCTGGTGGA TAGGAGCGAG
TACGGGCTGA TTACGGAGAT GGCTTGGCAC CCGTCGGGGA GGTGGCTGGC CTACGCCAAG
CCCGCCGGAG TGTACGCCCA AAACATCCGC CTCCTCGACG CAACCACCGG GAAGGCCTAC
GACGTGACGT CGCCCACCGC CTACGACTAC TCGCCGGCGT TTGACCCCCA TGGAAGGTAT
CTCTACTTCC TGTCAAGGAG GGCGCTGAAC CCAGCGCTTG ACCCCGTGCA GTTCGTCTAC
TCCTTCGCCA AGCACTCAAA ACCATACCTA GTAGTGTTGA GGAAAGACGA CGCATCGCCC
TTTGTCGAAT ACAAGAGACG GGAGGAGAAG GTGGAGGACA TAGACGTGGA GGGGATTGAG
AGGAGAGTGG AGCCGTTCCC CGTTGAGGAG GGGCTCTACT CCGCCGTGGT TGGCCTAAAG
GGCGGGAAGG TGGCGTGGCT AAAATACGAT GTCGAGGGGG CCCTGAGGTA CTACCTCTGG
TCTGCGCAGG AGCGGCGGGG CGCCGTCGAG GTCTACGACT TGGAGACGAA GATGAGGGAG
CAACTGATCT CCGGCGTCTC GGCGATGAGG GCCTCCCCAG ACGGGAAGTA CCTCCTGGTC
AAGGAGGAGA ATAGGCTACG CCTTATCGAC ATCGAGAAGA AGCCCGACGT CCAGTCGAGG
GAGCCGGGGA GGAAGTCAGG CGTGTTGGAC ATGGGGCGGG TTAAGGTGTA TGTCGAGCCG
GAAAAGGAAT GGAGGCAGAT GTTGCGGGAG GCGTGGTTAC TCATGAGGGA GAACTACTGG
AAGGGGGATA TGAACGGCGT TGATTGGGAC GCCGTTTACA AGAAGTACGA GCCGCTCCTA
GAAAGGGCGG GCACCAGGTA CGAGCTGAGC GACGTGATTA ACGAGATGCA GGGAGAGCTG
GGGACGAGCC ACGCCTACGA GATCGTGCCT GATTTCGAGG TGGATAAGCC GTACCTCGTC
GGGGGGCTCG GCGCCGAGTA CAAGTGGGAT GGGAAGTGCT GGCGCATCGT AAAGATATTT
GCGGGGGATC CCTCTTACGA GAACGAGAAG TCGCCCCTAC TGGCGCCGGG GGTGGACGTG
AGGGAGGGCG ACTGCCTCGT CTCCATCGCT GGAGTCAGGC TCGGGCCGGG GGCGCCGCCG
GAGTACGCCC TCCTCAACCG CCCTGGCGAC GTCGTCGCTA TAGAGGTAGA TCGGGGCGGC
GAGGTGAGGA CCTACGTCGT GAGGACGGTG CGCGACGAGA AGTACCTAAT ATACCGCCAC
TGGGTGGAGG AGAACAGGCG GAAGGTGCAT AAAGCCACTT GGGGCCGGGT TGGCTATATC
CACATCCCAG ACATGGGGCC TGCGGGTTAC GCCGAGTTCT TTAAATCCCT AAACGCCGAT
GGCGACAAGG AGGCCTTTAT CATCGACATC CGTTACAACC GGGGCGGCCA CACGTCGGGG
ATGCTGGTGC CGAGGATATG CGTCGGCGTT TTTGGGAAGT TCCTCACCCG CCACTTCAAG
CCGTTTCCCT ACCCGGAGCT TGTACTGCCT AAAAAACTTG TGCTGGTGAC TAACGAACAC
GCGGGCTCAG ATGGCGACAT ATTTACATAC GACTTCAAAC ACCTAGGCCT TGGGCCCGTC
GTCGGCAAAC GGACGTGGGG AGGTACTGTG GGCATAGACA CGAGATACAA GCTTGTTGAC
GGCACCATTA TCACCCAGCC TAAATACGCC TTCTGGGGCG AGGGCGTTGG GACGGGAATA
GAGGGCTACG GAGTAGACCC TGACATTGAG GTAGAGATCG CGCCGCAGGA CTACAGAGAG
GGGAAAGACC CGCAATTAGA AAAAGCGCTA GAGATTTTCA AAGAGAGTTA G
 
Protein sequence
MKGYYLFPDI HGDEIVFVTE DDLWRYRGGA GQRLTSDFGV VIRPKFSPDG KWIAFTRLQQ 
TDQGTTSDVY VIPAEGGEPR RLTYFGTPFT RVVGWTPDGR VLAYSDYKTP FPQWRELYAI
ALDGTYERLN LGPATALVYG DGGVVVLGRN NYELPHWKRY RGGARGVLWI SRDGGKTFAK
FLDLPGNITS PMIVGGRVFF VSDHEGVGNL YSVDLSGGDL RRHTNFTDFY VRNASSDGRR
IVFQVAGDIW LYDPAADKLE RLDIDLPLSR KAKMAKFVDP LKYLEYFALA SGERLAVITR
GQAFLVPSWE GAVVQLGERG GGVRYKHVAT DGEKIAVATY DGAVEVYSID GRIVKRIEPG
VGLVEALAVK GSKIALANHR GELWILDLES GAASLVDRSE YGLITEMAWH PSGRWLAYAK
PAGVYAQNIR LLDATTGKAY DVTSPTAYDY SPAFDPHGRY LYFLSRRALN PALDPVQFVY
SFAKHSKPYL VVLRKDDASP FVEYKRREEK VEDIDVEGIE RRVEPFPVEE GLYSAVVGLK
GGKVAWLKYD VEGALRYYLW SAQERRGAVE VYDLETKMRE QLISGVSAMR ASPDGKYLLV
KEENRLRLID IEKKPDVQSR EPGRKSGVLD MGRVKVYVEP EKEWRQMLRE AWLLMRENYW
KGDMNGVDWD AVYKKYEPLL ERAGTRYELS DVINEMQGEL GTSHAYEIVP DFEVDKPYLV
GGLGAEYKWD GKCWRIVKIF AGDPSYENEK SPLLAPGVDV REGDCLVSIA GVRLGPGAPP
EYALLNRPGD VVAIEVDRGG EVRTYVVRTV RDEKYLIYRH WVEENRRKVH KATWGRVGYI
HIPDMGPAGY AEFFKSLNAD GDKEAFIIDI RYNRGGHTSG MLVPRICVGV FGKFLTRHFK
PFPYPELVLP KKLVLVTNEH AGSDGDIFTY DFKHLGLGPV VGKRTWGGTV GIDTRYKLVD
GTIITQPKYA FWGEGVGTGI EGYGVDPDIE VEIAPQDYRE GKDPQLEKAL EIFKES
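
The gene and protein sequences above are printed in space-separated blocks of ten characters. As a rough illustration of how the two relate, the sketch below strips that spacing and translates the first line of the gene sequence. It assumes the standard codon assignments, which translation table 11 shares for elongation codons; the helper names `clean` and `translate` are hypothetical and not part of any database tooling.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence and first two blocks of the protein sequence.
gene_first_line = "ATGAAAGGCT ATTACCTATT CCCCGACATT CACGGCGACG AAATCGTATT CGTCACAGAA"
protein_start = clean("MKGYYLFPDI HGDEIVFVTE")

print(translate(gene_first_line))
print(translate(gene_first_line) == protein_start)
```

The same functions can be applied to the full gene sequence by pasting all of its lines into one string; `clean` discards the spaces and newlines.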