Gene Information |
Locus tag | Pars_0480 |
Symbol | |
ID | 5055937 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | + |
Start bp | 422904 |
End bp | 425954 |
Gene Length | 3051 bp |
Protein Length | 1016 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 640468045 |
Product | peptidase S41 |
Protein accession | YP_001152730 |
Protein GI | 145590728 |
COG category | [S] Function unknown |
COG ID | [COG4946] Uncharacterized protein related to the periplasmic component of the Tol biopolymer transport system |
TIGRFAM ID | |
|
|
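The length fields in the table above are consistent with each other if the start and end positions are read as 1-based, inclusive coordinates and the CDS ends in a single stop codon. That reading is an assumption, not something the record states. A minimal sketch of the arithmetic, using hypothetical helper names `gene_length_bp` and `protein_length_aa`:

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the length is a whole number of codons and, by default,
    that the final codon is a stop codon that encodes no residue.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the Gene Information table.
length = gene_length_bp(422904, 425954)
print(length)                     # 3051
print(protein_length_aa(length))  # 1016
```

Both results match the Gene Length (3051 bp) and Protein Length (1016 aa) fields.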
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.359611 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAGGCT ATTACCTATT CCCCGACATT CACGGCGACG AAATCGTATT CGTCACAGAA GACGACTTGT GGCGATACAG AGGCGGGGCT GGCCAGAGGC TCACCTCAGA CTTCGGCGTG GTTATAAGGC CGAAGTTCTC GCCTGACGGG AAGTGGATAG CCTTTACGAG GCTACAACAG ACTGACCAGG GCACGACGTC GGACGTCTAT GTAATACCTG CGGAGGGGGG CGAGCCGAGG CGGCTGACCT ACTTCGGCAC TCCCTTCACG AGGGTGGTCG GTTGGACGCC TGACGGCCGG GTGTTGGCCT ACAGCGACTA CAAGACGCCG TTTCCCCAGT GGCGGGAGCT ATATGCAATA GCCCTAGACG GGACGTACGA GAGGCTAAAC CTGGGCCCAG CCACGGCCCT GGTCTACGGC GACGGCGGCG TAGTTGTCCT CGGCAGGAAC AACTACGAGC TCCCCCACTG GAAGAGGTAC AGAGGCGGCG CGAGGGGAGT TTTGTGGATT AGCAGAGACG GCGGCAAGAC CTTCGCCAAG TTCCTAGACC TCCCCGGGAA CATCACCTCG CCGATGATAG TGGGCGGCCG GGTGTTCTTC GTCTCCGACC ACGAGGGGGT GGGCAACCTC TACTCAGTTG ATTTGTCGGG AGGCGACTTG AGGCGCCACA CCAACTTCAC CGACTTCTAC GTGAGGAACG CCAGCTCCGA CGGTCGGCGC ATTGTCTTCC AAGTCGCCGG CGATATCTGG CTGTACGACC CAGCCGCCGA CAAGCTGGAG AGGCTGGACA TAGACCTCCC CCTCTCCCGC AAGGCGAAGA TGGCAAAATT CGTAGACCCC CTCAAATACC TGGAGTACTT CGCCCTGGCG TCTGGGGAGA GACTGGCCGT AATAACGAGA GGTCAGGCCT TCCTTGTGCC CAGCTGGGAG GGGGCCGTGG TTCAGCTCGG CGAGAGGGGC GGAGGGGTGA GGTACAAGCA CGTCGCCACC GACGGGGAGA AAATCGCAGT GGCCACCTAC GACGGCGCTG TGGAGGTCTA CTCCATAGAC GGCAGGATTG TCAAGAGGAT AGAGCCCGGG GTAGGCCTCG TGGAGGCCCT GGCGGTGAAG GGCTCTAAAA TCGCCTTGGC GAACCACCGG GGGGAGCTCT GGATCTTGGA CTTAGAGAGC GGCGCCGCGT CGCTGGTGGA TAGGAGCGAG TACGGGCTGA TTACGGAGAT GGCTTGGCAC CCGTCGGGGA GGTGGCTGGC CTACGCCAAG CCCGCCGGAG TGTACGCCCA AAACATCCGC CTCCTCGACG CAACCACCGG GAAGGCCTAC GACGTGACGT CGCCCACCGC CTACGACTAC TCGCCGGCGT TTGACCCCCA TGGAAGGTAT CTCTACTTCC TGTCAAGGAG GGCGCTGAAC CCAGCGCTTG ACCCCGTGCA GTTCGTCTAC TCCTTCGCCA AGCACTCAAA ACCATACCTA GTAGTGTTGA GGAAAGACGA CGCATCGCCC TTTGTCGAAT ACAAGAGACG GGAGGAGAAG GTGGAGGACA TAGACGTGGA GGGGATTGAG AGGAGAGTGG AGCCGTTCCC CGTTGAGGAG GGGCTCTACT CCGCCGTGGT TGGCCTAAAG GGCGGGAAGG TGGCGTGGCT AAAATACGAT GTCGAGGGGG CCCTGAGGTA CTACCTCTGG TCTGCGCAGG AGCGGCGGGG CGCCGTCGAG GTCTACGACT TGGAGACGAA GATGAGGGAG CAACTGATCT CCGGCGTCTC GGCGATGAGG GCCTCCCCAG ACGGGAAGTA CCTCCTGGTC 
AAGGAGGAGA ATAGGCTACG CCTTATCGAC ATCGAGAAGA AGCCCGACGT CCAGTCGAGG GAGCCGGGGA GGAAGTCAGG CGTGTTGGAC ATGGGGCGGG TTAAGGTGTA TGTCGAGCCG GAAAAGGAAT GGAGGCAGAT GTTGCGGGAG GCGTGGTTAC TCATGAGGGA GAACTACTGG AAGGGGGATA TGAACGGCGT TGATTGGGAC GCCGTTTACA AGAAGTACGA GCCGCTCCTA GAAAGGGCGG GCACCAGGTA CGAGCTGAGC GACGTGATTA ACGAGATGCA GGGAGAGCTG GGGACGAGCC ACGCCTACGA GATCGTGCCT GATTTCGAGG TGGATAAGCC GTACCTCGTC GGGGGGCTCG GCGCCGAGTA CAAGTGGGAT GGGAAGTGCT GGCGCATCGT AAAGATATTT GCGGGGGATC CCTCTTACGA GAACGAGAAG TCGCCCCTAC TGGCGCCGGG GGTGGACGTG AGGGAGGGCG ACTGCCTCGT CTCCATCGCT GGAGTCAGGC TCGGGCCGGG GGCGCCGCCG GAGTACGCCC TCCTCAACCG CCCTGGCGAC GTCGTCGCTA TAGAGGTAGA TCGGGGCGGC GAGGTGAGGA CCTACGTCGT GAGGACGGTG CGCGACGAGA AGTACCTAAT ATACCGCCAC TGGGTGGAGG AGAACAGGCG GAAGGTGCAT AAAGCCACTT GGGGCCGGGT TGGCTATATC CACATCCCAG ACATGGGGCC TGCGGGTTAC GCCGAGTTCT TTAAATCCCT AAACGCCGAT GGCGACAAGG AGGCCTTTAT CATCGACATC CGTTACAACC GGGGCGGCCA CACGTCGGGG ATGCTGGTGC CGAGGATATG CGTCGGCGTT TTTGGGAAGT TCCTCACCCG CCACTTCAAG CCGTTTCCCT ACCCGGAGCT TGTACTGCCT AAAAAACTTG TGCTGGTGAC TAACGAACAC GCGGGCTCAG ATGGCGACAT ATTTACATAC GACTTCAAAC ACCTAGGCCT TGGGCCCGTC GTCGGCAAAC GGACGTGGGG AGGTACTGTG GGCATAGACA CGAGATACAA GCTTGTTGAC GGCACCATTA TCACCCAGCC TAAATACGCC TTCTGGGGCG AGGGCGTTGG GACGGGAATA GAGGGCTACG GAGTAGACCC TGACATTGAG GTAGAGATCG CGCCGCAGGA CTACAGAGAG GGGAAAGACC CGCAATTAGA AAAAGCGCTA GAGATTTTCA AAGAGAGTTA G
|
Protein sequence | MKGYYLFPDI HGDEIVFVTE DDLWRYRGGA GQRLTSDFGV VIRPKFSPDG KWIAFTRLQQ TDQGTTSDVY VIPAEGGEPR RLTYFGTPFT RVVGWTPDGR VLAYSDYKTP FPQWRELYAI ALDGTYERLN LGPATALVYG DGGVVVLGRN NYELPHWKRY RGGARGVLWI SRDGGKTFAK FLDLPGNITS PMIVGGRVFF VSDHEGVGNL YSVDLSGGDL RRHTNFTDFY VRNASSDGRR IVFQVAGDIW LYDPAADKLE RLDIDLPLSR KAKMAKFVDP LKYLEYFALA SGERLAVITR GQAFLVPSWE GAVVQLGERG GGVRYKHVAT DGEKIAVATY DGAVEVYSID GRIVKRIEPG VGLVEALAVK GSKIALANHR GELWILDLES GAASLVDRSE YGLITEMAWH PSGRWLAYAK PAGVYAQNIR LLDATTGKAY DVTSPTAYDY SPAFDPHGRY LYFLSRRALN PALDPVQFVY SFAKHSKPYL VVLRKDDASP FVEYKRREEK VEDIDVEGIE RRVEPFPVEE GLYSAVVGLK GGKVAWLKYD VEGALRYYLW SAQERRGAVE VYDLETKMRE QLISGVSAMR ASPDGKYLLV KEENRLRLID IEKKPDVQSR EPGRKSGVLD MGRVKVYVEP EKEWRQMLRE AWLLMRENYW KGDMNGVDWD AVYKKYEPLL ERAGTRYELS DVINEMQGEL GTSHAYEIVP DFEVDKPYLV GGLGAEYKWD GKCWRIVKIF AGDPSYENEK SPLLAPGVDV REGDCLVSIA GVRLGPGAPP EYALLNRPGD VVAIEVDRGG EVRTYVVRTV RDEKYLIYRH WVEENRRKVH KATWGRVGYI HIPDMGPAGY AEFFKSLNAD GDKEAFIIDI RYNRGGHTSG MLVPRICVGV FGKFLTRHFK PFPYPELVLP KKLVLVTNEH AGSDGDIFTY DFKHLGLGPV VGKRTWGGTV GIDTRYKLVD GTIITQPKYA FWGEGVGTGI EGYGVDPDIE VEIAPQDYRE GKDPQLEKAL EIFKES
|
| |
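The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any computation. The following is a minimal sketch, not the database's own tooling, that strips the block spacing, computes a GC fraction, and translates a CDS. The function names are hypothetical. It uses the standard codon-to-amino-acid assignments, which translation table 11 shares; table 11's alternative start codons are not handled here, which does not matter for this gene because it starts with ATG.

```python
from itertools import product

# Standard codon assignments in TCAG order; '*' marks a stop codon.
_BASES = "TCAG"
_AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
          "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = dict(zip(("".join(c) for c in product(_BASES, repeat=3)), _AMINO))


def clean_sequence(raw: str) -> str:
    """Remove all whitespace (block spacing, line breaks) and upper-case."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate a cleaned CDS codon by codon, stopping at the first stop."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence shown above.
head = clean_sequence("ATGAAAGGCT ATTACCTATT CCCCGACATT")
print(translate(head))  # MKGYYLFPDI
```

Run on the full gene sequence, `translate` can be compared against the Protein sequence field, and `gc_fraction` against the GC content field, as a sanity check on the record.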