Gene Pars_2238 details


Gene Information

Locus tag: Pars_2238
Symbol:
ID: 5054332
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 2005483
End bp: 2006991
Gene Length: 1509 bp
Protein Length: 502 aa
Translation table: 11
GC content: 53%
IMG OID: 640469791
Product: peptidase M50
Protein accession: YP_001154436
Protein GI: 145592434
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0750] Predicted membrane-associated Zn-dependent proteases 1
TIGRFAM ID:
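
The start, end, and length fields in this record agree with each other, and the arithmetic can be sketched in a few lines. This is only an illustrative check: it assumes inclusive 1-based coordinates and an unspliced CDS ending in a single stop codon, and the function names are hypothetical rather than part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Coordinates taken from the record above.
length = gene_length(2005483, 2006991)
print(length, protein_length(length))  # prints: 1509 502
```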


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.577346
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value: 0.574396
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGACGCGG CGATATGGTT TATTCTCAGC TGGCTGGCGC TCGTAGGTGT GTTATACATT 
TTCAAAAAAG ATGCGGTGAA GTACTTCGTA GCAATATACT GGAAAAGCGA GAAACTTACG
AAGTACGTGG TATTGTTTAC GGATAAGCTG AGGTTCATTC CGTTGAAGGT ATACCTCGTC
TCGGTATTGG TGCTGTTTGC GTTGCCTATG TTTCTAGCTA TGCCTTTTAT AAGGCCTGAC
GGACAGCTGA GCTCTCTGCC AGGTTTCCTC TACATCTTGG TAGGTGGCAC GGTAAACGCC
TTGAGAATGC TTTTAGGCGG GGCGCCAGTA CAAGAGGCCG CCGCAGGCTC GGCAGGCGTC
ACGCCGATTG TGCCGGGCCT TACCCTTCCG TGGGATCAAC TTCCCTACCT CGCTGTGGCT
ATTGCCGTGG CCGTAGTACT GCACGAGCTG ATGCACGGCT ACGCCGCGCT GAGATACGGA
ATACCGGTGA AATCAGTGGG CGTCTTCTCC CTATTCTACA TACTAAGCGG CGCCTTTGTA
GAGCCTGACG AAGACCAGTT CAAAAAAGCG AGTACCGAGG CCAAAGCCGC GGTGCTGGCC
AGCGGCGTTG CCGCAAACGT CGTGATAGCC ATCATCGCAA TGTTAATAGG AGTTTTTGGT
GCCTGGGCTG GACTTGGCGG CGCCGTGTTT GGCGCATCAG CTTACGGCAT ACATCCCGGC
GACAGAGTGG TAGAAATCCG GGGTTGCGGC TTTGTGGAGA GGGTCTACAC GCCAGATGAT
TTTGTGACGA AGATCAACGT GTTAGCGGGA CTTGGTCCCC TGCTGGGGAT TAACAAAACT
ATTAGTTGCA AGCCCGGCGA CAAAGTGACT CTGGTTGCCT ATTCTTGGCT TCACAGGTAC
GAGGTGCAGG TAGACTACTC GAATTTCACA ACACCTTCAC AGCTGAGGTG GCTCTACACC
GATGGCTCTC TCTACCTAGG CGGGGTAAGG CCTGGCGACG TTATTAAGAG AGTGGAGGGT
TGCGGGGTGG CCAGAGACAT CACAAGTAGC GGGGACTTCC TCGCCTTTAT ACTGGAGTCG
AGAAGAATCT GCAAAGCAGG CGATGCGGTA AAGGTATATG TTGAGAGAAA CGGCACAATC
CACGTCTTTA ATGTCACGCT TGTGGAAAAA GACGGGCGGT TGTTTTACGG CATCGGACCA
ACCAGCTTCC CATTACTGGG ATATGACTAT GGGCCCGTGA AGAGGGAACA GCTTTACAAC
ACAGATTTTA CAAAACTAAT TTTCTGGCTC CTCGTGGTGA ACTACGGCCT GGCAGCTATA
AACGCCTTGC CAATCTATCC ACTTGACGGG GGGCAACTTC TCGCGGCTGT GGCACAGCGG
AAGCTCGGCG AAAAGAAAGG CACCGCGGTA GTCAACGCAG TGACTTGGAT CCTCGCCGCG
ATGCTGATCT TCAACATCGC CCTGGGGCTA ATAGGCGAGC AGTACAGAGT CCTAGAAGCG
ATAAGATGA
 
Protein sequence
MDAAIWFILS WLALVGVLYI FKKDAVKYFV AIYWKSEKLT KYVVLFTDKL RFIPLKVYLV 
SVLVLFALPM FLAMPFIRPD GQLSSLPGFL YILVGGTVNA LRMLLGGAPV QEAAAGSAGV
TPIVPGLTLP WDQLPYLAVA IAVAVVLHEL MHGYAALRYG IPVKSVGVFS LFYILSGAFV
EPDEDQFKKA STEAKAAVLA SGVAANVVIA IIAMLIGVFG AWAGLGGAVF GASAYGIHPG
DRVVEIRGCG FVERVYTPDD FVTKINVLAG LGPLLGINKT ISCKPGDKVT LVAYSWLHRY
EVQVDYSNFT TPSQLRWLYT DGSLYLGGVR PGDVIKRVEG CGVARDITSS GDFLAFILES
RRICKAGDAV KVYVERNGTI HVFNVTLVEK DGRLFYGIGP TSFPLLGYDY GPVKREQLYN
TDFTKLIFWL LVVNYGLAAI NALPIYPLDG GQLLAAVAQR KLGEKKGTAV VNAVTWILAA
MLIFNIALGL IGEQYRVLEA IR
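
Both sequences are printed in blocks of ten characters, six blocks per line, so they need to be joined before any downstream use. The following is a minimal sketch of how such a listing might be normalised and its GC fraction computed. The helper names are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def join_blocks(listing: str) -> str:
    """Strip all whitespace from a blocked sequence listing and upper-case it."""
    return "".join(listing.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence listing above (six blocks of ten bases).
first_line = "GTGGACGCGG CGATATGGTT TATTCTCAGC TGGCTGGCGC TCGTAGGTGT GTTATACATT"
seq = join_blocks(first_line)
print(len(seq))  # prints: 60
print(round(gc_fraction(seq), 2))
```

Applied to the full joined gene sequence, `gc_fraction` can be compared against the GC content reported in the record (53%).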