Gene Pars_2127 details

Gene Information

Locus tag: Pars_2127
Symbol:
ID: 5054548
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 1901700
End bp: 1904171
Gene Length: 2472 bp
Protein Length: 823 aa
Translation table: 11
GC content: 56%
IMG OID: 640469679
Product: peptidase M1, membrane alanine aminopeptidase
Protein accession: YP_001154325
Protein GI: 145592323
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0308] Aminopeptidase N
TIGRFAM ID:
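
The coordinates and lengths listed above are mutually consistent, and a little arithmetic confirms it. The following is a minimal sketch, assuming inclusive 1-based coordinates and a single terminal stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length from inclusive 1-based coordinates (an assumption)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Number of codons minus one terminal stop codon (assumed present)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1901700, 1904171)
print(length)                           # 2472
print(expected_protein_length(length))  # 823
```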


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 43
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGTACG TGGTGGGGAG AGATTTCGCT TTCCCAGAGT ATCTCCCCCG CTATCCCCGG 
GAATACGGCT TTGACGTCCT CTACATGTGG CTTGATATTT CAATAGATGT GCACTCCGGC
GTTGTAGAAG GCGCCGTTAG GTATAGAGTC AGGGCTAGGA AAGATGGGGT TCCCGTGGTT
TTAGACGCGG TCGAGATGGA GGTGCGGGGG GCGAGCCACG ACTACTACTA CGACGGCGAG
AAAATAGAGA TAAGGCCGAG CTGGAAAAAG GGCGACGAGA TTGAGGTGCA AATCTCTTAC
AGGGCGAAGC CCCGGGCGGG TATGTATTTC ATTAAGCCCG ACAAGACGAG GAGGGGGGTC
TACGTTTGGA CCCAGGGGGA GACTGAGTAC AACAGGTACT GGGTCCCCCT GCCAGACTCG
CCTAATATAA AGTTCCCCTG GAGAGTCGCC GTCACGGTGC CTAAGCCCTA CGTGGCCGGT
AGCAACGGCG TCTTGGTAGA GGTGAGGAAT GGAGGAGACC GTAATACCTA TGTCTGGGAG
ATGAGGCACC CCATGTCGCC TTATCTACTC GCCATAGCTA TCGGCGAGTA CGAAGTGCAC
AAGGAGGACT GCGGCGGCGT GGTGCTGGAG TACTACATCC CCAAGTACAT CGACGATAGG
TGGCGCTTCT CCTTCTACAA TACCTGTAAA ATAATGAAGT TCTTCTCGGA GTACCTCGGC
GTGCCGTACC CCTACGAGCG GTACGCCCAG GTGGTGGTGC CCGAGTTTAT CTACGGCGGC
ATGGAGAACA CCACCTTCAC AATCCTGACA GACTGGACTA TCCACGACAA ACACGCCCAT
TGCCCATACA CCGGTTTCCC CTGCCCGGAG CACGAGGACT TCTCCTCTGA TCCCCTCGTG
GCGCACGAGA TGGCCCATAT GTGGTTCGGC GACCTCGTAA CCGCTAAGGA CTGGGCCCAC
ATAGCGATAA ACGAGTCCTT CGCCACGTTT ATAGAGGATC TCTGGACCGA GGCCTCAAAG
GGCAGGGACG AGTACCTCTA TGAGATCTAT ACAAACTTTA AGACTTATCT GGGGGAGTAC
ACCAGGCGGT ATTCAAGGCC CATTGTGACA AACCTGTACA AGATACCAGA CGAGGTGTTT
GATAGGCATG CCTACGAAAA GGGGTCAGTG GTCCTCCACA TGCTTAGGAG CCTCCTGGGC
GAGGAGAGCT TCCGGAAGGG GCTAAAGCTC TTTCTCGAAA GACACAGATA TAAAGCGGTC
GACATGGAGG ACTTACGGAA GGCCTTCGAG GAGGCGTCTG GACGAGACCT TGAGTGGTTC
TGGAAGCAGT TCTGGTACTC GGCAGGCCAC CCCGTGGTGA AAGTGTCTTG GAGTTACTCG
GACGGGGCTC TAAAGCTCCA GCTGAGGCAG GCGCAGGGGG AGGACAGCTA CCCAGTCTAC
GCATTGCCTC TTGAGGTGAA GATTGTGTAT GAGGACGGGA GGAAGGAGGT GAGGGAGGTT
TTGCTAAACG AGAAGGAGGT GACTCTATAC GTGCAGGGCG GGAAGCCTAG GTATATTTGT
GTTGACCCCC GCTTTAAGCT TATGAAGTCG CTTGATCTGG GCTACCCGCT TGAGTCGGCC
GTGGCTATGT TAGAGGATGA AGACATGTAC TGCCGTCTCC AAGCTGTGGA GGTTCTCAAG
AAAAATGGAA GCCCCAAAGC TGTCGATGCG CTGGCTAAGG CGCTTGGGGA CAAGTTCTGG
GGTGTGGCTG CTGAGGCGGC TAGGGCTCTT GGAGAAATAG GCACAGGAGC TGCTGTGGCT
AAGCTTGTGG AGTCTTACAG GATCGTGTCT CATCCAAGAG TGAGGCGGGC TATTGTCGAG
GCCTTGGGGT CGGCCAAGAG GAAGGAGGCA GCGGAATTCC TTGACATGGT GTTGCACGAC
GCTGGGGAGA GCTACTATGT TAGGTCGGAG GCGGCAAGAT CGCTTGGCAG GGTTAAGTGG
GAATTCGCCG AGTATAGTCT GAAAAAGGCG CTTGAGTATC CAAGCCACGT AGATGTGATA
AAGAGAGGCG CCCTCGAGGG ACTCGCTGAA TTGGGGACCG ATGAGGCGTT GAAAATCGTC
CTCCGCCACG CCGAGCCCGA TATGCCCACC CCTGTTAGGG CGACGGCCGT CCAGTCCTTG
GCGAAGTTTG GGCCGCGTAA GGAAGTTGTG GATGCCGTGA GGATGTACAT GCGTGACGAG
AATTTCAGAG TTCGCTTCGC CGCCGTCACA GCCGCCTTGG AGCTACTTGA GCCTAAGCTG
TTGCCGGATC TCCAGGAGAG AGCTGAACAG GACATTGACG GGAGGGTTAG GAGAGTGGCG
AGGGAGGTCG CGGAGAAGAT CAAGAAGTTT ATGGAGAGGG GGACTGAGTA CCAGAAGCTG
AGAGAAGAGG TTGAGAAGCT GAGAGAGGAG TATCGTAAGC TAGCTGACCG CGTAGCGAGG
CTGGAGAGGT AG
 
Protein sequence
MKYVVGRDFA FPEYLPRYPR EYGFDVLYMW LDISIDVHSG VVEGAVRYRV RARKDGVPVV 
LDAVEMEVRG ASHDYYYDGE KIEIRPSWKK GDEIEVQISY RAKPRAGMYF IKPDKTRRGV
YVWTQGETEY NRYWVPLPDS PNIKFPWRVA VTVPKPYVAG SNGVLVEVRN GGDRNTYVWE
MRHPMSPYLL AIAIGEYEVH KEDCGGVVLE YYIPKYIDDR WRFSFYNTCK IMKFFSEYLG
VPYPYERYAQ VVVPEFIYGG MENTTFTILT DWTIHDKHAH CPYTGFPCPE HEDFSSDPLV
AHEMAHMWFG DLVTAKDWAH IAINESFATF IEDLWTEASK GRDEYLYEIY TNFKTYLGEY
TRRYSRPIVT NLYKIPDEVF DRHAYEKGSV VLHMLRSLLG EESFRKGLKL FLERHRYKAV
DMEDLRKAFE EASGRDLEWF WKQFWYSAGH PVVKVSWSYS DGALKLQLRQ AQGEDSYPVY
ALPLEVKIVY EDGRKEVREV LLNEKEVTLY VQGGKPRYIC VDPRFKLMKS LDLGYPLESA
VAMLEDEDMY CRLQAVEVLK KNGSPKAVDA LAKALGDKFW GVAAEAARAL GEIGTGAAVA
KLVESYRIVS HPRVRRAIVE ALGSAKRKEA AEFLDMVLHD AGESYYVRSE AARSLGRVKW
EFAEYSLKKA LEYPSHVDVI KRGALEGLAE LGTDEALKIV LRHAEPDMPT PVRATAVQSL
AKFGPRKEVV DAVRMYMRDE NFRVRFAAVT AALELLEPKL LPDLQERAEQ DIDGRVRRVA
REVAEKIKKF MERGTEYQKL REEVEKLREE YRKLADRVAR LER
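
The gene and protein sequences above are printed in blocks of ten characters, so the whitespace has to be removed before any computation. The following is a minimal sketch of how the two sequences could be cross-checked and the GC content recomputed. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical. The codon table is built from the standard amino acid assignments, which translation table 11 shares with table 1. Table 11 differs in its permitted start codons, which this sketch does not handle because the gene starts with ATG.

```python
BASES = "TCAG"
# Amino acids for the 64 codons, enumerated in TCAG order for each position.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(blocked: str) -> str:
    """Remove the spaces and line breaks of the 10-character block layout."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence above.
first_blocks = "ATGAAGTACG TGGTGGGGAG AGATTTCGCT"
print(translate(first_blocks))  # MKYVVGRDFA
```

Passing the full gene sequence to `translate` and comparing the result with the cleaned protein sequence checks the whole record, and `gc_percent` on the same input can be compared with the GC content reported in the gene information.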