Gene Pars_0107 details


Gene Information

Locus tag: Pars_0107
Symbol:
ID: 5054775
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 93482
End bp: 94792
Gene Length: 1311 bp
Protein Length: 436 aa
Translation table: 11
GC content: 56%
IMG OID: 640467686
Product: hypothetical protein
Protein accession: YP_001152374
Protein GI: 145590372
COG category: [R] General function prediction only
COG ID: [COG4882] Predicted aminopeptidase, Iap family
TIGRFAM ID:
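
The coordinate and length fields in this record can be cross-checked against each other. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes a terminal stop codon that is not counted in the protein length. The function names are hypothetical and do not come from the source database.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 93482
end_bp = 94792
protein_length_aa = 436


def gene_length(start, end):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(length_bp):
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length_bp = gene_length(start_bp, end_bp)
print(length_bp)                           # 1311
print(expected_protein_length(length_bp))  # 436
```

Under those assumptions both computed values agree with the listed Gene Length (1311 bp) and Protein Length (436 aa).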


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 40
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCGGAGG TTTTTAGAAA ATGCACCGCC TACAAGGATT TAACGGCTGG AAGCCCCAGG 
GAGAGGGAGT TTCTGCACTG GCTTCTTACA TTTTTAGACA CGCCGAGCGT CTGGTTCCAC
CTGTCCCCGG TGGAGGTCTT GGCCTGGGAG GAGGTAGAGA CTAGGCTCGA GGTGGGCGAC
GTGGCCCTTT CAGGCTTGGC CTTGCCCTAC TCGAGGTCGG CATCTGTGGA AGGGAGGCTA
GTGCCCGTTG ACGGGGATGT GGAGGGGAAT ATCGCCGTGG CCAAGTTTCC CGAGGATGTT
GACGACGCCA AGTACTTGCT TTTAGAAGCG GCTCGGAGGG GGGCACAAGC CGTCCTCTTC
ACTGGGAGGC CCCAGCGGCG TATTGTAATC TCCGGCGAGC CAGGGTTTAA GCTAGACTCG
GCCCCCGCGC CCATACCAGC CGCAAGTTTC GAAGATTTAC AGAGCCACAT AGGGAAGAGG
GCCCGGCTTG TGATGGACGT AAATATGCAG ACCACGTACA GCTACAACCT CGTAGCTTTC
AATAGTCTAG ACAACACGCC GATGATATCT GCACATTGGG ACCATTGGCT TGTAGGCGCC
GCGGACAATT GCGCCGGGGT TGAGGCGGCT GTTCTGGCGT TTAGCGAGCT GGTCGCCGAC
GATGTGCCGA TAGCCCTGGG GCTATTTACA GCAGAGGAGG GGGTAGCCCC CCACCTGCCG
TCTTTCTACT GGGCGTGGGG GTCGCTCAAC TACTTCAAAA GGTGGCAACC TACGCTTCTA
GTCAACATTG ACGTAGTTGG GGCCGGCACG CCGAGGATTT ACGCAATGCC TTACCTGCAC
AGGTATTTGA CGGGGTTGGG CCCCGTGGAG GACCCCGTAC CATATTTCGA CAGCGTGCAC
CTCGAGAGGT GGGGGCTTCC ATCGGTAACC ATATCCTCGC TGAGGGACAC GTGGGGGATC
TACCACAGCC CTCTAGACGC ATATGCTGAG CCTGAGAGTA TCCTATACGC GGCGGAGTTG
GCAAAAAGGA TAGCCAAGAT TAAGCCGACG CCTCCCGAAG TGTCTTTGCA CGAGTACGGC
ATTCCGCACG TGTATAGCCC GTACGAGGCT TGGTCTATTG TGTACAACTA TCTTGTGCTC
TTCCGCGACT TTACGCACTC CGACATAGTC TACACCAATG TATTTAAATT TCTGAGAAGC
GGCGCTGGGT ATCGCCGGAT AGATCTACTC GGCGGCCCAA CTCTCTGTGT GGAGGACTGC
AAAAATGCTG TGGAGATTTA TCGAGAGCTA GTTCTGCTTA GACTGCTCTA A
 
Protein sequence
MAEVFRKCTA YKDLTAGSPR EREFLHWLLT FLDTPSVWFH LSPVEVLAWE EVETRLEVGD 
VALSGLALPY SRSASVEGRL VPVDGDVEGN IAVAKFPEDV DDAKYLLLEA ARRGAQAVLF
TGRPQRRIVI SGEPGFKLDS APAPIPAASF EDLQSHIGKR ARLVMDVNMQ TTYSYNLVAF
NSLDNTPMIS AHWDHWLVGA ADNCAGVEAA VLAFSELVAD DVPIALGLFT AEEGVAPHLP
SFYWAWGSLN YFKRWQPTLL VNIDVVGAGT PRIYAMPYLH RYLTGLGPVE DPVPYFDSVH
LERWGLPSVT ISSLRDTWGI YHSPLDAYAE PESILYAAEL AKRIAKIKPT PPEVSLHEYG
IPHVYSPYEA WSIVYNYLVL FRDFTHSDIV YTNVFKFLRS GAGYRRIDLL GGPTLCVEDC
KNAVEIYREL VLLRLL
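
The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is a minimal illustration, not the pipeline that produced the record: it assumes the sequence is given 5' to 3' on the coding strand, that an alternative start codon such as GTG is read as methionine in the first position (as in NCBI translation table 11), and that whitespace in the 10-base blocks is only formatting. `translate_cds` and `gc_content` are hypothetical helper names.

```python
# Codon table for NCBI translation table 11, built from the conventional
# TCAG base ordering. The amino acid assignments match the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))

# Start codons permitted by table 11; any of them is read as Met at position 1.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq):
    """Remove the spaces and line breaks used to block the sequence."""
    return "".join(seq.split()).upper()


def translate_cds(seq):
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(seq)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(dna), 3):
        codon = dna[i:i + 3]
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        if i == 0 and codon in START_CODONS:
            residue = "M"  # initiator codon is read as methionine
        protein.append(residue)
    return "".join(protein)


def gc_content(seq):
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence listed above.
print(translate_cds("GTGGCGGAGG TTTTTAGAAA ATGCACCGCC"))  # MAEVFRKCTA
```

The printed residues match the first ten residues of the protein sequence listed above. Passing the full gene sequence to `gc_content` gives a fraction that can be compared with the GC content field in the record.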