Gene Pars_0104 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_0104 
Symbol 
ID5054715 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp90897 
End bp92369 
Gene Length1473 bp 
Protein Length490 aa 
Translation table11 
GC content53% 
IMG OID640467683 
Productcarboxypeptidase Taq 
Protein accessionYP_001152371 
Protein GI145590369 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2317] Zn-dependent carboxypeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.92548 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATAAGGT CTGATACAGT AAAGCAGATT TTAGAACACT ACCGCGTTAT ATGGGCGCTT 
GGCCACGCGC AGTCGGTTAT GGGGTGGGAC AGTGAGACTT ACATGCCGGA GGAGGGGATC
AAGGGGAGGG CCGCCGCAAG GGCCGAGATC GCCCAGTTAA TCCAGCGTTT TATGCTTGAC
GAGAAGTTCG TCAAGCTGGT GGAGAAGGCC GAGGAGGAGA AGGACCTCAC AGACGTGGAG
AGAGGCATAG TGAGGGTTTT AAAAAGAGAT CTCAAATTCT ACCAGAGGGT GCCGCCTGAG
ATCGTGAAGG AGTTCGCCAA GGTGACTTCG GAGGCCTTTG TGGTGTGGAG AAACGCCAAG
GAGAAGGCGA AGTTTGACAT ATTTGCCCCA CATTTGGAAA AGATAGTGGA GCTCTCCAGG
GTTATCGCAG ATAAGCTCGG CTACGAGGAG CACCCATACG ACGCACTGCT CGACCTATAC
GAAGAGGGGC TCACCTCTAG AGACGTAGAG GCGGTGTTTT CTGTGCTGGA GCCGGGGATT
AGAAAACTTC TAAACAAGCT GGAATCGGCG GGCTGGCCTA AGAAACACCC GCTTGAGGAG
GTCCCCTACG AGAAGTCGAA AATGGAGGCC GCGATAGTGG AGGTGTTGGA ATTGGTGGGC
TATCCTAAAA CCCGGTTTAG GATCGACGTA TCGCCTCACC CGTTTACCAT AGGCATAACT
ACTCCGTTCG ACGTGAGAAT TACCGTGAGG TACAGGGGGG TTGATTTCAA AGAGTCCTTG
TTCTCGGCGC TTCATGAATA CGGCCATGCC CTATACGAGC TGAACATCGA CGAGTCCCTC
GCCATGACGC CCGTCGGTAG CGGCGTGTCG CTGGGCATAC ACGAGAGCCA GTCGAGATTT
TTCGAAAACG TAGTGGGCAG GAGCAGGGAG TTCGTGGCCA AGATGTCGCC GATTCTGCGT
AAACACCTAG ATTTGTCCAA ATACACAGAC GAGGACTTGT TCTACTACTT CAACACGGTT
AGGCCAAGCC TAATCAGAAC AGAGGCCGAC GAGGTTACCT ACAATCTCCA CATACTCCTG
CGGTACAGGC TAGAGCGGTT GATGATTACC GGCGAGGTGA AGGTGAAGGA GCTCCCAGAG
CTTTGGAATA ACGAAATGGA CCGGCTTCTC GGCGTAAGGC CTAAAAACGA CGCGGAGGGG
GTCCTCCAGG ATATCCACTG GAGCCACGGC TCTATTGGAT ACTTCCCAAC CTACACACTG
GGCAACGTCG TCGCCGCAAT GATTTACTAT AAGCACGGGA GAGTGAGGGA GCTCATAGCC
GAGGGCAACA TAGCCGCTGT GAAGGAGTAT CTCAGAGAAA AAGTACATAA ATGGGGGAGC
GTGTACCCGC CTAAGGAGCT ACTGGTAAGA AGCTTCGGCG AGACCTACAA CGCCGAATAC
TTGGTGAAAT ATCTCGAGGA GAAGTACCGC TAG
 
Protein sequence
MIRSDTVKQI LEHYRVIWAL GHAQSVMGWD SETYMPEEGI KGRAAARAEI AQLIQRFMLD 
EKFVKLVEKA EEEKDLTDVE RGIVRVLKRD LKFYQRVPPE IVKEFAKVTS EAFVVWRNAK
EKAKFDIFAP HLEKIVELSR VIADKLGYEE HPYDALLDLY EEGLTSRDVE AVFSVLEPGI
RKLLNKLESA GWPKKHPLEE VPYEKSKMEA AIVEVLELVG YPKTRFRIDV SPHPFTIGIT
TPFDVRITVR YRGVDFKESL FSALHEYGHA LYELNIDESL AMTPVGSGVS LGIHESQSRF
FENVVGRSRE FVAKMSPILR KHLDLSKYTD EDLFYYFNTV RPSLIRTEAD EVTYNLHILL
RYRLERLMIT GEVKVKELPE LWNNEMDRLL GVRPKNDAEG VLQDIHWSHG SIGYFPTYTL
GNVVAAMIYY KHGRVRELIA EGNIAAVKEY LREKVHKWGS VYPPKELLVR SFGETYNAEY
LVKYLEEKYR