Gene Pars_2158 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_2158 
Symbol 
ID5054292 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp1933022 
End bp1934077 
Gene Length1056 bp 
Protein Length351 aa 
Translation table11 
GC content57% 
IMG OID640469710 
Productamidohydrolase 
Protein accessionYP_001154356 
Protein GI145592354 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGCGA GGTATAAGGC CCGGTTTGTA CTAGCCGGTG AGCTGGAGGT GTTGCAAGAC 
GGCGTGGTTG AGGTAGACTA TGGCGGGGTC GTGGTAGGTG TTGGGAAGTA CACTGGAGGG
ACTGTGGCCG ACTTAGGCAA CGTGGTGTTG ATGCCGCAAC TGGTCAACGC CCACGTACAT
GTCCTCGACG CCGCAATAAT CGACAGAGAC GACATGTACA TTGACGATCT TGTGGGGTGG
CCGTACGGGG TGAAATACCG CCTGGTTAAG GAACATGTTA GAAGGGGGCG CCACATACCA
CTACTCAGGA AAGTGGCTGA GAGGATGAGA CGCTACGGCG TTGGGTGCGC CTTGATATAC
GCAGAATACG CCGCCCGCGA CGTGGAGACA ATATTCAAGG AATACGGCGT CGAGGCGATT
GTATTTCAAG AAGCGCACGG CGATTTTCCA GACTACCCAA ATGTCCAAGT GGCTTCGCCC
CTCGACCACC CCGTGGAGTA CCTCCGGGAG CTGAGGAGGC GCTACAGACT CGTCTCTACC
CACATCTCCG AGACTGAAGA CTGCCACGAG GGGGGCGATC TTGAGCTGGC CCTAAAGGAG
CTGGGCGCCG ACGTGTTGGT GCACCTCGTC CACGTCACCG ACGAGGAGAT ATCTTCTATA
CCGCCGGAGA AAACCATCGT GGTTAACCCA AGGGCAAACG CCTACTTCGT GGGTAGGGTG
GCGCCGGTGC ATAAGCTGTT GAGCCTAAAG CCCCTCCTGG GGACAGACAA CGTGTTCATG
AACGAGCCTG ACCCCTGGGC CGAGATGAAA TTCCTACACG CCTACTCAGC AATAGCCGGG
TGGGGTCTAG GCGAGAGGGA AATACTGGCA ATGGCAACTG TGTGGGCGTG GGAAAAAATA
CGATGTATAC CGCCTATTGA GCCAGGTAGG CCTCTAAGGG CTCTCGCCGT GGCTGCGCCC
TACGCGGGGG ATAAAGTCTT GAAATACCTC GTAAAGCGTG TTGCCCACAG CGACCTTATA
GCCTTTGTGG AGGGGAGCCG CATCACCCCC ACATAG
 
Protein sequence
MKARYKARFV LAGELEVLQD GVVEVDYGGV VVGVGKYTGG TVADLGNVVL MPQLVNAHVH 
VLDAAIIDRD DMYIDDLVGW PYGVKYRLVK EHVRRGRHIP LLRKVAERMR RYGVGCALIY
AEYAARDVET IFKEYGVEAI VFQEAHGDFP DYPNVQVASP LDHPVEYLRE LRRRYRLVST
HISETEDCHE GGDLELALKE LGADVLVHLV HVTDEEISSI PPEKTIVVNP RANAYFVGRV
APVHKLLSLK PLLGTDNVFM NEPDPWAEMK FLHAYSAIAG WGLGEREILA MATVWAWEKI
RCIPPIEPGR PLRALAVAAP YAGDKVLKYL VKRVAHSDLI AFVEGSRITP T