Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_2158 |
Symbol | |
ID | 5054292 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | - |
Start bp | 1933022 |
End bp | 1934077 |
Gene Length | 1056 bp |
Protein Length | 351 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 640469710 |
Product | amidohydrolase |
Protein accession | YP_001154356 |
Protein GI | 145592354 |
COG category | [F] Nucleotide transport and metabolism [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 42 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAGCGA GGTATAAGGC CCGGTTTGTA CTAGCCGGTG AGCTGGAGGT GTTGCAAGAC GGCGTGGTTG AGGTAGACTA TGGCGGGGTC GTGGTAGGTG TTGGGAAGTA CACTGGAGGG ACTGTGGCCG ACTTAGGCAA CGTGGTGTTG ATGCCGCAAC TGGTCAACGC CCACGTACAT GTCCTCGACG CCGCAATAAT CGACAGAGAC GACATGTACA TTGACGATCT TGTGGGGTGG CCGTACGGGG TGAAATACCG CCTGGTTAAG GAACATGTTA GAAGGGGGCG CCACATACCA CTACTCAGGA AAGTGGCTGA GAGGATGAGA CGCTACGGCG TTGGGTGCGC CTTGATATAC GCAGAATACG CCGCCCGCGA CGTGGAGACA ATATTCAAGG AATACGGCGT CGAGGCGATT GTATTTCAAG AAGCGCACGG CGATTTTCCA GACTACCCAA ATGTCCAAGT GGCTTCGCCC CTCGACCACC CCGTGGAGTA CCTCCGGGAG CTGAGGAGGC GCTACAGACT CGTCTCTACC CACATCTCCG AGACTGAAGA CTGCCACGAG GGGGGCGATC TTGAGCTGGC CCTAAAGGAG CTGGGCGCCG ACGTGTTGGT GCACCTCGTC CACGTCACCG ACGAGGAGAT ATCTTCTATA CCGCCGGAGA AAACCATCGT GGTTAACCCA AGGGCAAACG CCTACTTCGT GGGTAGGGTG GCGCCGGTGC ATAAGCTGTT GAGCCTAAAG CCCCTCCTGG GGACAGACAA CGTGTTCATG AACGAGCCTG ACCCCTGGGC CGAGATGAAA TTCCTACACG CCTACTCAGC AATAGCCGGG TGGGGTCTAG GCGAGAGGGA AATACTGGCA ATGGCAACTG TGTGGGCGTG GGAAAAAATA CGATGTATAC CGCCTATTGA GCCAGGTAGG CCTCTAAGGG CTCTCGCCGT GGCTGCGCCC TACGCGGGGG ATAAAGTCTT GAAATACCTC GTAAAGCGTG TTGCCCACAG CGACCTTATA GCCTTTGTGG AGGGGAGCCG CATCACCCCC ACATAG
|
Protein sequence | MKARYKARFV LAGELEVLQD GVVEVDYGGV VVGVGKYTGG TVADLGNVVL MPQLVNAHVH VLDAAIIDRD DMYIDDLVGW PYGVKYRLVK EHVRRGRHIP LLRKVAERMR RYGVGCALIY AEYAARDVET IFKEYGVEAI VFQEAHGDFP DYPNVQVASP LDHPVEYLRE LRRRYRLVST HISETEDCHE GGDLELALKE LGADVLVHLV HVTDEEISSI PPEKTIVVNP RANAYFVGRV APVHKLLSLK PLLGTDNVFM NEPDPWAEMK FLHAYSAIAG WGLGEREILA MATVWAWEKI RCIPPIEPGR PLRALAVAAP YAGDKVLKYL VKRVAHSDLI AFVEGSRITP T
|
| |