Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_0842 |
Symbol | |
ID | 3909100 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 957753 |
End bp | 958739 |
Gene Length | 987 bp |
Protein Length | 328 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 637882735 |
Product | arginase |
Protein accession | YP_484464 |
Protein GI | 86747968 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0010] Arginase/agmatinase/formimionoglutamate hydrolase, arginase family |
TIGRFAM ID | [TIGR01229] arginase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.950445 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGCCCT CCAGCTCCGT GATCCCGCCC GATCCGAACC GCCGCGTCGC GCTGCTCGGG GTGCCGATCG AGATCGGCGC CGGGCTGCGC GGCACGCTGA TGGGCCCGGC GGCGCTGCGC ACCGCGGGTA TCGGCCGCGT GCTCGAACAG TTGCGCATCA CCGTCGAGGA TCACGGCGAC ATGGCGCGGC CGCCACCTTG TCGCGACGCG GGTCCGACGC CGGCCAACGC CAATTATTAC GACGAGGTGA AAAGCTGGGT CGGCGCGATC TCGGCGCGGG CCCATGAGCT GGCGCGTTCC GGCGCCGTGC CGCTGTTCAT GGGCGGCGAT CACAGCCTGT CGATGGGCTC GGTCAACGGC GTCGCGCGCT ATTGGCAGAC GCAGGGGCGG CCGCTGTTCG TGCTGTGGTT CGACGCCCAC GCCGACTACA ACACCCCGGC GACGACGCTC TCCGCCAACA TGCACGGCAT GTCGGCGGCG TTCCTGTGCG GCGAGCCGGG GCTCGATGCG CTGCTCGGCG ACGAGCCCCG GGTGTCGATC CCGCCCGACC GGCTCGACCT GTTCGGCATC CGATCGATCG ATCCGCTGGA GAAGGAATTG GTGCGCGCCC GCGCCATCCC GGTGGTCGAC ATGCGGGCGA TCGACGAATT CGGCGTCGGC GTGCTGATCC GCCGGTTGAT CGACCGCGTC CGCGCGGCGA ACGGCGTGCT GCATCTGTCG TTCGATGTCG ACGTGCTCGA CCCTGCGGTC GCGCCCGGCG TCGGCACCAC CGTGCCGGGC GGCGCGACCT ATCGGGAGGC GCATCTGGTG ATGGAGCTGC TGCACGACTC GGGATTGGTG CGTTCGCTCG ACGTGGTCGA ACTCAACCCC TTCCTCGACG AGCGCGGCCG CACCGCCCGG GTCGCGGTGG AGCTGATCGG CAGCCTGTTC GGCATGCAGA TCTCCGACCG GGTGACGCCG AGCAACGCGC TGCTGCCCGA GGGGTGA
|
Protein sequence | MTPSSSVIPP DPNRRVALLG VPIEIGAGLR GTLMGPAALR TAGIGRVLEQ LRITVEDHGD MARPPPCRDA GPTPANANYY DEVKSWVGAI SARAHELARS GAVPLFMGGD HSLSMGSVNG VARYWQTQGR PLFVLWFDAH ADYNTPATTL SANMHGMSAA FLCGEPGLDA LLGDEPRVSI PPDRLDLFGI RSIDPLEKEL VRARAIPVVD MRAIDEFGVG VLIRRLIDRV RAANGVLHLS FDVDVLDPAV APGVGTTVPG GATYREAHLV MELLHDSGLV RSLDVVELNP FLDERGRTAR VAVELIGSLF GMQISDRVTP SNALLPEG
|
| |