Gene Information |
Locus tag | RPB_1671 |
Symbol | |
ID | 3908658 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 1902793 |
End bp | 1903989 |
Gene Length | 1197 bp |
Protein Length | 398 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637883565 |
Product | amidohydrolase |
Protein accession | YP_485290 |
Protein GI | 86748794 |
COG category | [F] Nucleotide transport and metabolism [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | |
|
|
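The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (an assumption, though it is consistent with the listed Gene Length), and that the final codon of this unspliced CDS is a stop codon that is not counted in the Protein Length. The function names are illustrative and are not part of the database.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1902793
END_BP = 1903989


def feature_length(start, end):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len):
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the terminal stop codon


gene_len = feature_length(START_BP, END_BP)
print(gene_len)                  # 1197, matching Gene Length
print(protein_length(gene_len))  # 398, matching Protein Length
```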
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.446042 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTCACG ACGCCGCCGC GCCCACGGGA CCGACCAAGC TCGTCATTCG CAACATCGGC CTCTTGATCA GCGGTGACCT CGACAAGCCG ATCCTCGATG CCGACACCAT CGTTGCGGAG AACGGCAAGA TCTCCGCGAT CGGTCGGCTG AAGGACGTCG ACACCGAAGG CGCGACCACC ACCGTCGATG CGGGTGGCGC TGCGGTCACC CCGGGCCTGA TCGACAGCCA CGTCCATCCG GTGGCGGGCG ACTGGACGCC GCGGCAGAGC CAACTCAACT GGATCGACTC CTCGCTGCAC GGCGGCGTCA CCACCATGAT CTCGGCCGGC GAGGTGCACT ATCCCGGCCG GCCGCGCGAC GTCATCGGCA TCAAGGCGCT GGCGATCACC GCGCAGCGCA GCTTCTCCGC CTTCCGCGCC AGCGGCGTGA AGGTCCATGC CGGCGCTCCG GTGATCGAGC ACGAGATGGA AGAGAACGAC TTCAAGGAAC TCGCCGCGGC CGGCGTCAAG CTGCTCGGCG AGATTGGGCT CGGCGGCGTC AAGGACGGAC CGACCGCGAA GAAGATGGTG GCGTGGGCGC GCAAATACGG CATCCAGTCC ACCATCCACA CCGGCGGCCC GTCCATCGCG GGCTCCGGGC TGATCGACAA GGACGTGGTG CTGGAAGCCG GCACCGACGT GATCGGCCAC ATCAACGGCG GCCACACCGC GCTGCCCGAC GGGCAGATCC GCTGCATCTG CGAGGGCTGC AAGGCCGGGC TCGAGCTGGT CCATAACGGC AACGAGCGCT CGGCGCTGTA CACGCTGCGG ATCGCGCGCG AGATGGGCGA TCTCCATCGC GTCATTCTCG GCACCGACGG CCCGGCCGGC TCCGGTGTCC AGCCGCTCGG CATCCTGCGG ATGATTTCGC TGTTGTCGTC GCTCGGCGAT CTTCCCGCCG AACAGGCGTT CTGCCTCGCC ACCGGCAACA CCGCGCGGAT GCGCGATCTC GACTGCGGCC TGATCGAGGT CGGCCGCGTC GCCGATTTCG TGATCATGGA CGCGGCCCAG CACTCGGCCA GTTCGTCGCT GCTGGAAAGC GTCCGCCTCG GCGATCTGCC GGGCATCGGC ATGACCATCA TCGACGGCAT CGTGCGCAGC GAACGCTCCC GCAACACCCC GCCGGCGACG CGACTGCCGA GCATCGTCAA GGCCTGA
|
Protein sequence | MAHDAAAPTG PTKLVIRNIG LLISGDLDKP ILDADTIVAE NGKISAIGRL KDVDTEGATT TVDAGGAAVT PGLIDSHVHP VAGDWTPRQS QLNWIDSSLH GGVTTMISAG EVHYPGRPRD VIGIKALAIT AQRSFSAFRA SGVKVHAGAP VIEHEMEEND FKELAAAGVK LLGEIGLGGV KDGPTAKKMV AWARKYGIQS TIHTGGPSIA GSGLIDKDVV LEAGTDVIGH INGGHTALPD GQIRCICEGC KAGLELVHNG NERSALYTLR IAREMGDLHR VILGTDGPAG SGVQPLGILR MISLLSSLGD LPAEQAFCLA TGNTARMRDL DCGLIEVGRV ADFVIMDAAQ HSASSSLLES VRLGDLPGIG MTIIDGIVRS ERSRNTPPAT RLPSIVKA
|
| |
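The GC content and protein sequence fields can in principle be re-derived from the gene sequence. The following is a minimal sketch rather than the database's own method: it strips the spaces that separate the 10-base blocks, computes the G+C fraction, and translates codon by codon. It relies on translation table 11 assigning the same amino acids to codons as the standard code (the two differ in permitted start codons, which does not matter here because the sequence starts with ATG). The helper names `clean`, `gc_content`, and `translate` are hypothetical, and only the first three blocks of the gene sequence are used as the example input.

```python
# Codon table built in the conventional TCAG order; the amino-acid
# assignments are shared by the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the whitespace between sequence blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    seq = clean(seq)
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the Gene sequence field above.
opening = "ATGGCTCACG ACGCCGCCGC GCCCACGGGA"
print(translate(opening))  # MAHDAAAPTG, the first block of the Protein sequence
print(f"{gc_content(opening):.0%}")
```

Pasting the full Gene sequence field into `gc_content` and `translate` should allow the listed GC content and Protein sequence to be checked in the same way.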