Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_2833 |
Symbol | |
ID | 3910626 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 3226598 |
End bp | 3227605 |
Gene Length | 1008 bp |
Protein Length | 335 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637884733 |
Product | allophanate hydrolase subunit 2 |
Protein accession | YP_486446 |
Protein GI | 86749950 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1984] Allophanate hydrolase subunit 2 |
TIGRFAM ID | [TIGR00724] biotin-dependent carboxylase uncharacterized domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.636901 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.872567 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCAAGC TTCTAATCGA CAATGTCGGC CCGGCGACCT CCGTGCAGGA CGCGGGACGC CACGGCGCGC AGCGCTACGG TCTGACGCCG AGCGGCGCGA TGGATCGTCT GTCGCTGGCT GCCGCGAACG TGCTGGTCGG CAACGATGCG TTCGCCGCCG CGGTCGAATT GGGGCCGCTC GGCGCAACGC TCACCGCGCA CGACGGCGCG GTGCGGCTGG CCCTGACAGG CGCGGAGCGC TCCGCCACGA TCGGCGACCG CGCGATCGCA CTCAACGAGT CCTTCCTGCT CGCCGACGGC GAGACATTGA CGCTCGGAAT CGCGCGCAGC CAGGTGTTCA GCTATCTGGC GATCGCTGGC GGCATCGATG GCGAGCCGAT GTTCGGCAGT CTCGCGGTCA ATGCCCGCGC CGGCCTCGGC AGTCCCTACC CGCGGCCGCT GCAATCCGGC GACGCCATCC CGGCCGCGTC GGCAGTCGTT GCGCCCGAAC GTCGCCTCGA TCTGCCGACA CCGCCCGACG GGCCGATCCG CGTCGTGCTC GGCCCGCAGG ACGACGAATT CGGCGATGCC GTCGCGACCT TCCTCGACAG CGCGTGGAAA GTGTCGGCGA CCAGCGACCG GATGGGCTAT CGCCTCGAAG GTCCGGAGAT CCGCCATCTG CACGGCCACA ACATCGTCTC CGACGGCACT GTCGACGGCA GCATCCAGGT TCCCGGCAAT GGCCAGCCGA TCGTGCTGAT GCCCGATCGC GGCACCAGCG GCGGCTATCC GAAAATTGCC ACCGTGATCA CCGCCGATCT CGGTCGGCTC GCGCAGCTTC AGCCCGGGCG GCCGTTTCGT TTCAGATCGG TGAGCATGGA GGAGGCGCAG GCCGAATATC GCGCAATGGC CGGGCTGATC CGCGCCCTGC CCGACCGGAT CGCGGACGCG CAGCATATGA CGCTCGACCT CGACGCGCTG CTGACGGCCA ACGTGGCGGG CGCGGCCACC AACGCGCTCG AAATCTGA
|
Protein sequence | MSKLLIDNVG PATSVQDAGR HGAQRYGLTP SGAMDRLSLA AANVLVGNDA FAAAVELGPL GATLTAHDGA VRLALTGAER SATIGDRAIA LNESFLLADG ETLTLGIARS QVFSYLAIAG GIDGEPMFGS LAVNARAGLG SPYPRPLQSG DAIPAASAVV APERRLDLPT PPDGPIRVVL GPQDDEFGDA VATFLDSAWK VSATSDRMGY RLEGPEIRHL HGHNIVSDGT VDGSIQVPGN GQPIVLMPDR GTSGGYPKIA TVITADLGRL AQLQPGRPFR FRSVSMEEAQ AEYRAMAGLI RALPDRIADA QHMTLDLDAL LTANVAGAAT NALEI
|
| |