Gene Information |
Locus tag | RPD_2635 |
Symbol | |
ID | 4023132 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | - |
Start bp | 2954269 |
End bp | 2955276 |
Gene Length | 1008 bp |
Protein Length | 335 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637962833 |
Product | allophanate hydrolase subunit 2 |
Protein accession | YP_569765 |
Protein GI | 91977106 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1984] Allophanate hydrolase subunit 2 |
TIGRFAM ID | [TIGR00724] biotin-dependent carboxylase uncharacterized domain |
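
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(2954269, 2955276)
print(length_bp)                  # 1008
print(protein_length(length_bp))  # 335
```

Under these assumptions both values agree with the Gene Length and Protein Length fields of the record.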
| |
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.238129 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.0000404715 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGAGCAGGC TCGTCGTTGA CATCGTCGGC CCGGCCACCT CCGTGCAGGA TGCGGGCCGC CACGGCGCGC AACGCTATGG GCTGACCCCG AGCGGCGCGA TGGATCGATG GTCGTTAGCC GCGGCGAACA CGCTGGTCGG CAACCCCGCA TTCGCCGCTG CGATCGAACT CGGGCCGCTC GGCGCGGCGT TCACCGCGCG CGACGGCGCG GTGCGGCTCG CGTTATGCGG CGCGGAACGT CCCGCCGCGA TCGGCAGCGG AGCGATTGCG CTCAACGAGT CGTTTCTGCT CGCTGAGGAC GAGACGCTGA CGCTCGGCGT CGCGCGCAGC CATGTGTTCA GCTATCTGGC GATCGCAGGC GGCATCAGCG GCGAACCGAT GTTCGGCAGT CTCGCGGTCA ATGCCCGCGC CGGCCTCGGC AGCCCCTACC CGCGGCCGCT ACAGCCCGGC GACGTCATTC CGGCAAAGCC AGCGACGATC GCCGCCGAAC GCCGTCTCGA TCTGCCGAAG CCGTCCGAAG CGCCGATCCG CGTCGTGCTC GGTCCGCAGG ACGACGAATT CGGCGACGCC GTCGCAACCT TCCTCAATGG CGAATGGAAA ATCTCCGCGA CCAGCGACCG GATGGGCTAT CGACTCGAAG GGCCGGAGAT CAGGCATTTG CACGGCCATA ACATTGTCTC CGACGGCACC GTCGACGGCA GCATTCAGGT TCCCGGCAAC GGCCAGCCGA TCGTGTTGAT GCCCGACCGC GGCACCAGCG GCGGCTACCC GAAGATCGCG ACCGTGATCT CCGCCGATCT CGGTCGTCTC GCGCAATTCC AGCCCGGGCG GCCGTTCCGT TTCAAGGCGG TGAGCATGGA CGAGGCGCAG GCCGAGTATC GTGCGATGGC GAAGTTGATC CGCGCTTTGC CAGATCGTTT GCAGGATGCG CAACAGGGGA TGCTCGACCT CGACGCGCTG TTCACCGCCA ACGTCGCGGG CGCGGCGGCC AATGCGCTCG ACGGCTGA
Protein sequence | MSRLVVDIVG PATSVQDAGR HGAQRYGLTP SGAMDRWSLA AANTLVGNPA FAAAIELGPL GAAFTARDGA VRLALCGAER PAAIGSGAIA LNESFLLAED ETLTLGVARS HVFSYLAIAG GISGEPMFGS LAVNARAGLG SPYPRPLQPG DVIPAKPATI AAERRLDLPK PSEAPIRVVL GPQDDEFGDA VATFLNGEWK ISATSDRMGY RLEGPEIRHL HGHNIVSDGT VDGSIQVPGN GQPIVLMPDR GTSGGYPKIA TVISADLGRL AQFQPGRPFR FKAVSMDEAQ AEYRAMAKLI RALPDRLQDA QQGMLDLDAL FTANVAGAAA NALDG
| |
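
The relationship between the gene sequence, the GC content field, and the protein sequence can be sketched in a few lines of Python. This is an illustration under stated assumptions, not the method used to produce the record: the helper names (`clean`, `gc_percent`, `translate`) are hypothetical, and the codon table is built by hand. NCBI translation table 11 assigns the same amino acids to codons as the standard code and differs only in the permitted start codons, so the standard assignments are used. Although the gene lies on the minus strand, the record lists the sequence in coding orientation (it begins with ATG and ends with TGA), so no reverse complement is applied.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons, enumerated in TCAG order (third base fastest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces that split the record's sequence into 10-letter blocks."""
    return "".join(seq.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]  # raises KeyError on non-ACGT characters
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence as listed in the record.
head = "ATGAGCAGGC TCGTCGTTGA CATCGTCGGC"
print(translate(head))  # MSRLVVDIVG
```

The translated fragment matches the first ten residues of the listed protein sequence. Passing the full gene sequence to `gc_percent` gives a value that can be compared with the GC content field.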