Gene Information |
Locus tag | RPD_2835 |
Symbol | |
ID | 4023333 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | + |
Start bp | 3158315 |
End bp | 3159010 |
Gene Length | 696 bp |
Protein Length | 231 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637963033 |
Product | HAD family hydrolase |
Protein accession | YP_569964 |
Protein GI | 91977305 |
COG category | [R] General function prediction only |
COG ID | [COG1011] Predicted hydrolase (HAD superfamily) |
TIGRFAM ID | [TIGR01509] haloacid dehalogenase superfamily, subfamily IA, variant 3 with third motif having DD or ED; [TIGR01549] haloacid dehalogenase superfamily, subfamily IA, variant 1 with third motif having Dx(3-4)D or Dx(3-4)E |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.160416 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTTCTTA ACGATTATGC GCTTCTGCCC GCCTCGCCAT TACGGTCCGT TCCTGTTAGG CAGATTCACA TGACCAGTCC TCTCGCTCCC TCGGCCGCCG ACGCACTGCT GTTCGATCTC GGCGGGGTCG TGATCGACTT CGATCTCGCG CGTACGCTCA AAGCCTGGGC GGTCGAACTC GGCAACGATC CCTCTGCGAT GCTGGCGACG CTGGCGCGCA ACGACACTTT CCATCGCTAT GAGACCGGCC ACGTCACCGA CGCGGAATTC TTCGCCTCCG TACGCGCGGC GCTGCGGCTC GATCTCAGCG ACGATCAACT GCGCGAGGGC TGGAACGCGA TCTTCGTCGG CGAAATTGCG GGCATCGCGC CGCTGTTGGC GCGGGCCGCG AGTCGTCTGC CGCTGTATGC GCTCTCCAAT ACCAACGATG CGCACATCGC GCATTTCTCG GAGCGCTACA GCGGCTTGCT GAAGCCGTTT CGGGAATTGT TCCTGTCGTC GCAGATCGGA CTGCGCAAGC CGAACGCCGA GGCTTACGAC TTCGTCGTGA ACGCGATCGG CGTTGCGCCG TCGCGCATTG TGTTCTTCGA CGATCTGGCC GAAAATATCG AGGCTGCGCG CAAGCGCGGG CTGCAGGCCG TCCATGTCCG CTCCAGCGCG GATGTGGCGC AAGCGCTGGA TCAACTCGGG CTGTAG
|
Protein sequence | MFLNDYALLP ASPLRSVPVR QIHMTSPLAP SAADALLFDL GGVVIDFDLA RTLKAWAVEL GNDPSAMLAT LARNDTFHRY ETGHVTDAEF FASVRAALRL DLSDDQLREG WNAIFVGEIA GIAPLLARAA SRLPLYALSN TNDAHIAHFS ERYSGLLKPF RELFLSSQIG LRKPNAEAYD FVVNAIGVAP SRIVFFDDLA ENIEAARKRG LQAVHVRSSA DVAQALDQLG L
|
| |
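The length fields in this record can be cross-checked against the coordinates. The sketch below is illustrative only and is not part of the source database. It assumes the start and end positions are 1-based and inclusive, and that the gene length counts the terminal stop codon (the listed gene sequence ends in TAG). The function names `gene_length`, `protein_length`, and `gc_percent` are hypothetical helpers introduced here.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """GC percentage of a sequence; whitespace between 10-base groups is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(base in "GC" for base in bases)
    return 100.0 * gc / len(bases)


# Values taken from the record above (locus tag RPD_2835).
print(gene_length(3158315, 3159010))  # 696
print(protein_length(696))            # 231
```

Both results agree with the Gene Length (696 bp) and Protein Length (231 aa) fields. `gc_percent` can be applied to the Gene sequence field as listed, since it strips the spaces between the 10-base groups before counting.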