Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPD_1286 |
Symbol | |
ID | 4021763 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | - |
Start bp | 1448863 |
End bp | 1449717 |
Gene Length | 855 bp |
Protein Length | 284 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637961479 |
Product | HAD family hydrolase |
Protein accession | YP_568425 |
Protein GI | 91975766 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0647] Predicted sugar phosphatases of the HAD superfamily |
TIGRFAM ID | [TIGR01459] HAD-superfamily class IIA hydrolase, TIGR01459 [TIGR01460] Haloacid Dehalogenase Superfamily Class (subfamily) IIA |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.268881 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0567412 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCACGC TTCGTTTCGT CGATCACCTG CGCGAACTGA TGGGCTCTGT CGATGTCGTG TTGAGCGACA TCTGGGGCGT GGTCCATAAC GGCCTGGAAT CGTTTCCAGA GGCTTGCGAC GCGCTGCGCA CCGCGCGCAA CGAAGGCCGC ACTGTGGTGC TGATCACCAA TGCACCGCGC CCGGCCGACT CGGTGCAACG CCAGCTCCGC AAGCTGCACG TGCCCGACGA TTGCTACGAC GCCATCGTCT CATCCGGCGA CCTCACGCGC GCTTACGTCG CCGAGCACCC CGGTCAGTCG GTGTTCTGGC TCGGCCCGGA TCGCGACAAT TCGATCTATC GCGGCCTCGA TGCTGTCCTG ACGCCGCTGG ACCAGGCCGA CTACATCATC TGCACCGGCC CGTTCGACGA CGAGACCGAG TCGGCCGAGG ACTATCGCGA GATGATGGGC GAAGCGCTGC AGCGCAAGCT GAGGCTGATC TGCGCCAACC CCGACATCGT GGTCGAGCGC GGCGACCGGC TGATCTATTG CGCCGGCGCG ATCGCCGAAC TCTATCGCGA ACTCGGCGGC GACGTCATTT TCTATGGCAA GCCGCATCGG CCGATCTACG ACCGCGCCAT GGCGATCGCG CGTGAGCTGC GCAACGCCGA GACGCCGTTG CAGCGCGTGC TGGCGATCGG CGACTCGGTG CGCACGGACC TTGCCGGCGC GCAGAGCTAC GGCATCGATC TGCTGTTCGT CACCCGCGGC ATCCATTCCG ACGCCTTCGA GGGCATCGAC CGCCTGGATA CCGACGCTGT CAGCGAACTG TTCGGCCACC CGCCGCTGGC GCTGACCCGC GAATTACGGT GGTAA
|
Protein sequence | MTTLRFVDHL RELMGSVDVV LSDIWGVVHN GLESFPEACD ALRTARNEGR TVVLITNAPR PADSVQRQLR KLHVPDDCYD AIVSSGDLTR AYVAEHPGQS VFWLGPDRDN SIYRGLDAVL TPLDQADYII CTGPFDDETE SAEDYREMMG EALQRKLRLI CANPDIVVER GDRLIYCAGA IAELYRELGG DVIFYGKPHR PIYDRAMAIA RELRNAETPL QRVLAIGDSV RTDLAGAQSY GIDLLFVTRG IHSDAFEGID RLDTDAVSEL FGHPPLALTR ELRW
|
| |