Gene Information |
Locus tag | RPB_2100 |
Symbol | |
ID | 3908514 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 2387545 |
End bp | 2388531 |
Gene Length | 987 bp |
Protein Length | 328 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637883993 |
Product | NAD-dependent epimerase/dehydratase |
Protein accession | YP_485717 |
Protein GI | 86749221 |
COG category | [G] Carbohydrate transport and metabolism; [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0451] Nucleoside-diphosphate-sugar epimerases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.415706 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.000106106 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | TTGCACATCC TCATCCTCGG CGCCGCCGGC ATGGTCGGGC GCAAACTCAC GGAGCGGCTG CTCGCCGACG GTCGCCTCGG TGATCGCGAG ATCACCCGGA TGACGCTGCA GGACGTGGTC GCGCCGGCCA CGCCCGCCAA GGCGGCGATG CCGATCACGA CGATCGTCAG CGATCTCGCC GAGCCGGGGC AGGCGGCGGC GCTGGTGGCG CATCGGCCGG AGGTGATCTT CCATCTCGCC GCGATCGTCT CCGGCGAGGC CGAGGCCGAT TTCGACAAGG GCTACCGCAT CAATCTCGAC GGCACGAGGC ATCTGATCGA CGCGATCCGC GCGGAAGGCG ACGACTATCA TCCGCGGCTG GTGTTCACCT CGTCGATCGC GGTGTTCGGC GCGCCGTTCC CCGAGAAAAT CGGCGACGAA TTCCTCTCCG CGCCGCTCAC CAGCTACGGC ACCCAGAAGG CGATCTGCGA ACTCTTGATC GCCGACTATA CCCGCAAGGG CTTTCTCGAC GGCGTCGGCA TTCGCCTGCC GACGATCTGC GTCCGCCCCG GCACGCCCAA CAAGGCGGCC TCCGGCTTCT TCTCCAACAT CATCCGCGAG CCGCTCGCCG GCCACGAGGC GGTGCTGCCG GTCTCCGACG ACGTGATGCA CTGGCACGCC TCGCCGCGCT CCGCGGTCAG CTTCCTGATC CATGCCGGCA CGATGGACAC GCAGGCGATC GGCCCGCGCC GCAATTTGTC GATGCCCGGT CTCGCCGCCA CCGTCGGCGA ACAGATCGCG GCGCTCGAAC GCGTCGCCGG CAAGGGCGTC GTGGCGCGGA TCAGGCGCGA GCCCGATCCG GTGATCATGG GCATCGTCGC CGGCTGGCCG CGCAATTTTG CGACCGACCG CGCGCTCGCG CTCGGCTTCA CCACCGCGGA ACAGAGCTTC GACGACATCA TCCGGATTCA CATCGAGGAC GAACTGGGCG GGAATTTTGC CGCCTGA
|
Protein sequence | MHILILGAAG MVGRKLTERL LADGRLGDRE ITRMTLQDVV APATPAKAAM PITTIVSDLA EPGQAAALVA HRPEVIFHLA AIVSGEAEAD FDKGYRINLD GTRHLIDAIR AEGDDYHPRL VFTSSIAVFG APFPEKIGDE FLSAPLTSYG TQKAICELLI ADYTRKGFLD GVGIRLPTIC VRPGTPNKAA SGFFSNIIRE PLAGHEAVLP VSDDVMHWHA SPRSAVSFLI HAGTMDTQAI GPRRNLSMPG LAATVGEQIA ALERVAGKGV VARIRREPDP VIMGIVAGWP RNFATDRALA LGFTTAEQSF DDIIRIHIED ELGGNFAA
|
| |
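The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch, not part of the source database: it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The helper names `span_length` and `coding_codons` are hypothetical. Note that the gene sequence begins with TTG while the protein begins with M; TTG is one of the alternative initiation codons permitted under translation table 11.

```python
# Values copied from the record above.
START_BP = 2387545
END_BP = 2388531
GENE_LENGTH_BP = 987
PROTEIN_LENGTH_AA = 328

# First and last codons of the listed gene sequence.
FIRST_CODON = "TTG"  # alternative start codon under translation table 11
LAST_CODON = "TGA"   # stop codon


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print("length from coordinates:", span_length(START_BP, END_BP))
    print("expected protein length:", coding_codons(GENE_LENGTH_BP))
```

Under these assumptions both derived values agree with the Gene Length and Protein Length fields of the record.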