Gene Information |
Locus tag | RPB_1626 |
Symbol | |
ID | 3909903 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 1832529 |
End bp | 1833524 |
Gene Length | 996 bp |
Protein Length | 331 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 637883521 |
Product | mandelate racemase/muconate lactonizing protein |
Protein accession | YP_485246 |
Protein GI | 86748750 |
COG category | [M] Cell wall/membrane/envelope biogenesis; [R] General function prediction only |
COG ID | [COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily |
TIGRFAM ID | |
|
|
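The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch in Python, assuming the start and end positions are 1-based and inclusive and that the reported protein length excludes the stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(1832529, 1833524)
print(length, protein_length(length))  # 996 331
```

Under these assumptions the result matches the Gene Length (996 bp) and Protein Length (331 aa) fields.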
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.87577 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.584228 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGTCCA TCAATCCGAC CCAATTGACT GCACGCATCG AGCATTGGCC GATCGCCGGA GCCTTCACCA TCAGCCGCGG CGCCAAGACC GAAGCGGTGG TCGTGACGGC CGAGATCAGG CGTAACGGAC AAGCCGGCCG CGGCGAATGC GTTCCTTACG CCCGCTACGG CGAGACGCCG GAGACCACCG TCGCCGCGAT CGAGGCGATG CGCGCGCCGC TGCGTCAGGG GCTCGACCGC GCCGGCCTGC AATCGGCGAT GCCGCCGGGC GCGGCGCGCA ATGCGCTGGA TTGCGCGCTG CTCGACCTCG AGGCCGGGCT GAGCGGACGC CGCGCCTGGG AGCTGCTCGG CAGCGCCGCG CCGCAACCGG CGACCACCGC CTACACCATT TCGCTGGGCA CGCCGGAGGC GATGGCGGAG GCGACCGCGA AAGCCGCCGC CCGCCCGCTG CTGAAGATCA AGCTCGGCGG CGACGGCGAC GATGCCCGGA TCGCCGCGGT CCGCCGCGCC GCGCCGCGAG CCGAACTGAT CGTCGACGCC AACGAGGCCT GGACGCCGGA CAATCTCGCC CGCAACCTCG CCGCCTGCGC CGGCGCCGGC GTGACGCTGG TGGAGCAGCC GCTGCCGGCC GGGCGCGACG AGGCGCTGGC GCAAATCCGC CGGCCGATCG CGGTCTGCGC CGACGAGAGC GTCCACGCCC GCGCCTCGCT GGATGCCCTG CGCGGCCGCT ACGACGCCGT CAACATCAAG CTCGACAAGA CCGGCGGCGT CACCGAGGCG ATGGCGATGG CGGAGGCGGC GCGCGCGCTC GGCCTCGACA TCATGGTCGG CTGCATGGTC GCGACCTCGC TGTCGATGGC GCCGGCGATG CTGCTGACGC CGCACGCCCG CTTCGTCGAT CTCGACGGCC CGCTGCTGCT GGCGAAGGAT CGCGACGACG GCCTGCGCTA CGACGGCAGC ATCGTCTATC CGCCGGAGCC CTCGTTGTGG GGCTGA
|
Protein sequence | MTSINPTQLT ARIEHWPIAG AFTISRGAKT EAVVVTAEIR RNGQAGRGEC VPYARYGETP ETTVAAIEAM RAPLRQGLDR AGLQSAMPPG AARNALDCAL LDLEAGLSGR RAWELLGSAA PQPATTAYTI SLGTPEAMAE ATAKAAARPL LKIKLGGDGD DARIAAVRRA APRAELIVDA NEAWTPDNLA RNLAACAGAG VTLVEQPLPA GRDEALAQIR RPIAVCADES VHARASLDAL RGRYDAVNIK LDKTGGVTEA MAMAEAARAL GLDIMVGCMV ATSLSMAPAM LLTPHARFVD LDGPLLLAKD RDDGLRYDGS IVYPPEPSLW G
|
| |
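The gene and protein sequences above are related by translation table 11, which uses the same codon-to-amino-acid assignments as the standard code and differs mainly in the start codons it permits. As a minimal sketch, assuming the space-separated 10-character blocks are only a display convention, the Python below strips the whitespace, computes a GC fraction, and translates codons up to the first stop. The helper names are hypothetical, and only the first 30 bases of the gene are used for brevity.

```python
from itertools import product

# Standard codon assignments in NCBI order (first base T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the display whitespace and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from the first base, stopping at a stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the Gene sequence field of this record.
prefix = "ATGACGTCCA TCAATCCGAC CCAATTGACT"
print(translate(prefix))  # MTSINPTQLT
```

The output matches the first block of the Protein sequence field. The same functions can be applied to the full Gene sequence field to check the reported protein sequence and GC content.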