Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17029_3393 |
Symbol | |
ID | 4898395 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009050 |
Strand | - |
Start bp | 447128 |
End bp | 448033 |
Gene Length | 906 bp |
Protein Length | 301 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640113990 |
Product | 5-dehydro-4-deoxyglucarate dehydratase |
Protein accession | YP_001045258 |
Protein GI | 126464145 |
COG category | [E] Amino acid transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0329] Dihydrodipicolinate synthase/N-acetylneuraminate lyase |
TIGRFAM ID | [TIGR03249] 5-dehydro-4-deoxyglucarate dehydratase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACCCGC AACAGATCAA GGCGGCCCTC GGCTCCGGCC TGCTCTCCTT CCCCGTCACG CCGTTCGACG CCGAGAACAG GTTCGCCGCC GCCCCCTACC AGAAGCACGT CGAGTGGCTG TCCGGCTTCG ACGCGCCGGT TCTCTTCGCC GCGGGCGGCA CCGGAGAGTT CTTCTCGCTG ACCCCTGACG AGATCCCCGC CATCGTCAGG GCCGCCAAGG AGAGCGCCGG CAAGACCGCC ATCGTCTCGG GCTGCGGCTA CGGGACCGAG ATCGCCCGGG GCATCGCGCG CTCGGTCGAG GCGGCGGGCG GCGACGGCAT CCTGCTGCTG CCGCATTACC TGATCGACGC GCCGCAGGAA GGGCTCTACG CCCATGTCAA GGCGGTCTGC CAGGCCACCG GCATGGGCGT CATGGTCTAC AACCGCGACA ATGCCGTGCT GCAGGCCGAC ACGCTGGCGC GGCTCTGTGA CGACTGCCCG AACCTCGTGG GCTTCAAGGA CGGCACCGGC GACATCGGCC TCGTGCGCCA GATCACCGCG AAGATGGGCG ACCGGCTGAC CTATCTCGGC GGCATGCCGA CGGCGGAGCT CTTTGCCGAG GCCTACCTCG GAGCGAGCTT CACCACCTAC AGTTCGGCGG TGTTCAACTT CGTCCCGGCG CTGGCCAACA AGTTCTACGC CGCGCTGCGG GCCGGGGATC GCGCCACCTG CGAGAGCATC CTCAACAGCT TCTTCTACCC GTTCATGGAG CTCCGCTCGC GCCGCAAGGG CTATGCGGTC GCGGCGGTCA AGGCGGGCGT GCGGCTGGTG GGCTTCGACG CGGGCCCGGT GCGGGCGCCG CTCTCGGATC TGACCGGCGA GGAGGAAGAG ATCCTCAAGG CCCTGATCGA CGCGCATCGG GAATGA
|
Protein sequence | MDPQQIKAAL GSGLLSFPVT PFDAENRFAA APYQKHVEWL SGFDAPVLFA AGGTGEFFSL TPDEIPAIVR AAKESAGKTA IVSGCGYGTE IARGIARSVE AAGGDGILLL PHYLIDAPQE GLYAHVKAVC QATGMGVMVY NRDNAVLQAD TLARLCDDCP NLVGFKDGTG DIGLVRQITA KMGDRLTYLG GMPTAELFAE AYLGASFTTY SSAVFNFVPA LANKFYAALR AGDRATCESI LNSFFYPFME LRSRRKGYAV AAVKAGVRLV GFDAGPVRAP LSDLTGEEEE ILKALIDAHR E
|
| |