Gene Information |
Locus tag | Rsph17029_2384 |
Symbol | |
ID | 4897404 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009049 |
Strand | + |
Start bp | 2519439 |
End bp | 2520704 |
Gene Length | 1266 bp |
Protein Length | 421 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640112981 |
Product | diaminopimelate decarboxylase |
Protein accession | YP_001044258 |
Protein GI | 126463144 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0019] Diaminopimelate decarboxylase |
TIGRFAM ID | [TIGR01048] diaminopimelate decarboxylase |
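The coordinate and length fields above are mutually consistent, which can be checked with a small sketch. It assumes the Start bp and End bp values are 1-based and inclusive, and that the protein length excludes a single terminal stop codon. Neither convention is stated in the record, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one stop codon
    at the end that is not counted as a residue."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values copied from the record: Start bp and End bp.
length_bp = gene_length(2519439, 2520704)
print(length_bp, protein_length(length_bp))  # 1266 421
```

Both results agree with the Gene Length and Protein Length fields.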
| |
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.193755 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.99015 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACCATT TCCTCTACCG CAACGGGTCG CTCCACGCCG AGGAGGTGCC GCTCTCGGAC ATCGCCGCGA CCGTCGGCAC GCCCTTCTAC TGCTACTCGG CGGCCACGCT CGAACGGCAC TACCGGCTCT TCACCGAGGC GCTGACGCCC CTGCCGCATC TGGTCTGCTT CGCAATCAAG TCGCTGTCGA ACCTCGCGGT GCTGAAGCTT CTGGGCGATC TCGGCGCGGG GATGGATGTG GTCTCGGCGG GCGAATATCT GCGCGCGCGC GCTGCGGGTG TGCCGGGCGA CCGGATCGTC TTCTCGGGCG TGGGCAAGAC CCGCGAGGAG ATGCGGATCG CGCTCGAGGG CGGGATCCGG CAGTTCAACG TGGAATCCGA GCCCGAGATG CGGGCGCTGT CCGAGGTGGC CTCCTCGATG GGGCTGCGCG CGCCCATCGC CGTGCGGGTG AACCCCGACG TCGATGCCCG CACCCACGAG AAGATCGCCA CCGGCAAGAA AGAGAACAAG TTCGGCATCC CGATAGAGCG CGCCTCCGAG GTTTATGCCG AGGCCGCGGC CTTGCCCGGG CTCGAGGTCA TGGGGATCGA CGTCCATATC GGCTCGCAGC TGACCGAGCT CGAGCCCTTC GAGCAGGCCT ATCTGAAGGT GGCGGAGCTC ACCGGCCGGC TGCGCGCCGA GGGCCACGAG ATCCGCAGGC TCGACCTCGG CGGCGGTCTG GGCATCCCCT ACACCCGCTC GAACGAGGCC CCGCCGCTGC CCACCGATTA TGGCGCGCTC ATCAAGCGCA CCGTGGGCCA TCTCGGCTGC GAGATCGAGA TCGAGCCCGG GCGGCTGATC TCGGGCAATT CGGGCGTGCT GGTGAGCCGG GTCATCTATG TCAAGAACGG CGAGGGACGC GACTTCCTGA TCCTCGACGC GGCGATGAAC GATCTCGTGC GGCCCTCGAT GTATGGCGCG CACCACGACA TCGTGCCGCT GGCGGAGGCC GCGCCGGGCA CCGACAGCCA GCCCTACGAT GTGGTGGGCC CGGTCTGCGA GACGGGCGAC ACCTTCGCCA AGGCGCGCGC GCTGCCGCCG ATGGCCGAGG GCGATCTGGT GGCCTTCCGC TCGGCCGGGG CCTATGGCGC GGTGATGGCC TCGGAGTACA ATTCCCGCCC GCTGGTGCCC GAGGTTCTGG TGCGCGGGGA TCACTTCGCC GTCATACGGG CGAGACCGAC GTTTGACGAA ATGCTCGGCC GCGATAGCAT CCCCGAATGG CTGTGA
Protein sequence | MDHFLYRNGS LHAEEVPLSD IAATVGTPFY CYSAATLERH YRLFTEALTP LPHLVCFAIK SLSNLAVLKL LGDLGAGMDV VSAGEYLRAR AAGVPGDRIV FSGVGKTREE MRIALEGGIR QFNVESEPEM RALSEVASSM GLRAPIAVRV NPDVDARTHE KIATGKKENK FGIPIERASE VYAEAAALPG LEVMGIDVHI GSQLTELEPF EQAYLKVAEL TGRLRAEGHE IRRLDLGGGL GIPYTRSNEA PPLPTDYGAL IKRTVGHLGC EIEIEPGRLI SGNSGVLVSR VIYVKNGEGR DFLILDAAMN DLVRPSMYGA HHDIVPLAEA APGTDSQPYD VVGPVCETGD TFAKARALPP MAEGDLVAFR SAGAYGAVMA SEYNSRPLVP EVLVRGDHFA VIRARPTFDE MLGRDSIPEW L
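The sequences are printed in space-separated blocks of 10, so they need to be joined before use. The sketch below joins the blocks, computes the GC fraction, and translates the DNA. The helper names are hypothetical, and the codon table is built by hand on the assumption that translation table 11 assigns the same amino acids as the standard code (the two differ only in permitted start codons). To stay short, it uses only the first three blocks of the gene sequence rather than the full 1266 bp.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(raw: str) -> str:
    """Remove the block-of-10 spacing and normalize to upper case."""
    return "".join(raw.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
prefix = clean_sequence("ATGGACCATT TCCTCTACCG CAACGGGTCG")
print(translate(prefix))  # MDHFLYRNGS
print(round(gc_fraction(prefix), 3))
```

The translated prefix matches the first block of the protein sequence. Running the same functions on the full gene sequence would be the way to compare against the reported GC content and protein length.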