Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rpal_5223 |
Symbol | |
ID | 6412923 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | - |
Start bp | 5635587 |
End bp | 5636852 |
Gene Length | 1266 bp |
Protein Length | 421 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 642715113 |
Product | diaminopimelate decarboxylase |
Protein accession | YP_001994186 |
Protein GI | 192293581 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0019] Diaminopimelate decarboxylase |
TIGRFAM ID | [TIGR01048] diaminopimelate decarboxylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGTCATT TCGACTATCG CGACGGCGTG CTGCACGCCG AAGGCGTCAG CCTTGCCTCG ATCGCGCAGG ACGTCGGTAC GCCGTTCTAC TGCTACTCGA GCGCGACGCT GGAGCGGCAC TACCGGGTGT TCACCGAGGC CTTCGCCGGC CTCGATGCGC TGGTGTGCTA CGCCATGAAG GCCAATTCCA ACCAGTCGGT GCTGCGCACC CTGGCCAAGC TCGGTGCCGG CGCCGACGTG GTGTCGGGCG GCGAACTGCA GCGCGCGCTC GCGGCGGGAA TTCCACCGAG CAAGATCGTG TTCTCCGGCG TCGGCAAGAC CGAGGCCGAG CTGCGCGCCG CGCTCGCCCA CGACATCAAG TGCCTCAACG TCGAATCCGA GCCCGAGCTC GAACAGCTGT CGCGTATCGC GGTCGAGACC GGCCGCACGG CGCGGATTTC GCTGCGGGTC AATCCAGACG TCGATTCCGG CACCCACGCC AAGATCTCCA CCGGCAAGTC GGAGAACAAG TTCGGCATCC CGATTCGCCA CGCCCGCGAG GTCTATGCCC GCGCCGCCAA GCTGCCGGGC ATCCAGGTCA CCGGCGTCGA CGTCCATATC GGCAGCCAGA TCGTCGACCT GGCGCCGATG GAAGCCGCGT TCCGCAAGGT TGCCGAATTC ATCCACGTAC TGCGCGGCGA CGGTCACACG GTCAGCCATG TCGATTTCGG CGGCGGCCTC GGCATCCCGT ACTACGAGGA TCGCGACGCG CCGCCGGAGC CGTTCGCCTA TGCGGAGATG GTCAAGCGCG TCACCCACAA TCTCGGCTGC ACGCTGCTGT TCGAGCCGGG CCGGATGATC GTCGGCAACG CCGGCATCCT GGTCGCCAAG GTGATCTATG TGAAGCACGG CGACGGCAAG ACCTTCGTGA TCATCGACGC GGCGATGAAC GATCTGATCC GCCCGACGCT GTACGAGGCG TATCACGAGA TCATCGCCGT GCAGCAGCCG GCACCCGGCG TAGCTACCAT GGTGGCCGAC GTCGTCGGCC CGGTGTGCGA GACCGGCGAT TACCTCGCGC TCGATCGCAA GCTGCCTGAA CTGAAGGCCG GCGATCTGAT CGCGATCATG ACGGCGGGCG CTTACGGCGC GGTGCAGGCG TGTACTTACA ACACCCGCGC GCTGGTGCCG GAAGTGCTGG TGAAGGACGA TCAGGTTGCG GTGGTGCGTC CGCGCATCGA AGTCGAGCAA CTGATCGCGA TGGATAAGCC GGCGCCCTGG CTGTGA
|
Protein sequence | MRHFDYRDGV LHAEGVSLAS IAQDVGTPFY CYSSATLERH YRVFTEAFAG LDALVCYAMK ANSNQSVLRT LAKLGAGADV VSGGELQRAL AAGIPPSKIV FSGVGKTEAE LRAALAHDIK CLNVESEPEL EQLSRIAVET GRTARISLRV NPDVDSGTHA KISTGKSENK FGIPIRHARE VYARAAKLPG IQVTGVDVHI GSQIVDLAPM EAAFRKVAEF IHVLRGDGHT VSHVDFGGGL GIPYYEDRDA PPEPFAYAEM VKRVTHNLGC TLLFEPGRMI VGNAGILVAK VIYVKHGDGK TFVIIDAAMN DLIRPTLYEA YHEIIAVQQP APGVATMVAD VVGPVCETGD YLALDRKLPE LKAGDLIAIM TAGAYGAVQA CTYNTRALVP EVLVKDDQVA VVRPRIEVEQ LIAMDKPAPW L
|
| |