Gene Information |
Locus tag | Cphamn1_0976 |
Symbol | |
ID | 6374645 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides BS1 |
Kingdom | Bacteria |
Replicon accession | NC_010831 |
Strand | + |
Start bp | 1050144 |
End bp | 1051394 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 642683477 |
Product | diaminopimelate decarboxylase |
Protein accession | YP_001959400 |
Protein GI | 189499930 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0019] Diaminopimelate decarboxylase |
TIGRFAM ID | [TIGR01048] diaminopimelate decarboxylase |
|
|
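The coordinate and length fields in the table above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 1050144
end_bp = 1051394

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A complete CDS should be a whole number of codons.
codons, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and contributes no residue.
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the computed values agree with the Gene Length (1251 bp) and Protein Length (416 aa) fields.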
Plasmid Coverage Information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.351187 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage Information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.117285 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCTTGACA GTAATTTATT TCCGTTTTCC AACGGGGTTC TGTGTTGTGA AGGGGTTTCG CTTGAAAGAC TTGCGGAAAA GTACGGGACC CCTCTTTTCG TGACCAGTCG AAACAGCCTT GTCAATCAGT ACAGGTCATT TGAAAAGGCG TTTGCCTTGC TGGATCACCT GACCTGTTAC TCTGTGAAGG CTAATTTCAA CCTTCATATC ATTCGTACGC TTGCCGCCGA GGGTTGCGGT TTCGATGTCA ATTCAGGCGG AGAGCTTTAC CGTGCGCTTC AGGCAGGGGC TGATCCGGCA AAGATCATCA TGGCAGGCGT CGGCAAGAGC GCTGCTGAAA TTGAATACGC CATCAGTTCC GGTATCCTGA TGCTCAAGAC AGAGTCTCTC TCTGAGCTTC GCCTGATTGA TGAAATCGCA GGTCGTCTGG GTGTGCAGGC ATCTGTGGGA ATCAGGGTCA ATCCGAACGT GACCGCTGAA ACGCATCCCT ACATTACCAC CGGTGACAGT AAAGAGAAGT TTGGTATTGA CGAGACGGAT CTTGCCCAGG TGTTTTCCCT GATCGGAGCT ATGGATCATG TCAATCTGAC GGCTCTCGAC ATGCATATCG GATCACAGAT TTTCGATACG GAATTCTATC ACGCTGCTTC GGAAAAACTG CTTGACGTGC TTGCCGTTGC GCGTTCTGCC GGGTTTGCTA TTCGTTACTT CGACATAGGC GGCGGGTTTC CGGTTACCTA TGACCCGCAA AAACCTGCGA CGCCGATAGG ACATTTTGCT GAAAAGCTTA TTCCTTTGCT TGAAAAGGCA GGGACAACGA TACTTTTCGA GCCCGGCAGG TTTATCGCGG CCAATTCGAC CGTACTCGTT ACCCGGGTAC TCTACAGGAA ACGCAATCAC GCAGGAAAGG AGTTTGTGAT TGTCGATGCA GGAATGACCG AGCTTATCCG TCCTGCACTG TACCAGTCCC ACCATGAAAT TGTCGCGGTC AAACCGCATG ACTCCATGAT GGTGGCTGAT GTCGTGGGCC CGGTGTGTGA GTCTGGTGAC TTTTTTGCAA GGGCGAGAAC CATAGACGCC GTCGGAGAGG GAGAGCTTCT TGCCGTTCTT TCAAGCGGAG CATACGGTTC AGTCATGGGG AACAATTACA ACGGCCGTCT TCGCCCGGCT GAGGTCATGG TTGACGGCGA CGATGCAACG CTTATCCGCA AAAGGGATAC GTTCGAGCAA CTCATCCAGA ATGAAGTATA G
|
Protein sequence | MLDSNLFPFS NGVLCCEGVS LERLAEKYGT PLFVTSRNSL VNQYRSFEKA FALLDHLTCY SVKANFNLHI IRTLAAEGCG FDVNSGGELY RALQAGADPA KIIMAGVGKS AAEIEYAISS GILMLKTESL SELRLIDEIA GRLGVQASVG IRVNPNVTAE THPYITTGDS KEKFGIDETD LAQVFSLIGA MDHVNLTALD MHIGSQIFDT EFYHAASEKL LDVLAVARSA GFAIRYFDIG GGFPVTYDPQ KPATPIGHFA EKLIPLLEKA GTTILFEPGR FIAANSTVLV TRVLYRKRNH AGKEFVIVDA GMTELIRPAL YQSHHEIVAV KPHDSMMVAD VVGPVCESGD FFARARTIDA VGEGELLAVL SSGAYGSVMG NNYNGRLRPA EVMVDGDDAT LIRKRDTFEQ LIQNEV
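The relationship between the gene sequence and the protein sequence can be sketched with a small translator for translation table 11. This is a minimal illustration, not the tool used to produce the record. It assumes the amino acid assignments of table 11 match the standard genetic code, and that an alternative start codon such as GTG is read as methionine when it is the first codon. `START_CODONS` lists only a subset of the starts permitted by table 11, and the function names are hypothetical.

```python
from itertools import product

# Codons in the conventional TCAG order, paired with the standard amino acid
# assignments (assumed here to apply to translation table 11 as well).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a subset of the start codons allowed in table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(dna):
    """Remove the spaces used to group bases in blocks of ten."""
    return "".join(dna.split()).upper()


def translate_cds(dna):
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(dna)
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an alternative start codon still initiates with Met
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    seq = clean(dna)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases of the gene sequence listed above.
print(translate_cds("GTGCTTGACA GTAATTTATT TCCGTTTTCC"))  # MLDSNLFPFS
```

The first ten residues produced here match the start of the listed protein sequence, with the initial GTG codon read as M. Passing the full gene sequence to `translate_cds` and `gc_fraction` allows the same comparison against the complete protein sequence and the reported GC content.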