Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hhal_1199 |
Symbol | |
ID | 4710363 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | + |
Start bp | 1304016 |
End bp | 1305272 |
Gene Length | 1257 bp |
Protein Length | 418 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 639855672 |
Product | diaminopimelate decarboxylase |
Protein accession | YP_001002776 |
Protein GI | 121997989 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0019] Diaminopimelate decarboxylase |
TIGRFAM ID | [TIGR01048] diaminopimelate decarboxylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0292186 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCACAC CCCCTGGCTT CGCCCGCTAC GCCGATCGCC TGTGGGCTGA AGCACTGCCC CTGGACGAGC TCGCCGAACG TTTCGGCACC CCGTGCTACG TCTACTCCCG CGCCGCCATC GAGGAGCGCT GGGCACTCTA CCGGAGCGCC CTCGGCGTCG CCGGGGACGT CTGCTACGCC GTCAAGGCCA ACGGCAACCT CGCCCTGCTG CAACTGCTCG CCCGCCACGG AGCCGGATTC GACATCGTCT CCGGCGGCGA GCTCGAACGC GTGCTGCACG CCGGCGGGGA CGCCGCGCGG GTGGTCTTCT CGGGTGTTGG CAAAGGCACC GACGAGATCC GTCGCGCCCT GCAAGCCGGT ATCCGCTGCT TCAACGTGGA GTCCGCTGCC GAACTCGAAC GCATCGCCAG CGTGGCCGCC ACCGAGAACA CCCCGGCGCC GGTGGCCCTG CGCGTCAACC CCGACGTCAA CCCCGAGACC CACCCCTACA TCGCCACCGG CCTGGCCCAG AGCAAATTCG GCATCGCCCT GGAGGAGGCC GAGGCCCTCT ACCTCCAAGC GGCTAACGAT CCGCGCCTCG AGGTCCGTGG CATCGCCTGC CACATCGGCT CGCAACTGCT CTCGGTGGCA CCGCTGACCG AGGCGGCAGA ACGGCTGGCG GCGCTGGCCC GGCGACTGCA GGAGCAGGGC ATCTCCCTGG ACCACATCGA CGCCGGCGGT GGCCTCGGCG TGCACTACAT CGATGAGCAG CCGCCAACCC CGGCCGAGCA CATCGAGGCG ATCTCGGCGC CGCTGCGCGA CCTTGGCGTA TCCGTCCTGG TCGAGCCCGG GCGCGCCATT GTTGCCGAGG CCGGCATCCT GCTCACCCGC ATCGAGTATC TCAAGCACAA CGGCGGCAAG GAGTTCGCCA TCGTCGACGC CGGCATGAAC GACTACCTGC GCCCAGCGCT CTACGACGCC GCCCACACCC TGGAGGCGGT CACCCCGTCG GAAGCCGAAC TTCGACCGGT GGACGTGGTC GGACCGGTCT GCGAATCCGC CGACACCTTC GCCCGCGACT GCACGCTGCC GGCAGCCGCC GGCGGCTTGC TGGCCATCCG TAGCGCCGGC GCCTACGGTG CCGTCATGGC CTCGCAGTAC AACGCCCGGC CCCGACCGCC CGAGGTGCTG GTGGACGGCA CCCAGGCGCA CCTGATCCGC CGGCGCGAGA CCATCGACGA GCTGATGAGC GGGGAATCGC TCCTGCCGGA GGGCTGA
|
Protein sequence | MSTPPGFARY ADRLWAEALP LDELAERFGT PCYVYSRAAI EERWALYRSA LGVAGDVCYA VKANGNLALL QLLARHGAGF DIVSGGELER VLHAGGDAAR VVFSGVGKGT DEIRRALQAG IRCFNVESAA ELERIASVAA TENTPAPVAL RVNPDVNPET HPYIATGLAQ SKFGIALEEA EALYLQAAND PRLEVRGIAC HIGSQLLSVA PLTEAAERLA ALARRLQEQG ISLDHIDAGG GLGVHYIDEQ PPTPAEHIEA ISAPLRDLGV SVLVEPGRAI VAEAGILLTR IEYLKHNGGK EFAIVDAGMN DYLRPALYDA AHTLEAVTPS EAELRPVDVV GPVCESADTF ARDCTLPAAA GGLLAIRSAG AYGAVMASQY NARPRPPEVL VDGTQAHLIR RRETIDELMS GESLLPEG
|
| |