Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_1347 |
Symbol | |
ID | 5704274 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 1554168 |
End bp | 1555094 |
Gene Length | 927 bp |
Protein Length | 308 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641270858 |
Product | dihydrodipicolinate synthase |
Protein accession | YP_001536239 |
Protein GI | 159036986 |
COG category | [E] Amino acid transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0329] Dihydrodipicolinate synthase/N-acetylneuraminate lyase |
TIGRFAM ID | [TIGR00674] dihydrodipicolinate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.0665282 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.0000471677 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGACGCGCG ACCTCCCTGA CCCTTCCACC CGGGCGGCGT CCCGGCCGTT CGGGCGGGTG CTCACGGCCA TGGTGACACC CTTCACCGCC GATGGCGTGC TCGACCTCGA CGGGGCGGCA CGGTTGGCGA ACTACCTGGT TGACGAGCAG GGCAGCGACG CGCTGGTGGT CAACGGCACT ACCGGTGAGT CGCCGACCAC GACCGACGCG GAGAAGGAAC GTCTGATCCG GACGGTGGTG GCGGCCGTTG GTGACCGGGC CAAGGTGGTG GCCGGCGTCG GTACCAACGA CACCCGGCAC ACGATCGAGC TGGCCGGCAG CGCCGAGAAG GCCGGTGCAC ACGGCCTGCT GGTGGTGACT CCCTACTACA GCAAGCCGCC GCAGGCGGGC CTGGTGCGAC ACTTCACTGC GGTGGCCGAC GCCAGCGGGC TACCGCTGAT GCTCTACGAC ATTCCACACC GCGCCGGGGT GGCGATCGAG ACCGAGACGC TCGTCCGACT CGCCGAGCAC GGCCGGATCG TCGCGGTGAA GGACGCCAAG ACCGACCTGA CTGCGACCAG CTGGGTGACC AGCCGGACCG GCCTGGCTTT CTACTGCGGT GAGGACCCGC TCATCCTGCC CGCGCTGGCG GTGGGGTCGG TCGGGGTGGT GGGCACGTCG ACCCACTTCA CCGGGGCGCG GACCCAGGAG ATGATCCGGG CGTTCGAGGC GGGAGACAAC GCCACGGCGC TCGCCCTGCA CCGGCGGCTG CTGCCGCTGT ACACGGGCAT CTTCCGGACT CAGGGCGTGA TCCTGGTCAA GGCCGGCCTG ACGGCGAAGG GACTGCCCGC CGGCCCGGTG CGTGCCCCGC TGGTGGACGC CACCGCCGAC CAGCTTGCCC AGCTCCGCGC CGACTGCGCG GCGGCGGGCC TGGACCTCCC TGAATGA
|
Protein sequence | MTRDLPDPST RAASRPFGRV LTAMVTPFTA DGVLDLDGAA RLANYLVDEQ GSDALVVNGT TGESPTTTDA EKERLIRTVV AAVGDRAKVV AGVGTNDTRH TIELAGSAEK AGAHGLLVVT PYYSKPPQAG LVRHFTAVAD ASGLPLMLYD IPHRAGVAIE TETLVRLAEH GRIVAVKDAK TDLTATSWVT SRTGLAFYCG EDPLILPALA VGSVGVVGTS THFTGARTQE MIRAFEAGDN ATALALHRRL LPLYTGIFRT QGVILVKAGL TAKGLPAGPV RAPLVDATAD QLAQLRADCA AAGLDLPE
|
| |