Gene Information |
Locus tag | Sare_0727 |
Symbol | |
ID | 5704499 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 809753 |
End bp | 810700 |
Gene Length | 948 bp |
Protein Length | 315 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641270245 |
Product | 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase |
Protein accession | YP_001535637 |
Protein GI | 159036384 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG1947] 4-diphosphocytidyl-2C-methyl-D-erythritol 2-phosphate synthase |
TIGRFAM ID | [TIGR00154] 4-diphosphocytidyl-2C-methyl-D-erythritol kinase |
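The length fields above are consistent with each other if the coordinates are read as 1-based and inclusive, and if the gene length counts the terminal stop codon. Both readings are assumptions inferred from the numbers in this record, not something the record states. A minimal Python sketch of that arithmetic follows; the function names are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count, assuming the CDS length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of seq.

    Whitespace and any other characters are ignored.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotides found")
    return 100.0 * sum(b in "GC" for b in bases) / len(bases)


# Coordinates taken from the Start bp / End bp fields of this record.
length = gene_length_bp(809753, 810700)
print(length, protein_length_aa(length))  # prints: 948 315
```

The gc_percent helper can be applied to the Gene sequence field to compare against the reported GC content; it skips the spaces that separate the 10-base groups.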
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.104965 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.000687036 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
Sequence |
Gene sequence | GTGACCGAGG CATGGAGACC CGATGGCGAT GAACCGCGAG GCGTCAGCGG GCCGTTCCGC GTACGGGTGC CCGCCAAGAT CAATCTGCAT CTCGGGGTGG GACCACTGCG CCCCGACGGT TATCACGAGT TGAACACCGT CTATCACGCG ATCTCCATCC ACGACGAACT GACCGCCCGC AGGGGCGACA CCCTTGCCCT GACGATGGAG GGTGAGGGGG CCGGTGAGCT GGCCCTGGAC GACTCCAACC TGGTCATCCG CGCAGCCCGC GCCCTCGCCG CACACGCCGG GGTGCCTCCA TATGCCCGGC TGCACCTGCG CAAGCAGATC CCCCTGGCCG GTGGGCTGGC CGGCGGCAGC GCCGACGCCG CCGCCGCGCT GGTCGCCTGC GACGCACTCT GGGGAACCGG GCTGTCCCGC GACGAACTGG CGGGGATCGC CGCCGACCTC GGCTCCGACG TGCCGTTTCT GATCCACGGT GGCACCGCGC TGGGCACCGG CCGAGGGGAG GCGGTCAGCC CGGTACTGGC CCGGCCGACC GTATGGCACT GGGTTGTCGC AGTCGCCGAC GGGGGCCTGT CCACGCCGGT CGCGTACCGG GAACTCGATC GGCTCCGGGA CGCCGGCGCC GCCAGTACGC CGCTCGGCAG TAGCGACGCC CTGCTGGCCG CCCTCCGGCA GCGGGACCCC CGGGTGCTCG CCGGGGTCCT CGGCAACGAT CTCCAGGATG CCGCGCTTGC CCTGCGGCCG TCCCTGGCCG CCACCCTGAA GGCGGGCGAG GCGGCGGGTG CGCTCGCGGG CATCGTCTCC GGCTCCGGGC CGACCTGCGT GTTCCTCGCC GCCAACGCGG CCGACGCAGA GCGCGTAGCC GGTGAGTTGA GCGCCCTCGA CGTGTGCCGA CAGGCGCGCA CCGCGCGCGG CCCGGTTGCC GGGGCCCGGG TTGGTTGA
Protein sequence | MTEAWRPDGD EPRGVSGPFR VRVPAKINLH LGVGPLRPDG YHELNTVYHA ISIHDELTAR RGDTLALTME GEGAGELALD DSNLVIRAAR ALAAHAGVPP YARLHLRKQI PLAGGLAGGS ADAAAALVAC DALWGTGLSR DELAGIAADL GSDVPFLIHG GTALGTGRGE AVSPVLARPT VWHWVVAVAD GGLSTPVAYR ELDRLRDAGA ASTPLGSSDA LLAALRQRDP RVLAGVLGND LQDAALALRP SLAATLKAGE AAGALAGIVS GSGPTCVFLA ANAADAERVA GELSALDVCR QARTARGPVA GARVG
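The gene sequence begins with GTG while the protein sequence begins with M. Under translation table 11 (the bacterial, archaeal and plant plastid code), GTG is one of the permitted alternative start codons, and an initiating codon is read as methionine. The sketch below illustrates that rule in Python. The function and constant names are hypothetical, the codon table is built from the standard amino acid assignments that table 11 shares with table 1, and the start codon set is the one listed for table 11 by NCBI.

```python
from itertools import product

# Codons enumerated in TCAG order, matching the amino acid string below.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(seq: str) -> str:
    """Translate a CDS, reading a valid first codon as M.

    Whitespace (such as the 10-base grouping used in this record) is
    removed first. Translation stops at the first stop codon.
    """
    dna = "".join(seq.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(dna), 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiating codon is translated as methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the Gene sequence field above.
print(translate_cds("GTGACCGAGG CATGGAGACC CGATGGCGAT"))  # prints: MTEAWRPDGD
```

The printed residues match the first ten residues of the Protein sequence field. Passing the full Gene sequence field to translate_cds is a way to compare the whole translation against the listed protein.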