Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_1100 |
Symbol | |
ID | 5707021 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 1236804 |
End bp | 1238426 |
Gene Length | 1623 bp |
Protein Length | 540 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641270615 |
Product | hypothetical protein |
Protein accession | YP_001535999 |
Protein GI | 159036746 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.575745 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0877269 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGGAGC GGATCGAGGG GCACGGCGGG GCGCTCGCGC TCGCGGCGCT GCGTGGGCAC GGCGTTCGGG AGATGTTCAC CCTCTCCGGG GGGCATGTCT TCCCGCTCTA CGACGCCGCG CACCAGGCCG GGTTCCCGCT CTACGACGTC CGGCACGAGC AGTCGGCGGT TTTCGCGGCC GAGGCTGTCG CGAAGCTTCA GCGCCGTCCC AGCCTCGCCG TGCTGACCGC CGGCCCGGGG GTCACCAACG GCATCTCCGG ATTGACCAGC TCCTACTTCA ACGCGTCACC GGTCCTGGTG CTCGGTGGGC GAGCGCCGCA GTTCCGGTGG GGGGCGGGCA GCCTCCAGGA GTTGGACCAC CTGCCGTTGG TCACCCCGGT GACGAAGCAC GCCGAGACGA TCGCGCATGC GGCTGACGTC CCCCGGGCGG TGGGCGTCGC GTTGACCACG GCGCTCACCC CGCACCGCGG CCCGGTCTTC TTGGACCTTC CACTGGAGGT GGCCTTCTCG GTGGCCGACG CCGACCTGCC CGCTGTCGCC CCGATTGCCC CGATCGAGGC GGACCCGGCG GAGGTCGCGG AGGCCGCTGC GCTGATCGCC GAGGGGCGGC GTCCCGTGCT CGTCGCCGGC TCCGACGTCT ACGCCGGCGA CGCTGTCGAA CCCCTGCGCG CGGCGGCCGA GGCGCTCGCG GTGCCCGTCT TCACCAACGG TCAGGGCCGA GGCGCGCTGC CGCCGGAGCA CCCGCTCGCC TTCACCAGGT CCCGCCGGGT GGCCCTCCAG AAGGCCGACG TGGTGGTGGT CGTCGGCACG CCGCTCGACT TCCGGCTCAA CTTCGGGGAC TTCGGTGCCG CGACCGTGGT GCACGTGGTG GACGCGCCGA GTCAGCGCGC CGGGCATGTC CAACCTGCGG TCGCGCCCGC GGGTGACCTG CGGCTGATCC TGTCCGCGTT CGCGGAATAC TCCGGGGACC GGGTCGACCA CACGGAGTGG GTCGCCGAGC TGCGGGCTGT CGAGGACGCC GCCCGGGCCC GCGACGCCGT GGCGATGGCC GCGGAGACCG ACCCGATTCG GCCCGCCCGC GTCTACGGCG AGCTGCGTCG GGTGCTGACC CGGGACGCGA TCACCATTGG TGACGGCGGC GACTTCGTTT CGTACGCGGG GCGCTACCTG GAGCCCGCCC AGCCCGGCAC CTGGCTGGAC CCCGGTCCGT ACGGCTGCCT CGGCACCGGT ATGGGCTATG CCATGGGTGC TCGGGTGACC CACCCCGACC GGCAGGTCTG CGTCCTGATG GGGGACGGTG CGGCCGGCTT CTCGCTGCTG GACGTGGAGT CCCTGGTCCG GCAGCGGCTG CCGGTGGTGA TTGTCGTCGG CAACAACGGA ATCTGGGGAC TGGAGAAGCA TCCCATGCGG GCGATGTACG GCTACGATGT GGCCGCGGAC CTCCAGCCGG AGCTGCGCTA CGACCAGGTG GTCCAGGCAC TGGGCGGTGC GGGCGAGACG GTGGCGAAGG CTGCCGACCT TGGGCCGGCG CTGACGCGCG CGTTCGAGGC CGGGGTGCCG TACCTGGTCA ACGTGCTGAC CGACCCGGCC GACGCGTACC CCCGCTCGTC GAACCTCGCC TGA
|
Protein sequence | MSERIEGHGG ALALAALRGH GVREMFTLSG GHVFPLYDAA HQAGFPLYDV RHEQSAVFAA EAVAKLQRRP SLAVLTAGPG VTNGISGLTS SYFNASPVLV LGGRAPQFRW GAGSLQELDH LPLVTPVTKH AETIAHAADV PRAVGVALTT ALTPHRGPVF LDLPLEVAFS VADADLPAVA PIAPIEADPA EVAEAAALIA EGRRPVLVAG SDVYAGDAVE PLRAAAEALA VPVFTNGQGR GALPPEHPLA FTRSRRVALQ KADVVVVVGT PLDFRLNFGD FGAATVVHVV DAPSQRAGHV QPAVAPAGDL RLILSAFAEY SGDRVDHTEW VAELRAVEDA ARARDAVAMA AETDPIRPAR VYGELRRVLT RDAITIGDGG DFVSYAGRYL EPAQPGTWLD PGPYGCLGTG MGYAMGARVT HPDRQVCVLM GDGAAGFSLL DVESLVRQRL PVVIVVGNNG IWGLEKHPMR AMYGYDVAAD LQPELRYDQV VQALGGAGET VAKAADLGPA LTRAFEAGVP YLVNVLTDPA DAYPRSSNLA
|
| |