Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_1253 |
Symbol | |
ID | 5703481 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 1449486 |
End bp | 1450547 |
Gene Length | 1062 bp |
Protein Length | 353 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641270768 |
Product | 3-dehydroquinate synthase |
Protein accession | YP_001536149 |
Protein GI | 159036896 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0337] 3-dehydroquinate synthetase |
TIGRFAM ID | [TIGR01357] 3-dehydroquinate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.000152397 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGAGAAGAG TCGTTCCGGT GCGTCTCGGG GAACGTTCCT ACCAGGTCCT GATCGGCCCG GGCGTCCGCA CATCGCTGGC CGAGGTGATT CGCCGACTGG GCGCCGAACG AGCCGCCGTC GTGTCGGCCC GCCCGCGGGA ATGGGTGCCC GACACGGGCG TGGAGACCCT GCTGCTGCCC GCCCGCGACG GCGAGCAGAG CAAGACGCTC GCCACGGTGG AGGCGTTGTG CCACGCGTTC GTGCGGTTCG GGCTCACCCG GTCCGACGTT GTCGTCTCGT GCGGTGGCGG GACGACAACC GACGTCGTCG GACTGGCTGC CGCGCTCTAC CACCGGGGGG TGGACGTGAT CCATCTGCCC ACGTCGCTGC TGGCCCAGGT GGACGCCAGC GTCGGCGGTA AAACGGCGGT GAACCTGCCC GACGGCAAGA ACCTGGTCGG CGCGTACTGG CAGCCCCGCG CGGTGCTGTG CGACACGGAC TACCTGTCGA CCCTGCCCCC GCGGGAGTTG CTCAACGGCC TGGGTGAGAT CGCCCGCTGT CACTTTATCG GTGCGGGTGA CCTGCGCGGG CTACCGCTCG CGGAGCAGAT CGCCGCCAGT GTGACCCGCA AGGCGGGCAT CGTCGAGGTC GACGAGCGGG ACGCCGGTAG GCGGCATCTG CTCAACTACG GCCATACGCT GGGCCACGCG CTCGAGCTGG CCACCGGATT CGCGCTGCGG CACGGCGAGG CGGTCGCAGT CGGCACCGTC TTCGCCGGCC GGCTGGCGGG CGCGCTGGGC CGCATCAACC AGTCCAGAGT GGATGAACAT CTGGCGGTCG TGCGCCACTA CAACCTGCCC GCCGCCCTGC CCGCCGAGGT CGATCCCAGG GCCCTGGTCC GCCAGATGCG CCGGGACAAG AAGGCGATCA GTGGTCTCGG TTTCGTCCTG GACGGGCCCG AGGGCGCGGA GCTGGTGAGT GACGTGCCGG AGAACGTGGT GCTCGCTGTC CTCGACGCGA TGCCGCGAGC GCCCATGGAC GCGCTCGTCG GCGCCCTCAC GACCGGTGCG GTGCGGACAT GA
|
Protein sequence | MRRVVPVRLG ERSYQVLIGP GVRTSLAEVI RRLGAERAAV VSARPREWVP DTGVETLLLP ARDGEQSKTL ATVEALCHAF VRFGLTRSDV VVSCGGGTTT DVVGLAAALY HRGVDVIHLP TSLLAQVDAS VGGKTAVNLP DGKNLVGAYW QPRAVLCDTD YLSTLPPREL LNGLGEIARC HFIGAGDLRG LPLAEQIAAS VTRKAGIVEV DERDAGRRHL LNYGHTLGHA LELATGFALR HGEAVAVGTV FAGRLAGALG RINQSRVDEH LAVVRHYNLP AALPAEVDPR ALVRQMRRDK KAISGLGFVL DGPEGAELVS DVPENVVLAV LDAMPRAPMD ALVGALTTGA VRT
|
| |