Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RoseRS_4112 |
Symbol | |
ID | 5211095 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | + |
Start bp | 5152425 |
End bp | 5153390 |
Gene Length | 966 bp |
Protein Length | 321 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640597700 |
Product | 3-dehydroquinate synthase |
Protein accession | YP_001278406 |
Protein GI | 148658201 |
COG category | [C] Energy production and conversion |
COG ID | [COG0371] Glycerol dehydrogenase and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTCACC GCCTGCATCC CATTCCCGCA ATTGTCGAGG CGCCGTTGCA CGCGCTGGTC GATCAGCGGC GCATTGCCCG GATTGCCAGT GGACCGGCGC TTGCCGCCGC CGTGCATGCC GGGTTGCATC TGCCGGTCGT CTGGAGCGCC GAGCCGCGCG AAGCCAGCGA GATGCACTTC GAGGAACTGG CGCGCACCGT GCCCGCCGAA ACTGAGGTGA TCTACGGCAT CGGCGGCGGT CTGGCGGCGG ATGCGGCGAA GTACGTCGCC TGGCGCCGCA GTCTCCCGCT GGCGTTGATT CCGACCGCCC TCTCGGTTGA TGCCCACTTC ACCTGGGTCA GCGGCGTCCG ACGCAACGGA TGTGTGGCGT ATCTCGACAC CGGTCCGGCG CAGGTTGTGT ACGCCGATGA CGGGTTTCTG GCGCACGCGC CGCCCCATCT TCGCGCCGCA GGCGTGTGTG ACCTGCTCTC CATCGCTACG GCGCTCCACG ACTGGCGGTA CGCGGAAGAG CGCGGCATGA ATCCGCCAGA GCAACGGTAT ACGCCGTGGG TTGCCATGGC AGCGCAGGCG ATCCTCAATG GCGCGATCAC TGTTGCCGAA GCCGCAGGGC GCGGCGACCC GGAGGGCATC CGTGAATTGT TGCGGCTCCT GGCGCTCGAA GTGCAGCTGT GCAATCTGAT CGGGCATAGC CGACCGGAGG AAGGTTCAGA ACACTATCTG GCGTATGCGC TGGAAGCGCA TCCGTCGATC GGCAGCGGTC ACGCCCACGG CGACCTGGTG GGACCGGCAA TCCTGCGCGC TGCCGCCTGG CAGGGGCAGC ATGTCGCTCC GCTGGAGCAG GCGCTGATCC AGGCTGGCGT ACCGCTTGAT CGCGTGCCGG AAACGGTCAT GCAAGAGGTG ATCCGGGAGT TGCCGTCGTA TGTTCGACGC CACCATCTCG CCTTCAGTAC CGCTCACGAC CTGTAA
|
Protein sequence | MSHRLHPIPA IVEAPLHALV DQRRIARIAS GPALAAAVHA GLHLPVVWSA EPREASEMHF EELARTVPAE TEVIYGIGGG LAADAAKYVA WRRSLPLALI PTALSVDAHF TWVSGVRRNG CVAYLDTGPA QVVYADDGFL AHAPPHLRAA GVCDLLSIAT ALHDWRYAEE RGMNPPEQRY TPWVAMAAQA ILNGAITVAE AAGRGDPEGI RELLRLLALE VQLCNLIGHS RPEEGSEHYL AYALEAHPSI GSGHAHGDLV GPAILRAAAW QGQHVAPLEQ ALIQAGVPLD RVPETVMQEV IRELPSYVRR HHLAFSTAHD L
|
| |