Gene Information |
Locus tag | Rcas_0093 |
Symbol | |
ID | 5537552 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | + |
Start bp | 115318 |
End bp | 116433 |
Gene Length | 1116 bp |
Protein Length | 371 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640892258 |
Product | 3-dehydroquinate synthase |
Protein accession | YP_001430248 |
Protein GI | 156740119 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0337] 3-dehydroquinate synthetase |
TIGRFAM ID | [TIGR01357] 3-dehydroquinate synthase |
|
|
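The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (a common convention for such records, though not stated here) and that the final codon is a stop codon that is not counted in the protein length. The function name `feature_length` is illustrative only.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


# Values copied from the Gene Information table.
START_BP = 115318
END_BP = 116433

gene_length = feature_length(START_BP, END_BP)  # 1116 bp, matches "Gene Length"
codon_count = gene_length // 3                  # 372 codons, including the stop
protein_length = codon_count - 1                # 371 aa, matches "Protein Length"

print(gene_length, codon_count, protein_length)
```

Under these assumptions the result agrees with the listed gene length of 1116 bp and protein length of 371 aa.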
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.19244 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGCAAC CCGCCGAGTA TTTGTACGTC ACAACGACAG GCGGCAGGTA CGAAGTCATC GTCGCCCACG GCGCCTTCGA TCACCTTCCG CACCATCTTC AGCGCATCGG TTTGCGCGGC GCCGCGTGGG TCATCAGCGA CGACCAGGTG TTCCCGCGTT ACGCCCCGGC GCTGATCGCG CGTCTGCGCG CCGCCGGGTA CAATGCGCAC GGGTATGCCG TGCCGCCTGG CGAACCGAGC AAAGACCTGG CAATGGCTGC GCGACTCTAC GACTGGCTGA TCGGCAACGG CGTCGAGCGG CGCGACACTG TGCTGGCGCT GGGTGGCGGC GTCATCGGCG ACCTGGCAGG GTTCGTGGCT GCAACCGTAT TGCGCGGCAT CGCTCTCGTT CACCTGCCGA CCACACTGCT GGCAATGGTC GACTCGGCAA TCGGCGGCAA AACCGGCGTG AACCATTCGC TCGGCAAGAA CCTGATTGGC GCGTTCCACC AACCGCGCTT GACGCTTGCC GACACGACCA CGCTGGCGAC TCTGCCGCCG CGTGAACTAC GCGCTGGCTG GGCGGAAGTG ATCAAACACG CCGTCATTCG CGACGCCGAT CTGTTCGCGC ACCTCGAAGC GATCACCGAT CCTGCGTCTC TCCAGGGTGA TGCGCTGGCA ACGATCATCC GGCGCGCCGC CAGGGTCAAG ATCGATATTG TCAACATCGA TGAGCGCGAA ACCGGCGAAC GCATGCTTCT GAACTATGGG CACACCCTGG GGCATGCCAT CGAAGCCGCA CGCGGCTATG GCGACCTGCT GCACGGCGAG GCAGTCGCCA TCGGCATGCA CCTGGAGGCG CAGATCGCCC ATCGCATGGG GATGGTTGAC TCTCGGTTCG TTGAGCGTCA GCAGCGTCTG CTGCGCGCAT ATGGGCTGCC AACAAATCTG CCGCCCGGTG TGACTATTGA CGACCTGATC GAACGCACGC TGCGCGACAA GAAGGTGCGG GCAGGGCGGG CGCGCTGGGC ATTGCCGCTG GGAATTGGTG CGGCGACCGT GCGCGACGAT GTGCCCGAAA CAGTGGTGCG CGCCATTCTC GAGAAGGCGA CCGACAGAAA TGAGGGACCT GTATGA
|
Protein sequence | MTQPAEYLYV TTTGGRYEVI VAHGAFDHLP HHLQRIGLRG AAWVISDDQV FPRYAPALIA RLRAAGYNAH GYAVPPGEPS KDLAMAARLY DWLIGNGVER RDTVLALGGG VIGDLAGFVA ATVLRGIALV HLPTTLLAMV DSAIGGKTGV NHSLGKNLIG AFHQPRLTLA DTTTLATLPP RELRAGWAEV IKHAVIRDAD LFAHLEAITD PASLQGDALA TIIRRAARVK IDIVNIDERE TGERMLLNYG HTLGHAIEAA RGYGDLLHGE AVAIGMHLEA QIAHRMGMVD SRFVERQQRL LRAYGLPTNL PPGVTIDDLI ERTLRDKKVR AGRARWALPL GIGAATVRDD VPETVVRAIL EKATDRNEGP V
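The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The following is a minimal sketch, not the database's own method: it builds the standard codon-to-amino-acid assignments, which translation table 11 shares for elongation codons, and the helper names `translate` and `gc_fraction` are hypothetical. Whitespace between the 10-base blocks is ignored, and translation stops at the first stop codon.

```python
from itertools import product

# Standard codon assignments in TCAG order; translation table 11 uses the
# same amino-acid assignments and differs only in permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # "X" marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases, ignoring whitespace."""
    seq = "".join(dna.split()).upper()
    return (seq.count("G") + seq.count("C")) / len(seq) if seq else 0.0


# First three blocks and the final 21 bases of the gene sequence above.
head = "ATGACGCAAC CCGCCGAGTA TTTGTACGTC"
tail = "AGAAATGAGGGACCTGTATGA"

print(translate(head))  # MTQPAEYLYV
print(translate(tail))  # RNEGPV
```

Passing the full gene sequence to `translate` should yield the listed protein sequence, and `gc_fraction` on the same input can be compared against the GC content field.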