Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dfer_2884 |
Symbol | aroB |
ID | 8226457 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dyadobacter fermentans DSM 18053 |
Kingdom | Bacteria |
Replicon accession | NC_013037 |
Strand | + |
Start bp | 3550117 |
End bp | 3551277 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 644930714 |
Product | 3-dehydroquinate synthase |
Protein accession | YP_003087264 |
Protein GI | 255036643 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0337] 3-dehydroquinate synthetase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 39 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.00942869 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCAAACCA TAAAACAACG TTTCCAGGTG GAATACAATT ATGCCGTATT CTTCACCCAG CACCTATTTG ACACCCAGAA CCCGTTACTG AAAGATTTTT TTCAAGCATA TACCGAGCAG GGCTTTCAAC GCAAGGCCCT GGTGATCGTG GACGAGGGGT TTGAACAAAC ACAGCCGGAT TTGAAATCCA ATATCCGCGC CTATTTCGCA CAAAATGCAG CGCATATCCA GCTCGCGGCC GAGATCATTT CGGTGCCGGG CGGGGAGGCG TGCAAAAATG ATCCGTCGCA GTTCGATAAG CTCGTGGAGG CTGTCGACGT GTTCGGGATT GATCGGCACT CATTTGTGAT CGGCATTGGC GGTGGCGCCG TGCTGGACCT CGTCGGTTAC GCCGCCGCGG TTTCGCACCG GGGCATCAAG CTCATCCGCA TTCCGACGAC CGTTTTGGCG CAAAACGACT CGGGCGTGGG CGTCAAAAAC AGCATTAACT TTCACGGTAA AAAAAACTTC CTCGGCACAT TTGCACCACC GGTGGCTGTT TTCAACGATC TCACTTTCCT GCGCACGCTC GACGACCGCG ACTGGCGCGG CGGCCTGGCC GAAGCTGTAA AAGTGGCATT GATCAAGGAC CAGGCGTTTT TTGAATGGAT TGAAGAACAT GCAACCGCAT TGGCAGCGCG TGATGAAGAA GCCATGGCCT ACCTCATCCA CCGCTGCGCC GAAATGCATA CCGACCACAT TGCCGGCGGC GACCCGTTCG AATTCGGCTC TTCGCGACCA TTGGATTTTG GTCACTGGGC CGCGCATAAG CTGGAATTCC TTACGAACTT CGAGGTGCGT CACGGCGAGG CGGTGGCGAT CGGCATCGCA TTGGATTGCG TGTATGCCCA TAAAATAGGC ATGCTGGCCG AAAGCGACCT GCACCGCATT ATCGACGTGC TCACCAAAGT CGGTTTTGAG CTATATCATT CCAAACTGGC TGAAAACGAT AAAATCAACC TCCGCAACGG CTTGCAGGAA TTCCGCGAGC ATTTGGGCGG CAGGCTCACC ATCATGCTTT TGGAAAAGAT TGGAAAAGGT GTGGAAGTGC ATGAGCTCGA CGCCGACATT ATCGCACAGT CGGTGGATTA CCTCGAAAGT TCCAAAATTA TTCCTGCATG A
|
Protein sequence | MQTIKQRFQV EYNYAVFFTQ HLFDTQNPLL KDFFQAYTEQ GFQRKALVIV DEGFEQTQPD LKSNIRAYFA QNAAHIQLAA EIISVPGGEA CKNDPSQFDK LVEAVDVFGI DRHSFVIGIG GGAVLDLVGY AAAVSHRGIK LIRIPTTVLA QNDSGVGVKN SINFHGKKNF LGTFAPPVAV FNDLTFLRTL DDRDWRGGLA EAVKVALIKD QAFFEWIEEH ATALAARDEE AMAYLIHRCA EMHTDHIAGG DPFEFGSSRP LDFGHWAAHK LEFLTNFEVR HGEAVAIGIA LDCVYAHKIG MLAESDLHRI IDVLTKVGFE LYHSKLAEND KINLRNGLQE FREHLGGRLT IMLLEKIGKG VEVHELDADI IAQSVDYLES SKIIPA
|
| |