Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_1432 |
Symbol | aroB |
ID | 4078062 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | + |
Start bp | 1529139 |
End bp | 1530260 |
Gene Length | 1122 bp |
Protein Length | 373 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 638006742 |
Product | 3-dehydroquinate synthase |
Protein accession | YP_613427 |
Protein GI | 99081273 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0337] 3-dehydroquinate synthetase |
TIGRFAM ID | [TIGR01357] 3-dehydroquinate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.68619 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.103295 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAACAAA CCGTTCACGT TCCCCTTGGC GCGCGCGCCT ATGATGTGGT GATCGGCCCC GATCTTGTTG CACAGGCGGG CCAGCGTATT GCGCCCCTCC TGCGCCGCAA GACAGTGGCT GTGCTCACGG ATGAGACCGT GGCCGCGCTT CATCTTGAGG CTCTGCGCGC GGGACTCGCA GCCGACGGCA TCGAGATGGA AGCACTTGCC TTGCCGCCCG GCGAGGCCAC TAAAGGCTGG CCCCAGTTCA CCCGCGCGGT GGAGTGGCTC TTGGACAAGA AAGTCGAGCG CGGCGACATC GTCATTGCCT TTGGCGGGGG CGTCATCGGC GATCTGGCGG GTTTTGCCGC AGCCGTGCTG CGCCGGGGCG TTCGTTTTGT CCAGATCCCC ACATCCCTGC TGGCGCAGGT CGACAGTTCC GTCGGGGGCA AAACCGGCAT CAACGCTCCG CAAGGCAAGA ACCTGATCGG CGCCTTCCAC CAGCCCAGCC TGGTACTGGC CGATACAGCG GTTCTTGGCA CGCTCACAGA GCGCGATTTT CTTGCCGGCT ACGGTGAGGT GGTGAAATAC GGGCTCTTGG GCGATGCGGC CTTTTTTGAC TGGCTCGAAG AAAATGCCCC GGCAATGGCG GCAGGTGACA TGGCGCTGCG GGTCGAAGCC GTGGCGCGTT CGGTTCAGAT GAAAGCCGAC ATCGTGGCCC GCGACGAAAC CGAACAAGGC GACCGGGCGC TGTTGAACCT TGGTCATACC TTCTGTCACG CGCTGGAAGC GGCGACCGGC TACAGCGACC GGTTGCTGCA TGGCGAAGGC GTGGCGATCG GCTGTGCGTT GGCCTTTGAG CTCTCAGCCC GCCTCGGCCT CTGCAGTCAG GAAGATCCCA GCCGCGTGCG CGCGCACCTC AAGGCGATGG GCATGAAAAC AGACCTCTCG GACATTCCCG GCGATCTTCC CCCCGCCCAA GAGCTTCTGG ATCTCATGGC GCAGGACAAG AAGGTCGTGG ATGGTCAGCT GCGCTTCATC CTCGCGCGCG GCATCGGAGC GGCCTTTGTC ACCGCCGATG TGCCCTCTGA AAAGGTGCTT GAGGTGCTGC AAGAGGCGCT GGCGCATACA CAACCCGCCT GA
|
Protein sequence | MEQTVHVPLG ARAYDVVIGP DLVAQAGQRI APLLRRKTVA VLTDETVAAL HLEALRAGLA ADGIEMEALA LPPGEATKGW PQFTRAVEWL LDKKVERGDI VIAFGGGVIG DLAGFAAAVL RRGVRFVQIP TSLLAQVDSS VGGKTGINAP QGKNLIGAFH QPSLVLADTA VLGTLTERDF LAGYGEVVKY GLLGDAAFFD WLEENAPAMA AGDMALRVEA VARSVQMKAD IVARDETEQG DRALLNLGHT FCHALEAATG YSDRLLHGEG VAIGCALAFE LSARLGLCSQ EDPSRVRAHL KAMGMKTDLS DIPGDLPPAQ ELLDLMAQDK KVVDGQLRFI LARGIGAAFV TADVPSEKVL EVLQEALAHT QPA
|
| |