Gene Information |
Locus tag | Acid345_4110 |
Symbol | |
ID | 4072301 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 4865344 |
End bp | 4867185 |
Gene Length | 1842 bp |
Protein Length | 613 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637986141 |
Product | 3-dehydroquinate dehydratase / shikimate dehydrogenase |
Protein accession | YP_593184 |
Protein GI | 94971136 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0169] Shikimate 5-dehydrogenase; [COG0710] 3-dehydroquinate dehydratase |
TIGRFAM ID | [TIGR00507] shikimate 5-dehydrogenase; [TIGR01093] 3-dehydroquinate dehydratase, type I |
|
|
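The length fields above agree with the coordinates if Start bp and End bp are read as 1-based, inclusive positions and the coding sequence is taken to include its stop codon. Both readings are assumptions that the record's numbers support but do not state outright. A minimal sketch of that arithmetic, using hypothetical helper names that are not part of any database API:

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the CDS length is a whole number of codons and, by default,
    that the final codon is a stop codon that is not translated.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = cds_length_bp // 3
    return codons - 1 if includes_stop else codons


# Values taken from the record above.
length = gene_length_bp(4865344, 4867185)
print(length)                     # 1842
print(protein_length_aa(length))  # 613
```

The strand does not affect this calculation. It only determines which end of the interval holds the start codon.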
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.471518 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.833222 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAACTT CCGGCTACAA CGCGCCCCGC ATCCTCCGCT CGCGCATTCC CCGTGTTTGC GTCGCCGTAA TCGGCGCGAC GCCGGAGGAA ATGATCGCCA AAGCGGAAGG CATAATCCGC GAAAACCCCT TTCTTGAGTT CAGGTTAGAC TACCTGACCA ACCCCGCAGC GGCGCTCCCC AAGGTGAAGC GCCTGCTCGA AACCCGGCCC GATGCCGTCA TCATTGGTAC CTGTCGGAGG GCGGTCAACG GGGGCAAATT TAAGGGGTCG GCCTCGGCGC AGCTCGAAAT CCTGGCCAAG GCCGCGGCAG CCGGATGTCA ACTTCTGGAC GTGGAACTGG AGACAGCGGA GTCGGTGAAG GCCTCGGAGC TGCAAAAGCT TCGTGAAGCA GCCTCGATCA TCCTGTCTTT TCATGACTTT AAGGCAACCA AGAAGCTGGA TGACATCCTC GCCCGGATGG TGAAAATCCA CGTCGAATAT TACAAAGTGG TCTCCACGGC GACCTCCCTG TACGACAACG TGCAGATGAT GAAGTTCCTG GAGAAGCACG GGCACGATTA CTGGCTGATC GGGCTGTGCA TGGGCGAGCA AGGCATTATC AGCCGGGTTT TGTCGGTCCG AGCGGGCAGT GTATTTACCT TTGCGGCGGC TTCCGCGGGC GAAGAGACGG CGCCGGGACA GGTCACCTAT CGCGACCTGC GGAGCACTTT CCGTATTGAC CACGTGGACG CCGCCACGCA TGTGTACGGC GTTGCCGGAG ATCCAGTAGC GCATTCGTTG TCGCCGGCGA TGATGAACCT GGCGTTCCGC CGCGAGAACG TGAATGGCGT GTATCTGGCG TTGCACACAA AAAAGACCGA CGATCTGATG GCGTGTATTC GCGACATTCC TCTGCGCGGG TTGAGCATCA CCATGCCGCA TAAAGAGGAG ATGGTGAAGC ACCTCGCGAA CACCGACGAC CTGGTGCGCA AGATTGGCGC GGTGAATACG ATCGTCCGCG CGCAAGACGG AAAGCTCTAC GGATTTAATA CCGACGTGGC TGGGATCGTT GGGCCGCTTT CGGACCGCAT CACCGTGACT GGGGCGAAAA TCCTGGTGGT AGGCGCCGGT GGCGCAGCGC GCGCAGCGGT GTTCGGATTG GCGGCACGCG GGGCACACAT TTCGATCGTG AATCGCACCG TGCCGAAGGC GCAAAAGTTG GCGAAGGAAG CGGGTGCGAA GGTCGTAAAG CGAGCCGATT TGAAAAAGCT CGATTTCGAC GTGATCATCA ACGCGACCTC GGTCGGCATG ATGAACCCGA AGGTGAGCCC GCTCGAAGCC GATGAATTGC GGGCGCGCAT CGTGTTCGAC ATGGTCTATA ACCCGGTGAA CACTCGGTTG ATCCAGTTGG CGAAAGCCAA AGGATTGGCG ACGATTCCGG GCTACGAGAT GTTTGTGCAG CAGGGAGCGC GTCAATTTGA GATTTGGAGC GGCAAGCCTG CGCCGGTCGA GGAAATGCGC GGCATAGTGA TGAAGGCGCT CGGGGAATCG CCGGTTTTGG AAGCTCCCAC GCCTGTGCCG CCACCGCTTC CAGCCGAAGC GCCGAAACCT GCGGTCGCCG CGAAACCGGT TGCTGGAAAA ACTGCGCCGG TGACGAAATC TGTTTTAGCC GCGAAGCCCG CAGCTGCAAA GTCTGCTCTC GTCGCGAAGC CCGCTCCTGT TACGAAGCAT GTGCCTGTAG CAGCAAACCA TAACGGGACC GGCAAGAAAG CTGCCCCTGT CGCGAAGAAG CCGGAGCCCA AACCGGTCGC GAAGAAAGCC 
GCTCCGCCTG CCAAGAAGAA GGTTGTGGCA AAGAAGAAGT AA
|
Protein sequence | MATSGYNAPR ILRSRIPRVC VAVIGATPEE MIAKAEGIIR ENPFLEFRLD YLTNPAAALP KVKRLLETRP DAVIIGTCRR AVNGGKFKGS ASAQLEILAK AAAAGCQLLD VELETAESVK ASELQKLREA ASIILSFHDF KATKKLDDIL ARMVKIHVEY YKVVSTATSL YDNVQMMKFL EKHGHDYWLI GLCMGEQGII SRVLSVRAGS VFTFAAASAG EETAPGQVTY RDLRSTFRID HVDAATHVYG VAGDPVAHSL SPAMMNLAFR RENVNGVYLA LHTKKTDDLM ACIRDIPLRG LSITMPHKEE MVKHLANTDD LVRKIGAVNT IVRAQDGKLY GFNTDVAGIV GPLSDRITVT GAKILVVGAG GAARAAVFGL AARGAHISIV NRTVPKAQKL AKEAGAKVVK RADLKKLDFD VIINATSVGM MNPKVSPLEA DELRARIVFD MVYNPVNTRL IQLAKAKGLA TIPGYEMFVQ QGARQFEIWS GKPAPVEEMR GIVMKALGES PVLEAPTPVP PPLPAEAPKP AVAAKPVAGK TAPVTKSVLA AKPAAAKSAL VAKPAPVTKH VPVAANHNGT GKKAAPVAKK PEPKPVAKKA APPAKKKVVA KKK
|
| |