Gene Information |
Locus tag | Acid345_1048 |
Symbol | |
ID | 4073135 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 1313475 |
End bp | 1315148 |
Gene Length | 1674 bp |
Protein Length | 557 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637983055 |
Product | urocanate hydratase |
Protein accession | YP_590125 |
Protein GI | 94968077 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2987] Urocanate hydratase |
TIGRFAM ID | [TIGR01228] urocanate hydratase |
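
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (which matches the reported Gene Length) and that the final codon of the CDS is a stop codon that is not counted in the Protein Length. The function names are illustrative and are not part of any database API.

```python
# Values copied from the record above; function names are hypothetical.
START_BP = 1313475
END_BP = 1315148


def gene_length_bp(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive start and end coordinates."""
    return end - start + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(START_BP, END_BP)
print(length, protein_length_aa(length))  # 1674 557
```

Both results agree with the Gene Length (1674 bp) and Protein Length (557 aa) rows of the record.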
| |
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.155699 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCGTTG ATACCGAAAC ATCTAGCTAT ACTCCCGTCA AAGCGCCACG TGGTAACACC ATTTCCTGCA AGGGCTGGCA GCAAGAAGCT GCCATGCGCA TGCTCATGAA CAACCTCGAC GAAGAGGTAG CCGAGCGCCC TCGCGACCTG GTTGTGTATG GGGGCACCGG CCGCGCAGCC CGCAGTTGGG ACTGCTTCCA CGCCATCGTG AACACGCTTA AGTCTCTCGA CAACGACGAG ACTCTGCTGG TGCAATCCGG GAAGCCGGTT GCGGTGTTCC GGACGCACGA ATATGCGCCG CGTGTGCTCA TCTGCAACTC GAACCTTGTC GGGCATTGGT CGAATTGGGA CAAGTTCAAC GAGCTCGAAC GTGCCGGTTT GACGATGTAT GGCCAGATGA CCGCCGGCTC GTGGATCTAC ATCGGATCGC AGGGCATCAT CCAGGGCACT TTTGAGACAT TTGCTGCCGC GGCCGAGAAG CACTTCGGTG GTGAACTTGA AGGGAAGCTG ATTGTGAGCG GCGGTATGGG AGGCATGGGT GGCGCTCAGC CACTGGCAGC AACCATGACC GGCGCGTGCT TCCTTGGCAT TGATGTTGAT CCCGAGCGCA TCAAGAAGCG CCTGAAGACG GGCTACTGCG ACTTCATGGT CAACTCGCTT GACGAAGCGC TCCGCATCCT GAAGAACGCC GTTCGCAAAA AAGAGAACAT TTCTGTCGGT CTTGTCGGCA ACTGCGCCGA TGTGATTCCG GAACTAGCCG AGCGCGGCGT GGTGCCCGAC ATCCTTACCG ACCAGACGTC GGCGCATGAT CCGCTGAACG GGTACGTTCC GAATGGCATG ACGTTCGAAG CAGCGCTGGA GCTTCGCAAG AGCGATCCGC ATGCGTACAA CGAGCGTTCG CTGGATTCAA TGGCGCGCCA CGTCGAGGGC ATGCTCAAGC TGCAGAAAAT GGGCGCTGTC ACCTTTGACT ACGGCAACAA CATTCGGACG TTCGCCTTCC AGCGCGGCGT TAAGAACGCG TACGACTTCC CGGGTTTTGT GCCGGCGTAC ATTCGTCCTC TGTTTTGCGA AGGCCGCGGA CCGTTCCGCT GGGTAGCGCT CTCGGGTGAG CCGTCGGACA TTCATGTGAC GGACGAGCTG ATCCTTCAGA TGTATCCGCA GAACCGCATT CTGAGCCGCT GGATCGATCT TGCGCGCAAG CGGATCAAGT TCCAGGGACT GCCGTCGCGC ATCTGCTGGC TCGGCTATGG CGAGCGCGCC GAGTTCGGGC TCGCAATGAA CGACCTCGTA AAGAAAGGGA AGATCAAGGC CCCGATCGTC ATCGGCCGCG ACCACCTCGA CTGCGGCTCG GTGGCATCGC CGTTCCGCGA AACCGAAGCC ATGAAAGACG GTAGCGATGC GATCGCCGAT TGGCCGCTGC TCAACGCGCT GCTGAACACT GCGAGCGGAG CTTCGTGGGT CTCGATTCAC AACGGAGGCG GCGTAGGCAT TGGCTATTCG CAACACGCCG GCCAGGTGAC AGTCGCCGAC GGAACCGACG AGATGGCGAA GCGCATCGAG CGGGTGCTTA CTAACGATCC GGGCATTGGT GTGGCACGGC ACGTGGACTC CGGCTACGAC GAAGCCAAGA GCTTCGCGAA AGAAAAGGGC GTCAAGATTC CGATGGGACA GTAG
|
Protein sequence | MPVDTETSSY TPVKAPRGNT ISCKGWQQEA AMRMLMNNLD EEVAERPRDL VVYGGTGRAA RSWDCFHAIV NTLKSLDNDE TLLVQSGKPV AVFRTHEYAP RVLICNSNLV GHWSNWDKFN ELERAGLTMY GQMTAGSWIY IGSQGIIQGT FETFAAAAEK HFGGELEGKL IVSGGMGGMG GAQPLAATMT GACFLGIDVD PERIKKRLKT GYCDFMVNSL DEALRILKNA VRKKENISVG LVGNCADVIP ELAERGVVPD ILTDQTSAHD PLNGYVPNGM TFEAALELRK SDPHAYNERS LDSMARHVEG MLKLQKMGAV TFDYGNNIRT FAFQRGVKNA YDFPGFVPAY IRPLFCEGRG PFRWVALSGE PSDIHVTDEL ILQMYPQNRI LSRWIDLARK RIKFQGLPSR ICWLGYGERA EFGLAMNDLV KKGKIKAPIV IGRDHLDCGS VASPFRETEA MKDGSDAIAD WPLLNALLNT ASGASWVSIH NGGGVGIGYS QHAGQVTVAD GTDEMAKRIE RVLTNDPGIG VARHVDSGYD EAKSFAKEKG VKIPMGQ
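
The sequences above are displayed in space-separated blocks of 10 characters. The following is a minimal sketch of how such a grouped string could be rejoined and checked, for example to recompute the GC content or to confirm that the gene sequence looks like a complete CDS. The helper names are hypothetical. The start-codon check accepts only ATG, which is narrower than what translation table 11 permits, and the stop codons TAA, TAG, and TGA are those of table 11.

```python
def ungroup(seq: str) -> str:
    """Remove the whitespace used for block display and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a (possibly grouped) DNA string."""
    dna = ungroup(dna)
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


def looks_like_complete_cds(dna: str, stop_codons=("TAA", "TAG", "TGA")) -> bool:
    """True if the length is a multiple of 3, starts with ATG, and ends in a stop codon."""
    dna = ungroup(dna)
    return len(dna) % 3 == 0 and dna.startswith("ATG") and dna[-3:] in stop_codons


# First two blocks of the gene sequence above, used only as a short demo input.
demo = "ATGCCCGTTG ATACCGAAAC"
print(ungroup(demo))      # ATGCCCGTTGATACCGAAAC
print(gc_fraction(demo))  # 0.5
```

Passing the full Gene sequence value to `gc_fraction` and `looks_like_complete_cds` would allow the GC content and CDS structure reported in the record to be compared against the sequence itself.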
|
| |