Gene Acid345_1048 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_1048 
Symbol 
ID4073135 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp1313475 
End bp1315148 
Gene Length1674 bp 
Protein Length557 aa 
Translation table11 
GC content60% 
IMG OID637983055 
Producturocanate hydratase 
Protein accessionYP_590125 
Protein GI94968077 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2987] Urocanate hydratase 
TIGRFAM ID[TIGR01228] urocanate hydratase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.155699 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCGTTG ATACCGAAAC ATCTAGCTAT ACTCCCGTCA AAGCGCCACG TGGTAACACC 
ATTTCCTGCA AGGGCTGGCA GCAAGAAGCT GCCATGCGCA TGCTCATGAA CAACCTCGAC
GAAGAGGTAG CCGAGCGCCC TCGCGACCTG GTTGTGTATG GGGGCACCGG CCGCGCAGCC
CGCAGTTGGG ACTGCTTCCA CGCCATCGTG AACACGCTTA AGTCTCTCGA CAACGACGAG
ACTCTGCTGG TGCAATCCGG GAAGCCGGTT GCGGTGTTCC GGACGCACGA ATATGCGCCG
CGTGTGCTCA TCTGCAACTC GAACCTTGTC GGGCATTGGT CGAATTGGGA CAAGTTCAAC
GAGCTCGAAC GTGCCGGTTT GACGATGTAT GGCCAGATGA CCGCCGGCTC GTGGATCTAC
ATCGGATCGC AGGGCATCAT CCAGGGCACT TTTGAGACAT TTGCTGCCGC GGCCGAGAAG
CACTTCGGTG GTGAACTTGA AGGGAAGCTG ATTGTGAGCG GCGGTATGGG AGGCATGGGT
GGCGCTCAGC CACTGGCAGC AACCATGACC GGCGCGTGCT TCCTTGGCAT TGATGTTGAT
CCCGAGCGCA TCAAGAAGCG CCTGAAGACG GGCTACTGCG ACTTCATGGT CAACTCGCTT
GACGAAGCGC TCCGCATCCT GAAGAACGCC GTTCGCAAAA AAGAGAACAT TTCTGTCGGT
CTTGTCGGCA ACTGCGCCGA TGTGATTCCG GAACTAGCCG AGCGCGGCGT GGTGCCCGAC
ATCCTTACCG ACCAGACGTC GGCGCATGAT CCGCTGAACG GGTACGTTCC GAATGGCATG
ACGTTCGAAG CAGCGCTGGA GCTTCGCAAG AGCGATCCGC ATGCGTACAA CGAGCGTTCG
CTGGATTCAA TGGCGCGCCA CGTCGAGGGC ATGCTCAAGC TGCAGAAAAT GGGCGCTGTC
ACCTTTGACT ACGGCAACAA CATTCGGACG TTCGCCTTCC AGCGCGGCGT TAAGAACGCG
TACGACTTCC CGGGTTTTGT GCCGGCGTAC ATTCGTCCTC TGTTTTGCGA AGGCCGCGGA
CCGTTCCGCT GGGTAGCGCT CTCGGGTGAG CCGTCGGACA TTCATGTGAC GGACGAGCTG
ATCCTTCAGA TGTATCCGCA GAACCGCATT CTGAGCCGCT GGATCGATCT TGCGCGCAAG
CGGATCAAGT TCCAGGGACT GCCGTCGCGC ATCTGCTGGC TCGGCTATGG CGAGCGCGCC
GAGTTCGGGC TCGCAATGAA CGACCTCGTA AAGAAAGGGA AGATCAAGGC CCCGATCGTC
ATCGGCCGCG ACCACCTCGA CTGCGGCTCG GTGGCATCGC CGTTCCGCGA AACCGAAGCC
ATGAAAGACG GTAGCGATGC GATCGCCGAT TGGCCGCTGC TCAACGCGCT GCTGAACACT
GCGAGCGGAG CTTCGTGGGT CTCGATTCAC AACGGAGGCG GCGTAGGCAT TGGCTATTCG
CAACACGCCG GCCAGGTGAC AGTCGCCGAC GGAACCGACG AGATGGCGAA GCGCATCGAG
CGGGTGCTTA CTAACGATCC GGGCATTGGT GTGGCACGGC ACGTGGACTC CGGCTACGAC
GAAGCCAAGA GCTTCGCGAA AGAAAAGGGC GTCAAGATTC CGATGGGACA GTAG
 
Protein sequence
MPVDTETSSY TPVKAPRGNT ISCKGWQQEA AMRMLMNNLD EEVAERPRDL VVYGGTGRAA 
RSWDCFHAIV NTLKSLDNDE TLLVQSGKPV AVFRTHEYAP RVLICNSNLV GHWSNWDKFN
ELERAGLTMY GQMTAGSWIY IGSQGIIQGT FETFAAAAEK HFGGELEGKL IVSGGMGGMG
GAQPLAATMT GACFLGIDVD PERIKKRLKT GYCDFMVNSL DEALRILKNA VRKKENISVG
LVGNCADVIP ELAERGVVPD ILTDQTSAHD PLNGYVPNGM TFEAALELRK SDPHAYNERS
LDSMARHVEG MLKLQKMGAV TFDYGNNIRT FAFQRGVKNA YDFPGFVPAY IRPLFCEGRG
PFRWVALSGE PSDIHVTDEL ILQMYPQNRI LSRWIDLARK RIKFQGLPSR ICWLGYGERA
EFGLAMNDLV KKGKIKAPIV IGRDHLDCGS VASPFRETEA MKDGSDAIAD WPLLNALLNT
ASGASWVSIH NGGGVGIGYS QHAGQVTVAD GTDEMAKRIE RVLTNDPGIG VARHVDSGYD
EAKSFAKEKG VKIPMGQ