Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_1000 |
Symbol | |
ID | 5704682 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 1124381 |
End bp | 1126030 |
Gene Length | 1650 bp |
Protein Length | 549 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641270515 |
Product | urocanate hydratase |
Protein accession | YP_001535902 |
Protein GI | 159036649 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2987] Urocanate hydratase |
TIGRFAM ID | [TIGR01228] urocanate hydratase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.208315 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCAGC CGATCCGTGC CACGCGCGGC ACCACCCGCA CCGCCCAGGG CTGGCCCCAG GAAGCCGCGC GGCGGATGCT GATGAACAAC CTCGACCCCG AGGTGGCCGA ACGTCCCGAG GACCTGGTGG TATACGGCGG GACCGGGAAG GCGGCACGGG ACTGGCCGTC GTACCGCGCA CTGCTGGACA CCCTCACCGA CCTGCGCGAC GACGAGACGA TGCTGGTGCA GTCCGGTCGA CCGGTCGCGG TGATGCGAAC CCACGAATGG GCGCCACGGG TGCTGCTCGC CAACTCCAAC CTGGTCGGAG ACTGGGCGAC CTGGCCGGAG TTCCGGCGCC TGGAACAGCT GGGCCTGACC ATGTACGGGC AGATGACCGC CGGATCGTGG ATCTACATCG GCACCCAGGG GATCCTCCAG GGCACCTACG AGACGTTCGC GGCCGTCGCC GCGAAGCGGT TCGGCGGATC CCTGGCCGGC ACGCTGACGC TGACCGCCGG CTGCGGTGGG ATGGGCGGGG CGCAACCGCT CGCGGTGACC ATGAACGGCG GCTCCTGCCT GATCGTGGAC GTCGACCGGT CCCGCCTCGA ACGCCGGGTG CGCGAACACT ACCTGGACGA GGTCGCCGAC TCGCTCGACG ACGCCGTACA ACGGGCACTC GCCGCCCGCG ACCAACGACG GGCACGCAGC GTCGGAGTGG TCGGCAACGC GGCCACCATC TTCCCCGAGC TGCTTCGCCG CGGCGTCCCG GTGGACGTGG TGACCGACCA GACCAGCGCC CACGACCCGC TGTCGTACCT GCCGGAAGGG GTCGAGCTGA CCGACGCCCG CGACTACGCG GCGGCCAAGC CGGCCGAGTT CACCGACCGT GCCCGCGCGT CGATGGCCCG GCACGTCGAG GCGATGGTCG GCTTCCTCGA CGCGGGCGCC GAGGTCTTCG ACTACGGCAA CTCGATCCGC GGCGAGGCGC AGCTCGGTGG ATACTCGCGC GCCTTCGACT TCCCGGGTTT CGTGCCCGCC TACATCCGTC CGCTGTTCTG CGCGGGCAAG GGCCCGTTCC GGTGGGCGGC GCTCTCCGGC GACCCGGCCG ACATCGCCGC CACCGACCGG GCCATCCTCG ACCTCTTCCC GGAGAACGAA CCGCTGGCCC GGTGGATCCG GATGGCCGGC GAACGGGTGG CGTTCCAGGG ACTACCAGCC CGGATCTGCT GGCTCGGCTA CGGCGAACGA GACCGGGCCG GGGTGCGGTT CAACGAGATG GTCGCCGCCG GGGAGTTGTC CGCACCGGTG GTCATCGGGC GCGACCACCT GGACTGCGGT AGCGTCGCCA GCCCGTACCG GGAGACCGAG GCGATGGCCG ACGGCTCCGA TGCGATCGCC GACTGGCCGC TGCTCAACGC ACTGGTGAAC ACGGCCAGTG GGGCCTCGTG GGTGTCCATC CACCATGGTG GCGGGGTCGG GATCGGCCGG TCCATCCACG CCGGCCAGGT CTGCGTCGCC GACGGCAGCG CCCTCGCCGG GCAGAAGATC GAACGGGTGC TCACCAACGA CCCGGCGATG GGCGTCGTGC GACACGTCGA CGCCGGCTAC GACGAGGCCC GGCAGGTCGC CGAACGGACC GGGCTACACA TCCCGATGAC AGCGGCGTAA
|
Protein sequence | MTQPIRATRG TTRTAQGWPQ EAARRMLMNN LDPEVAERPE DLVVYGGTGK AARDWPSYRA LLDTLTDLRD DETMLVQSGR PVAVMRTHEW APRVLLANSN LVGDWATWPE FRRLEQLGLT MYGQMTAGSW IYIGTQGILQ GTYETFAAVA AKRFGGSLAG TLTLTAGCGG MGGAQPLAVT MNGGSCLIVD VDRSRLERRV REHYLDEVAD SLDDAVQRAL AARDQRRARS VGVVGNAATI FPELLRRGVP VDVVTDQTSA HDPLSYLPEG VELTDARDYA AAKPAEFTDR ARASMARHVE AMVGFLDAGA EVFDYGNSIR GEAQLGGYSR AFDFPGFVPA YIRPLFCAGK GPFRWAALSG DPADIAATDR AILDLFPENE PLARWIRMAG ERVAFQGLPA RICWLGYGER DRAGVRFNEM VAAGELSAPV VIGRDHLDCG SVASPYRETE AMADGSDAIA DWPLLNALVN TASGASWVSI HHGGGVGIGR SIHAGQVCVA DGSALAGQKI ERVLTNDPAM GVVRHVDAGY DEARQVAERT GLHIPMTAA
|
| |