Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Snas_0100 |
Symbol | |
ID | 8881277 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Stackebrandtia nassauensis DSM 44728 |
Kingdom | Bacteria |
Replicon accession | NC_013947 |
Strand | - |
Start bp | 107995 |
End bp | 109650 |
Gene Length | 1656 bp |
Protein Length | 551 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | |
Product | urocanate hydratase |
Protein accession | YP_003508914 |
Protein GI | 291297636 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.431481 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGAAG CGAGAGAAGT TCGTGCCCCG CGCGGCACCG AGTTGACCTG CGCGAACTGG CAGATTGAGG CCCCGTACCG GATGTTGCAG AACAATCTGG ACCCCGACGT GGCCGAGCGT CCCGACGACC TGGTCGTGTA CGGCGGCACC GGTAAGGCCG CCCGGGACTG GCAGAGCTTC GACGCGCTGC TGCGCACCAT GCGCACGCTC AGGGGCGACG AGACGATGCT GGTGCAGAGC GGCAGGCCGG TCGGGGTCTT CCAAACCCAC GAGTGGGCGC CGCGGGTGCT GATCGCGAAC TCGAACCTCG TCGGCGACTG GGCGACCTGG CCGGAGTTTC GCAAGCTGGA GCAGCTCGGT CTGACGATGT ACGGCCAGAT GACGGCCGGG TCGTGGATCT ACATCGGAAC GCAGGGCATC CTGCAGGGCA CCTACGAGAC CTTCGCGGCG GTGGCCAACA AGCACTTCGG CGGCACCCTG GCCGGAACCC TGACCCTGAC CGCCGGGTGC GGCGGCATGG GCGGCGCGCA GCCGCTGGCC GTCACCATGA ACGACGGCGT GTGCCTGATC ATCGACGTGG ACGCCACCCG GCTGCGGCGC CGGGTCGAGA CCCGGTACCT GGACGTGGTC GCCGCCGACC TGGACGAGGC GGTGCGGCTG GCCACCGAGG CGAAACAGGA AAAGAAGCCG CTGTCGGTGG GCCTGGTGGG CAACGCCGCG CTGCTGGTGC CGCAGATCCT GCGCATGGGG GTGCCGGTCG ACATCGTCAC CGACCAGACC TCCGCCCACG ACCCGCTCGC ATATGTGCCC GACGGCATCG ACCTGGGCGA CGCCCCCGAC TACGCGGCCA AGAAGCCGGA GGAGTTCACC GACCGGGCGC GGGCCGCGAT GGCCAAGCAC GTGCAGGCCA TGGTGGAGTT CCAGGACGGC GGCGCGGTCG TGTTCGACTA CGGCAACTCG ATCCGCGGTG AGGCGAAACT GGGCGGCTAC GACCGCGCCT TCGACTTCCC CGGTTTCGTG CCCGCCTACA TCCGGCCGCT GTTCTGCGAG GGCAAGGGCC CGTTCCGGTG GGCGGCGCTG TCGGGCGACC CGAAGGACAT CGCCGCCACC GACCAGGCGA TCCTGGAGTT GTTCCCGGAG AACGAATCGC TGGCCCGCTG GATCCGGCTG GCCGGGGAGC GGGTGGCGTT CCAGGGCCTG CCGTCGCGGA TCTGCTGGCT GGGCTACGGC GAGCGGGACA AGGCCGGGGA GCGGTTCAAC GAGCTGGTCG CCGACGGCAC CATCTCGGCG CCGATCGCGA TCGGCCGCGA CCACCTGGAC ACCGGCAGCG TCGCGTCCCC GTACCGCGAA ACCGAGTCCA TGAAGGACGG ATCGGACGCG ATCGCGGACT GGCCACTGCT CAACGCGATG CTCAACACCT CCTCCGGCGC CACCTGGGTC TCGATCCACC ACGGCGGCGG GGTCGGCATG GGCCGCTCCA TCCACTCCGG ACAGGTGACG GTGGCCGACG GCACCCCGCT GGCGGCCGAG AAGATCGCCC GGGTCCTGAC CAACGACCCC GGCTCCGGGG TGATGCGGCA CGTGGACGCC GGATACGAAC GCGCCGAACA GGTCGCGCAC GAGCGAGGCG TCCGCGTCCC GATGAGGGAG TCATGA
|
Protein sequence | MSEAREVRAP RGTELTCANW QIEAPYRMLQ NNLDPDVAER PDDLVVYGGT GKAARDWQSF DALLRTMRTL RGDETMLVQS GRPVGVFQTH EWAPRVLIAN SNLVGDWATW PEFRKLEQLG LTMYGQMTAG SWIYIGTQGI LQGTYETFAA VANKHFGGTL AGTLTLTAGC GGMGGAQPLA VTMNDGVCLI IDVDATRLRR RVETRYLDVV AADLDEAVRL ATEAKQEKKP LSVGLVGNAA LLVPQILRMG VPVDIVTDQT SAHDPLAYVP DGIDLGDAPD YAAKKPEEFT DRARAAMAKH VQAMVEFQDG GAVVFDYGNS IRGEAKLGGY DRAFDFPGFV PAYIRPLFCE GKGPFRWAAL SGDPKDIAAT DQAILELFPE NESLARWIRL AGERVAFQGL PSRICWLGYG ERDKAGERFN ELVADGTISA PIAIGRDHLD TGSVASPYRE TESMKDGSDA IADWPLLNAM LNTSSGATWV SIHHGGGVGM GRSIHSGQVT VADGTPLAAE KIARVLTNDP GSGVMRHVDA GYERAEQVAH ERGVRVPMRE S
|
| |