Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Shewmr7_0092 |
Symbol | |
ID | 4255554 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella sp. MR-7 |
Kingdom | Bacteria |
Replicon accession | NC_008322 |
Strand | - |
Start bp | 104334 |
End bp | 105560 |
Gene Length | 1227 bp |
Protein Length | 408 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 638120687 |
Product | imidazolonepropionase |
Protein accession | YP_736155 |
Protein GI | 114045605 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1228] Imidazolonepropionase and related amidohydrolases |
TIGRFAM ID | [TIGR01224] imidazolonepropionase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTTGGG ATCAGGTTTG GATAGACGTT AACGTAGCAA CAATGGACCC TTCCATATCA GCACCTTACG GCGCAATTAC CAATGCGGCT ATCGCAGTAA AAGACGGTAA AATTGCCTGG TTAGGCCCAC GCAGCGAGCT GCCCGCCTTC GATGTGTTGT CCATTCCTGT TTACAGGGGC AAGGGCGGTT GGATCACTCC TGGGCTGATT GATGCCCACA CCCATTTAGT ATTTGCCGGT AATCGTGCCA ACGAATTCGA GCTACGCCTA AAGGGCGCCA CCTACGAGGA AATCGCCCGT GCTGGCGGCG GCATTATTTC CACGGTTAAC GCCTGCCGTG AGGCCGACGA AGCCGAGTTA TTTGATCTCG GTCGCCAGCG TTTAAATGCC TTGGCGAAGG AAGGCGTTAC CACGGTTGAG ATTAAATCTG GCTACGGTTT AGATACCGAA ACCGAACTCA AAATCCTGCG TGTTGCCCGC GAACTCGGCC AACATCACCA TGTGGATGTG AAGACCACCT TCCTCGGTGC CCATGCGGTG CCGCCCGAGT TTAAAGACAA TAGCGACGGC TATGTCGACT TAATCATCAA TAAGATGCTG CCTGCAGTGA TTGCCGAAAA CCTCGCCGAT GCGGTGGATG TATTCTGTGA AAACATCGCC TTTAACCTAG AGCAAACCGA GCGCGTACTG AGCGCCGCCA AAGCAGCTGG CCTGCAAGTT AAACTGCATG CCGAGCAATT ATCCAATATG GGTGGCTCTG AATTAGCCGC ACGTTTAGGG GCTAAGTCGG TTGATCATAT TGAATATTTA GATGAGGCTG GTGTTAAAGC CCTAAGTGAA AGTGGCACCT GCGCCGTACT GTTACCGGGC GCGTTTTACT TTTTGCGGGA AACCCAAAAA CCGCCTATCG ACTTATTGCG TCAATACGGT GTGCCTATGG TGCTCGCCAG CGACTTTAAT CCCGGCTCAT CACCCATCTG CTCGACCCTG CTGATGCTGA ACATGGGTTG CACCCTATTC CGCTTAACCC CAGAGGAAGC GCTTGCGGGT TTAACATTGA ATGCCGCCAA GGCACTAGGG ATTGAAGAGA ATGTCGGCAG CTTAGTGGTT GGTAAGCAGG CGGATTTCTG TCTATGGGAT ATCGCCACCC CAGCACAACT CGCCTATAGC TACGGCGTGA ATCCCTGCAA GGATGTAGTG AAAAACGGTA AGTTAGTGCA TCAATAA
|
Protein sequence | MSWDQVWIDV NVATMDPSIS APYGAITNAA IAVKDGKIAW LGPRSELPAF DVLSIPVYRG KGGWITPGLI DAHTHLVFAG NRANEFELRL KGATYEEIAR AGGGIISTVN ACREADEAEL FDLGRQRLNA LAKEGVTTVE IKSGYGLDTE TELKILRVAR ELGQHHHVDV KTTFLGAHAV PPEFKDNSDG YVDLIINKML PAVIAENLAD AVDVFCENIA FNLEQTERVL SAAKAAGLQV KLHAEQLSNM GGSELAARLG AKSVDHIEYL DEAGVKALSE SGTCAVLLPG AFYFLRETQK PPIDLLRQYG VPMVLASDFN PGSSPICSTL LMLNMGCTLF RLTPEEALAG LTLNAAKALG IEENVGSLVV GKQADFCLWD IATPAQLAYS YGVNPCKDVV KNGKLVHQ
|
| |