Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sala_1688 |
Symbol | |
ID | 4081091 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sphingopyxis alaskensis RB2256 |
Kingdom | Bacteria |
Replicon accession | NC_008048 |
Strand | + |
Start bp | 1776993 |
End bp | 1778204 |
Gene Length | 1212 bp |
Protein Length | 403 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 638010062 |
Product | imidazolonepropionase |
Protein accession | YP_616734 |
Protein GI | 103487173 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1228] Imidazolonepropionase and related amidohydrolases |
TIGRFAM ID | [TIGR01224] imidazolonepropionase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.823771 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGCATC GGCGCGACAC TGTCCAGATG GAGCAACGCT GGACGAACGC CCGACTCGCG ACGATGGCCG GTGACGATCT CGGCCTGATC GACGATGGCG TGGTCGCGGC AAAGGATGGC CGCATCATCT ATGCCGGTCC GGCGAAGGAC GCGCCTGCGA CGCAAGGTGC GGTCCATGAC TGCGAGGGCC GCCTGATCAC GCCCGGCCTG ATCGACTGCC ACACCCATCT GATACATGGC GGCAACCGTG CGAACGAATG GGCGATGAGG CTCGAAGGTG CGAGCTATGA CGAGATCGCC CGCGCGGGCG GCGGCATCGT CTCGACGATG CGCGCGACGC GCGACGCGAG CGAAGCCGAA CTGGTCGCGA GCGCCCTCCC CCGCCTCGAC GCGCTGATCG CCGAGGGTGT GACGACGATC GAGATCAAGT CAGGCTATGG GCTCTCGACC AACGACGAGT TGAAGATGCT GCGCGCCGCG CGCGCGCTCG CCAACATCCG CGCGATCCGC GTCGAGCCGA CCTTCCTCGG CGCGCACGCG CTGCCTCCCG AATATCAAGG CGACAGCGAC GCCTATATCG ACCTCGTCGT GGGCGAGATG ATCCCGGCCG TGGCCTCGCT AGCCACCGCG GTCGACGCCT TTTGCGAAGG CATCGGCTTT TCGCCCGAAC AGTGCGCGCG CGTCCTTGCC GCCGCGAAGG CCCATGGGCT CAAGGTCAAG CTCCACGCCG AACAATTGTC GGCGCTCCAC GGCAGCGCGC TCGCCGCGCG GCACGGCGCA TTGTCCGCCG ACCATCTCGA ACATGCGACC GACGAGGATG TGCGCGCGAT GGCCGAAGCC GGCAGCGTCG CGGTGCTGCT CCCCGGCGCC TATTATTTCA TGCGCGAAAC CAGACTGCCG CCCGTCCAGG CAATGCGCCG CCATGGGACG CGCATCGCGC TGGCGACCGA CAACAACCCC GGAACCTCGC CGACGAGTTC GCTGCTGCTG ATGCTCAACA TGGGCGCGAC CTTGTTCGGG CTGACCGTCA TCGAAGCGCT GCGCGGCGTG ACGGTCAACG CCGCCGCCGC GCTCGGCCTG TCGGCGGAGA TCGGCACGAT TGAGGTCGGC AAGGCCTGCG ACCTCGCCAT CTGGGACGTC GGCGACCCCG CCGAGCTGGT CTATCGCATC GGCTTCAACC CGCTGCACCA ACGTATCAAG GACGGACAAT GA
|
Protein sequence | MTHRRDTVQM EQRWTNARLA TMAGDDLGLI DDGVVAAKDG RIIYAGPAKD APATQGAVHD CEGRLITPGL IDCHTHLIHG GNRANEWAMR LEGASYDEIA RAGGGIVSTM RATRDASEAE LVASALPRLD ALIAEGVTTI EIKSGYGLST NDELKMLRAA RALANIRAIR VEPTFLGAHA LPPEYQGDSD AYIDLVVGEM IPAVASLATA VDAFCEGIGF SPEQCARVLA AAKAHGLKVK LHAEQLSALH GSALAARHGA LSADHLEHAT DEDVRAMAEA GSVAVLLPGA YYFMRETRLP PVQAMRRHGT RIALATDNNP GTSPTSSLLL MLNMGATLFG LTVIEALRGV TVNAAAALGL SAEIGTIEVG KACDLAIWDV GDPAELVYRI GFNPLHQRIK DGQ
|
| |