Gene Information |
Locus tag | Swit_4194 |
Symbol | |
ID | 5198717 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sphingomonas wittichii RW1 |
Kingdom | Bacteria |
Replicon accession | NC_009511 |
Strand | - |
Start bp | 4615068 |
End bp | 4616288 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640583748 |
Product | amidohydrolase |
Protein accession | YP_001264672 |
Protein GI | 148557090 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1228] Imidazolonepropionase and related amidohydrolases |
TIGRFAM ID | |
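The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 4615068
end_bp = 4616288
gene_length_bp = 1221
protein_length_aa = 406

# Assumption: both coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
codons, remainder = divmod(span, 3)

# Assumption: the last codon is a stop codon and contributes no amino acid.
expected_protein_length = codons - 1

print("span matches gene length:", span == gene_length_bp)
print("length is a multiple of 3:", remainder == 0)
print("protein length matches:", expected_protein_length == protein_length_aa)
```

Under these assumptions all three checks agree with the listed values: 1221 bp is 407 codons, and 407 minus the stop codon gives 406 aa.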
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 0.268221 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0453774 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGTCCGCGA TATTGCACAA CGGCCCGATC TTCGACGGGC ACTCCGCCGA CCTGTTGCAG GGACAGGCCA TCTATGTCGA AGACGGGCTG ATCCGCGAAA TCGCGCCGCT CGACAAGCTT CCGGGCGCGG AACGGCGGAT CGACCTGAAC GGGCATTTCG TGATGCCCGG CCTGATCGAC GCCCATTTCC ACGCCTATGG GATCGAGGTC GACCTCGAAA AGGTCGACCA CATCTCCCCC GCCCTGCGCA GCCTCCATGC CCGGCGCTTC CTGGAAAGCG CGCTGCACCG CGGCTTCACC ACCGTGCGCG ATGCCGCCGG CGGCGATCTG CCGCTGGCGA CCGCCCTCGA ACAGGGCCTG ATCGACGGGC CGCGCTTCTT CTTTCCCGGC CTCGCCATCA GCCAGACCGG CGGGCATGGC GATTTCCGCC TGCCCGACCA TTATGACGCC TGCGCCTGCG CTTATTGCGG GGCGCTGGCG ACCGTCGCCG ACGGCCCCGA CGAGGTGCGC CGGGTCGTGC GCGATCAGCT CCGCAAGGGC GCGCACCACA TCAAGCTGTT CGTGTCCGGC GGCGTGCTGT CGCGCACCGA CCCGATCTGG ATGCGGCAGT TCAGCGACGC CGAGATCCGG GTCGCGGTCG AAGAGGCCGA GACGCGCCGC GCCTATGTCA TGGCGCATGT CCATACCAAT GAGGCGGCGC TGCGCTGCGT CGCCAACGGC GTCCGTTCGC TGGAGCACGT CACCATCCTC GAGCGGGATG GGGCCGACGC CATCGTCGCC GCCGGCGCTT TCGCCGTGCC GACCTTCGCC ATCGGCGACG CGATGAAGGA ACGCGCCGAG CAGATGGGAT TGCCCGCGGC GATCCTCGAC AAGGTCCGCG CGATGGGCGA CGTGGCCTAT GCGTCGCTGG ACCATCTGCG ACAGGCCGGG GCCCAGATCG GTTTCGGCAC CGACCTGCTG GGCCCGCTGA TGGATCGCCA GGCCCGCGAG TTCCGCTTGC GGCTGCCCGT CTGCAGTCCG GTCGAGATCC TGCGATCGGC GACCTCGGTC AACGCCGCGC TGCTTCAGAT GGAAGGCAAG TTGGGGACGA TCGCGCCCGG CGCCTGCGCG GACATCATCG CCATCGACGG CAATCCGCTG GACGACATCG CCCTGTTCGA ACAGCAGGAG CGCATCGGCT TCATCATGCG CGACGGCAAG GTCGTCCGAT GCGCGCTTTA G
Protein sequence | MSAILHNGPI FDGHSADLLQ GQAIYVEDGL IREIAPLDKL PGAERRIDLN GHFVMPGLID AHFHAYGIEV DLEKVDHISP ALRSLHARRF LESALHRGFT TVRDAAGGDL PLATALEQGL IDGPRFFFPG LAISQTGGHG DFRLPDHYDA CACAYCGALA TVADGPDEVR RVVRDQLRKG AHHIKLFVSG GVLSRTDPIW MRQFSDAEIR VAVEEAETRR AYVMAHVHTN EAALRCVANG VRSLEHVTIL ERDGADAIVA AGAFAVPTFA IGDAMKERAE QMGLPAAILD KVRAMGDVAY ASLDHLRQAG AQIGFGTDLL GPLMDRQARE FRLRLPVCSP VEILRSATSV NAALLQMEGK LGTIAPGACA DIIAIDGNPL DDIALFEQQE RIGFIMRDGK VVRCAL
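Both sequences are displayed in space-separated blocks of 10 characters, which must be joined before any downstream use. The sketch below shows one way to do that and to compute a GC fraction comparable to the GC content field; `join_blocks` and `gc_fraction` are hypothetical helper names, and only the first three blocks of the gene sequence are used as sample input.

```python
def join_blocks(text: str) -> str:
    """Remove the whitespace separating the 10-character display blocks."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return gc_count / len(dna)


# First three display blocks of the gene sequence, used only as sample input.
first_blocks = "ATGTCCGCGA TATTGCACAA CGGCCCGATC"
sequence = join_blocks(first_blocks)

print(len(sequence))
print(round(gc_fraction(sequence), 2))
```

Applying the same two helpers to the full gene sequence should give a length and GC percentage that can be compared with the Gene Length and GC content fields.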