Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_3843 |
Symbol | |
ID | 5318571 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009620 |
Strand | + |
Start bp | 300408 |
End bp | 301808 |
Gene Length | 1401 bp |
Protein Length | 466 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640775655 |
Product | amidohydrolase |
Protein accession | YP_001312588 |
Protein GI | 150375992 |
COG category | [F] Nucleotide transport and metabolism [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.865403 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACGGATT TGCGCGACAG GTTTCTGGGC AACGGTGATT TCCTGCTTCA CCCCGGCAAG GTCCTCTTGC CGGAAGGCCC TCGATCCGGA ATCGGGATAG TGGTCCGCAA GGGGCGCTTC TCGGAAATCG GTGCCGCAGG ACTTGTCGGC AGCCGGAATC CCGACCTGAG GCCGATAGAG TTGCCGCATC ATCTTGTGAT GCCAGGCTTC ATCGACACCC ATACCCATCT GACCCAGTCC CTGGGGAAGT CGCTCGTCTT CGGCGAGCCG TCCGAGATCT TTCGCCGTAT CTGGGTGCCG CTCGAAGGCA GTCTGGATGA ACGGATGGTC TATCTTTCGG CAAAGCTGGC GGCGCTTGAA TGCCTGCGCG GCGGCTTTAC CGCCGCAGTC GATGCGGGCA CGCGTTCCGC GGGCCACATG GACGCGCTGA TACGGGCGGC GCGTGAAACC GGCTTGCGTA GCGTCATAGG ACTTATCTGC AACGATCTCG GCGGAGCCGC GGTGGTCCCC GACCGCACGA CGATCCTCAG GAATGCGGCC GGACATCTGG CCGCATTCGA GGGCGATTCG CTCGTACATC CCTCACTCGC CATTTCCATT CCCGAGGCGG CCAGCGACCA CATGCTGGCT GACGTCTCCA GCATGGCGCG GGAAGCGGGT GTCATATTCC AGACCCATGT CAACGAACAC CTCGTCGCAG TCGAGCGCTC GCTGGTTGCA AACGGCCGCC GTCCGCTGGA GCATCTCGCT CATCTCGGCG CACTCGGCCC GCATGTGCTG ATCGCCCATT CCACGCTGGT GACACCGCAC GAACTGAACC TGTTGCGCCA CAGCGATACG GCGGTCGCAT ACAATCCGGT GGCGAGCTTG TGGAAAGGCA ATGCCATCGC ACCCGCGCTG CAAATGGCCG CACTCGGGAT CCGCTTCGGA CTGGGAACCG ACGGCACCCG CGCAGACGGT TTCCGCCTCA TGGATGCCGC CGAGGGCCTG CAGCGCGCCG GCTTCGGGCT TGCGACGGGC GACTCTTCCT GTGGAGGCGG CTGGCTCTGG ATCGACCGGG CAACAGCCCA GGCGGCGGAT GCCGCAGGTC TTGGCTGCGT GACCGGCGCG ATCCGCGAGA AGCTTGCGGC CGATTTCCTC CTGGTGGATC TCGACCGTCC CGAATTCACG CCCTCCCACG ATCTCATGTG GGAACTCGTG CGCTACGGCA ACCGCGACCA GATCGACGCC GTCTTCACCG CCGGAATGCT TCGCCTCTGG CAAGGCTGGC CGGTCCAATG GGATGCACGC GCGCTTCTTG CCGAGGTGCG CGAGGTCACG GCCGATGCCA TAGCAAGGGC GCCGATCCAG CGCGTACACA AGCCATCGGC GGAGCACCGG GCGCTGGGGC ATTTCGCATG A
|
Protein sequence | MTDLRDRFLG NGDFLLHPGK VLLPEGPRSG IGIVVRKGRF SEIGAAGLVG SRNPDLRPIE LPHHLVMPGF IDTHTHLTQS LGKSLVFGEP SEIFRRIWVP LEGSLDERMV YLSAKLAALE CLRGGFTAAV DAGTRSAGHM DALIRAARET GLRSVIGLIC NDLGGAAVVP DRTTILRNAA GHLAAFEGDS LVHPSLAISI PEAASDHMLA DVSSMAREAG VIFQTHVNEH LVAVERSLVA NGRRPLEHLA HLGALGPHVL IAHSTLVTPH ELNLLRHSDT AVAYNPVASL WKGNAIAPAL QMAALGIRFG LGTDGTRADG FRLMDAAEGL QRAGFGLATG DSSCGGGWLW IDRATAQAAD AAGLGCVTGA IREKLAADFL LVDLDRPEFT PSHDLMWELV RYGNRDQIDA VFTAGMLRLW QGWPVQWDAR ALLAEVREVT ADAIARAPIQ RVHKPSAEHR ALGHFA
|
| |