Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_4285 |
Symbol | |
ID | 5319121 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009620 |
Strand | + |
Start bp | 777581 |
End bp | 779398 |
Gene Length | 1818 bp |
Protein Length | 605 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640776090 |
Product | peptidase M20 |
Protein accession | YP_001313023 |
Protein GI | 150376427 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG4187] Arginine degradation protein (predicted deacylase) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.194958 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.944988 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGAATG CACAGGAAAA GACTGCCTCT CGCGCGGAAA GCCGGATCGA CGGCGAGCAG GTGCGGCAAA TCGCCCTGCG CATCACGTCT TGGCCGAGCG AAACCGGCAC GCCGGGCGAA GCATCCTTCG CGGATCGCCT TCATGCTCTT CTCCGCGAGC TTCCCTATTT TCGGGAGCAT CCGGAAGACC TGCACCTCAT TGCAAGTCAC GGTGAGCCGC TGACCCGCAA TGTCGTTGCG CTCGTTCGCG GCACGGGTAA GCGAACGCTG GTCATGGCCG GCCACTTCGA CACGGTTTCG ACCGACAATT ATCACGAGCT TAAGGCCCTC GCCTGCGACA GCCTGGCGCT CAAGGATGCG CTCATCGAAA GCCTGTCGGC ACGCGACGAC CGGTCCGAGC AGGAAGAGCG GGCCCTGCAG GATCTGGCGA GCGGCGACTT TCTCCCCGGC CGCGGTCTGC TGGACATGAA GAGCGGGCTC GCGGTGGCCA TCGCCTGTCT TGAACAATTC GCGGCCGACA CGGGCCGGCA GGGCAATCTC ATGCTCGTCG CCACCCCCGA CGAGGAACGC GAAAGCCGGG GTATGCGATC GTTTCGAAAT GCATTGCCCG GTCTGGTCGG GGATTTCGAT ATCGAGATCG CCGGGGGCAT CAACCTCGAT GTGACCTCGG ATCAGGGCGA CGGGAGCGAA GGCCGGGCCG TTTACGCCGG CACGATCGGC AAGCTTCTGC CCTTTGCGCT GGTGATCGGC TGCAGTTCCC ATGCGAGCTA TCCCTTCGAA GGGGTAAGCG CACAGGCCAT GGCAGCCGGC ATCCTGGAGC GTCTGGAAGG GAACGCTTCC CTGGCGGATC GCGACGACAA CGACATTTCG CCGCCGCCGA TCTGCCTGGA GGCGAAGGAT TTGCGCGACG GTTACGAGGT GACGACGCCG GAGCGTTTCT GGATAGCTTT CAACTGGCTC TACCATGCGA TGACGGCGGA CGCACTCTTT GCGCGCTTCC GAGAGGAAGT GCTGACCGGC GCGAACGAAG CCATCGAGAA GTTTGCGGCA CAATCTGCCG AATACGGCAG GCTCGTCGGC AGACGGGCGG GCGTGATGCC GGCCACGCCG CACCTGATGT CGTTCGGGGA ATTGCGGGCG GCGGCTGCAC GGGTTTTCGG AGACGGCTTC GACGCGTTCT ATGCCGAGAA GGAAAGCGTA TTCTCTCAGA GCGACAACCC GCTCGTCGCC ACGCGGCAAC TGACGGAGTG GCTCGTCGGC ATCGCGCGCC TCTCCGGTCC CGCCATCGTC ATCGGATTTG CCGGCCTGCA CTACCCGCCT AGCCATCTGC GCCTGAGCGA AGGAAACGAC CGGTCCCTTC ATCAGGCGGT CGAGAAGGCG CGTGCCAGTC TCGGCAACGA TCCCGCACGA AGCCTCGTCT GGAAGCCGCA TTTCTACGGA ATCTCGGATA TGAGTTTTCT CGGGCTTGCG GCAGGCGATA GCCACATCGT TTCGGACAAT ACCCCAATCT CGAGGCTCGT CGATCGGCCG GGCGAGAATG CGCTGCGCTT TCCCACGGTT AACCTCGGTC CCGGGGGAGG GAGTTCCATC AGAAGTTCGA GCGCGTATAC GCGCCTTACG CCTTCGAGGT CCTCCCGGAT ATGGTTTTCG AGATCGCGAG GCGCTTTCTC TCCGACTGCA GTCACTGAAA CGGACGCCGC CTTGGTGCGC TTGGCAACGG ACACTTTTAA ACAAGGATTG CGCAGGCGCC CTAACGAGGC GCCGCGGTTT TACGCGTTCC CCCTAGGAAA TCAGCGTAGC GGTCGCCCAG CATTATGA
|
Protein sequence | MKNAQEKTAS RAESRIDGEQ VRQIALRITS WPSETGTPGE ASFADRLHAL LRELPYFREH PEDLHLIASH GEPLTRNVVA LVRGTGKRTL VMAGHFDTVS TDNYHELKAL ACDSLALKDA LIESLSARDD RSEQEERALQ DLASGDFLPG RGLLDMKSGL AVAIACLEQF AADTGRQGNL MLVATPDEER ESRGMRSFRN ALPGLVGDFD IEIAGGINLD VTSDQGDGSE GRAVYAGTIG KLLPFALVIG CSSHASYPFE GVSAQAMAAG ILERLEGNAS LADRDDNDIS PPPICLEAKD LRDGYEVTTP ERFWIAFNWL YHAMTADALF ARFREEVLTG ANEAIEKFAA QSAEYGRLVG RRAGVMPATP HLMSFGELRA AAARVFGDGF DAFYAEKESV FSQSDNPLVA TRQLTEWLVG IARLSGPAIV IGFAGLHYPP SHLRLSEGND RSLHQAVEKA RASLGNDPAR SLVWKPHFYG ISDMSFLGLA AGDSHIVSDN TPISRLVDRP GENALRFPTV NLGPGGGSSI RSSSAYTRLT PSRSSRIWFS RSRGAFSPTA VTETDAALVR LATDTFKQGL RRRPNEAPRF YAFPLGNQRS GRPAL
|
| |