Gene Smed_4285 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_4285 
Symbol 
ID5319121 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp777581 
End bp779398 
Gene Length1818 bp 
Protein Length605 aa 
Translation table11 
GC content63% 
IMG OID640776090 
Productpeptidase M20 
Protein accessionYP_001313023 
Protein GI150376427 
COG category[E] Amino acid transport and metabolism 
COG ID[COG4187] Arginine degradation protein (predicted deacylase) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.194958 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.944988 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGAATG CACAGGAAAA GACTGCCTCT CGCGCGGAAA GCCGGATCGA CGGCGAGCAG 
GTGCGGCAAA TCGCCCTGCG CATCACGTCT TGGCCGAGCG AAACCGGCAC GCCGGGCGAA
GCATCCTTCG CGGATCGCCT TCATGCTCTT CTCCGCGAGC TTCCCTATTT TCGGGAGCAT
CCGGAAGACC TGCACCTCAT TGCAAGTCAC GGTGAGCCGC TGACCCGCAA TGTCGTTGCG
CTCGTTCGCG GCACGGGTAA GCGAACGCTG GTCATGGCCG GCCACTTCGA CACGGTTTCG
ACCGACAATT ATCACGAGCT TAAGGCCCTC GCCTGCGACA GCCTGGCGCT CAAGGATGCG
CTCATCGAAA GCCTGTCGGC ACGCGACGAC CGGTCCGAGC AGGAAGAGCG GGCCCTGCAG
GATCTGGCGA GCGGCGACTT TCTCCCCGGC CGCGGTCTGC TGGACATGAA GAGCGGGCTC
GCGGTGGCCA TCGCCTGTCT TGAACAATTC GCGGCCGACA CGGGCCGGCA GGGCAATCTC
ATGCTCGTCG CCACCCCCGA CGAGGAACGC GAAAGCCGGG GTATGCGATC GTTTCGAAAT
GCATTGCCCG GTCTGGTCGG GGATTTCGAT ATCGAGATCG CCGGGGGCAT CAACCTCGAT
GTGACCTCGG ATCAGGGCGA CGGGAGCGAA GGCCGGGCCG TTTACGCCGG CACGATCGGC
AAGCTTCTGC CCTTTGCGCT GGTGATCGGC TGCAGTTCCC ATGCGAGCTA TCCCTTCGAA
GGGGTAAGCG CACAGGCCAT GGCAGCCGGC ATCCTGGAGC GTCTGGAAGG GAACGCTTCC
CTGGCGGATC GCGACGACAA CGACATTTCG CCGCCGCCGA TCTGCCTGGA GGCGAAGGAT
TTGCGCGACG GTTACGAGGT GACGACGCCG GAGCGTTTCT GGATAGCTTT CAACTGGCTC
TACCATGCGA TGACGGCGGA CGCACTCTTT GCGCGCTTCC GAGAGGAAGT GCTGACCGGC
GCGAACGAAG CCATCGAGAA GTTTGCGGCA CAATCTGCCG AATACGGCAG GCTCGTCGGC
AGACGGGCGG GCGTGATGCC GGCCACGCCG CACCTGATGT CGTTCGGGGA ATTGCGGGCG
GCGGCTGCAC GGGTTTTCGG AGACGGCTTC GACGCGTTCT ATGCCGAGAA GGAAAGCGTA
TTCTCTCAGA GCGACAACCC GCTCGTCGCC ACGCGGCAAC TGACGGAGTG GCTCGTCGGC
ATCGCGCGCC TCTCCGGTCC CGCCATCGTC ATCGGATTTG CCGGCCTGCA CTACCCGCCT
AGCCATCTGC GCCTGAGCGA AGGAAACGAC CGGTCCCTTC ATCAGGCGGT CGAGAAGGCG
CGTGCCAGTC TCGGCAACGA TCCCGCACGA AGCCTCGTCT GGAAGCCGCA TTTCTACGGA
ATCTCGGATA TGAGTTTTCT CGGGCTTGCG GCAGGCGATA GCCACATCGT TTCGGACAAT
ACCCCAATCT CGAGGCTCGT CGATCGGCCG GGCGAGAATG CGCTGCGCTT TCCCACGGTT
AACCTCGGTC CCGGGGGAGG GAGTTCCATC AGAAGTTCGA GCGCGTATAC GCGCCTTACG
CCTTCGAGGT CCTCCCGGAT ATGGTTTTCG AGATCGCGAG GCGCTTTCTC TCCGACTGCA
GTCACTGAAA CGGACGCCGC CTTGGTGCGC TTGGCAACGG ACACTTTTAA ACAAGGATTG
CGCAGGCGCC CTAACGAGGC GCCGCGGTTT TACGCGTTCC CCCTAGGAAA TCAGCGTAGC
GGTCGCCCAG CATTATGA
 
Protein sequence
MKNAQEKTAS RAESRIDGEQ VRQIALRITS WPSETGTPGE ASFADRLHAL LRELPYFREH 
PEDLHLIASH GEPLTRNVVA LVRGTGKRTL VMAGHFDTVS TDNYHELKAL ACDSLALKDA
LIESLSARDD RSEQEERALQ DLASGDFLPG RGLLDMKSGL AVAIACLEQF AADTGRQGNL
MLVATPDEER ESRGMRSFRN ALPGLVGDFD IEIAGGINLD VTSDQGDGSE GRAVYAGTIG
KLLPFALVIG CSSHASYPFE GVSAQAMAAG ILERLEGNAS LADRDDNDIS PPPICLEAKD
LRDGYEVTTP ERFWIAFNWL YHAMTADALF ARFREEVLTG ANEAIEKFAA QSAEYGRLVG
RRAGVMPATP HLMSFGELRA AAARVFGDGF DAFYAEKESV FSQSDNPLVA TRQLTEWLVG
IARLSGPAIV IGFAGLHYPP SHLRLSEGND RSLHQAVEKA RASLGNDPAR SLVWKPHFYG
ISDMSFLGLA AGDSHIVSDN TPISRLVDRP GENALRFPTV NLGPGGGSSI RSSSAYTRLT
PSRSSRIWFS RSRGAFSPTA VTETDAALVR LATDTFKQGL RRRPNEAPRF YAFPLGNQRS
GRPAL