Gene Smed_5202 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_5202 
Symbol 
ID5319504 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009621 
Strand
Start bp162271 
End bp163500 
Gene Length1230 bp 
Protein Length409 aa 
Translation table11 
GC content57% 
IMG OID640776980 
Productarginine deiminase 
Protein accessionYP_001313912 
Protein GI150377317 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2235] Arginine deiminase 
TIGRFAM ID[TIGR01078] arginine deiminase 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTACTG TCGGTGTCCA TTCGGAAGTC GGTAAGCTTA GAACGGTGAT GGTCTGCAGA 
CCATCCTTGG CTCATCAACG GCTGACGCCG GGCAACTGTC ACGACTTGCT TTTCGATGAT
GTCGTCTGGG TGCATGAGGC GCAGAAGGAC CATTACGATT TCGTTCTGAA AATGCAGGAA
CGAGGTGTGG AGGTCCTAGA GTTGCACGAC CTTCTAACCG ACACTCTGAT GGATGCCGAA
GCGCGCAAGT TCGTGCTTGA TCGCCGGGCC ACACCCAATG TCATGGGATC CCAAATCGCC
GAACTCGTCC GTCCTTGGAT GGAGGAAATG GATCCCAAGC GCCTGGCTGC TTTCCTGATC
GGTGGAATCT CTGTTGCAGA CCTCCCGGAG GGACAGGGCA AGACCCTGAT GGCATCAGCC
TTCGGAGCCA CCGAATTTGT CCTTCCCCCG ATACCCAACA CCCTGTTTCA GCGCGATCCG
TCCTGCTGGA TTTACAACGG AGTGACGTGC AACCCCATGT TCTGGCCGGC GCGGCGCGCA
GAAACTCTGG TTCAAAGGGC GATCTACAAG TTTCACCCTT CCTTCAAGAG TGCGAGCTTC
GATATTTGGT GGGGCGACTC CGACGAGCAG TTTGCCAACG CCACGATCGA AGGCGGCGAC
GTTATGCCTA TCGGCAATGG TACCCTTCTG GTGGGAATGG GGGAACGGAC CACTTACCAA
GCGGTTGGCC AGGTTGCCAA AACCTTGTTC AAGTCGGGAG CCGCTACGCG CGTCATCGGC
TGCCTTATGC CGAGGAGCCG CGCGGCGATG CACCTCGACA CGGTATTCAC ATTCTGTGAT
CGCGACGTAG TGACGCTATT CGCCGAGGTT GTAGATCGGA TCCGCTGCTA CAGCATGATC
CCTCTCGACG ATGAGGGAAA TTTCGAGGTG CGGCAGGAAG ATCGACCCAT GCTTGAAGTT
GTTGCCGAAG CATTAGGCGT CGACAAGCTT CGCACTATCG CAACCGGCGG CAACACCTAT
GAGGCTGAGC GCGAACAATG GGACGACGGA AACAATGTCG TCGCGCTCGA GCCGGGAGTA
GTCGTTGCTT ATGATCGGAA CACCTATACC AACACCCTGC TCCGCAAGGC AGGCATCGAG
GTCATCACAA TCCGTGGCTC CGAATTGGGC CGAGGACGCG GCGGCGGTCA TTGCATGACG
TGTCCGATCT GGCGAGAGCC GACTGAATGA
 
Protein sequence
MRTVGVHSEV GKLRTVMVCR PSLAHQRLTP GNCHDLLFDD VVWVHEAQKD HYDFVLKMQE 
RGVEVLELHD LLTDTLMDAE ARKFVLDRRA TPNVMGSQIA ELVRPWMEEM DPKRLAAFLI
GGISVADLPE GQGKTLMASA FGATEFVLPP IPNTLFQRDP SCWIYNGVTC NPMFWPARRA
ETLVQRAIYK FHPSFKSASF DIWWGDSDEQ FANATIEGGD VMPIGNGTLL VGMGERTTYQ
AVGQVAKTLF KSGAATRVIG CLMPRSRAAM HLDTVFTFCD RDVVTLFAEV VDRIRCYSMI
PLDDEGNFEV RQEDRPMLEV VAEALGVDKL RTIATGGNTY EAEREQWDDG NNVVALEPGV
VVAYDRNTYT NTLLRKAGIE VITIRGSELG RGRGGGHCMT CPIWREPTE