Gene Smed_5087 details

Gene Information

Locus tag: Smed_5087
Symbol:
ID: 5319389
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009621
Strand:
Start bp: 36377
End bp: 37378
Gene Length: 1002 bp
Protein Length: 333 aa
Translation table: 11
GC content: 66%
IMG OID: 640776866
Product: LAO/AO transport system ATPase
Protein accession: YP_001313798
Protein GI: 150377203
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1703] Putative periplasmic protein kinase ArgK and related GTPases of G3E family
TIGRFAM ID: [TIGR00750] LAO/AO transport system ATPase
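
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the CDS ends with a single stop codon; the helper names `span_length` and `coding_codons` are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp, end_bp = 36377, 37378
gene_length_bp = 1002
protein_length_aa = 333


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Number of amino-acid codons, assuming exactly one trailing stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Both comparisons should hold if the record is internally consistent.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(coding_codons(gene_length_bp) == protein_length_aa)
```

Under these assumptions the reported gene length (1002 bp) and protein length (333 aa) agree with the coordinates.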


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGGGCCG TTGCGCGATC TCGAACGGCC AAGCGCGGCG CCTACACGCC CTCAACCGAA 
CTGGCGAAGG ACGTGCTTTC GGGTCGGCCG ATCGCCATCG CCCGCATGAT CTCCCGTGCC
GAAGACGGGC GCGCGGAAGC TGCCTCAGCC CTGGCCGAAC TCTATCTGCA GGGTGGGAGG
GCTCATATCG TCGGGATTAC CGGCGTTCCG GGTGCGGGAA AATCGACGCT CGTTTCCGCG
CTGATCAGGA CCTATGCCGA CGAAGGGCAC AAGGTCGGGG TCATCGCGGT CGATCCGAGC
AGCCCGTTTT CGGGCGGAGC AATTCTGGGC GATCGTGTGC GGATGAGTGA TGCGGCGGCG
GCGGGCAGCG CCTTCGTGCG CAGCATGGCA ACACGCGGCC ATCTCGGCGG ACTGGCGCGT
TCGACCCTCC GGGCCGTCGA TGTCCTGGAC GCCGCCGGAT ATTCGCCGAT CATCATCGAG
ACCGTCGGAG TAGGGCAGGA CGAGGTGGAG GTGGTCGCGG CCGCCCATAC CATCGTCGTC
CTTTCCGCCC CCGGGCTCGG CGACGACATT CAGGCGATCA AGGCCGGAAT TCTCGAGACC
GCGGACATCC ATGCCGTCAG CAAATGCGAC AAGCCCGAAG CATCCGCGAC CGTTTCGGCG
CTTCAGGGCA TGCTGGCACT TGGCGGCAGC AGCGCCGGGT CCGACTGGAA GCCGCCTGTC
CTGCCTGTCA GCTCCGTTTC GGGCGAACGC ATCGGCGATC TGAGGCGCGC CATCGGAGAG
CACTGGGCGC ATCTGCGCAG GAGCGGCGAA CTTGCCGTAC GCCAGCGCAA CATATGCAGG
ACGCGCATTC TGGGAACCGC CAAGCAGCTG TTTCAAAACA AATTCCAGCA GAAGCCCGAG
GCGCTCGAAC CTTACGTTCG GGCCGTGACG GAAAGGCGTA TGGACCCGTC CGTCGCCGCG
CGCGCTCTGC TGGGTTGGGA GGAAGACCCA TGGACGACGT GA
 
Protein sequence
MRAVARSRTA KRGAYTPSTE LAKDVLSGRP IAIARMISRA EDGRAEAASA LAELYLQGGR 
AHIVGITGVP GAGKSTLVSA LIRTYADEGH KVGVIAVDPS SPFSGGAILG DRVRMSDAAA
AGSAFVRSMA TRGHLGGLAR STLRAVDVLD AAGYSPIIIE TVGVGQDEVE VVAAAHTIVV
LSAPGLGDDI QAIKAGILET ADIHAVSKCD KPEASATVSA LQGMLALGGS SAGSDWKPPV
LPVSSVSGER IGDLRRAIGE HWAHLRRSGE LAVRQRNICR TRILGTAKQL FQNKFQQKPE
ALEPYVRAVT ERRMDPSVAA RALLGWEEDP WTT
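
The sequences above are printed in blocks of ten characters, so the spaces and line breaks need to be stripped before any analysis. The following is a minimal sketch, using only the Python standard library, of how the gene sequence could be cleaned, its GC fraction computed, and its translation checked against the protein sequence. The function names (`clean`, `gc_content`, `translate`) are hypothetical. The codon table is the amino-acid assignment shared by NCBI translation tables 1 and 11; table 11 differs in which codons may act as start codons, which this sketch does not model (the gene here begins with ATG, so that does not matter for this record).

```python
# Codon -> amino acid mapping in the conventional TCAG ordering.
# The assignments are the same for NCBI translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; upper-case the rest."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        amino_acid = CODON_TABLE[s[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First display line of the gene sequence in this record.
first_line = "ATGAGGGCCG TTGCGCGATC TCGAACGGCC AAGCGCGGCG CCTACACGCC CTCAACCGAA"
print(translate(first_line))  # MRAVARSRTAKRGAYTPSTE
```

The printed peptide matches the first twenty residues of the protein sequence listed above. Passing the full gene sequence to `gc_content` and `translate` would allow the GC content and Protein Length fields to be checked in the same way.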