Gene Smed_1151 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_1151 
SymbolvalS 
ID5321997 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp1222779 
End bp1225622 
Gene Length2844 bp 
Protein Length947 aa 
Translation table11 
GC content61% 
IMG OID640790092 
Productvalyl-tRNA synthetase 
Protein accessionYP_001326837 
Protein GI150396370 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0525] Valyl-tRNA synthetase 
TIGRFAM ID[TIGR00422] valyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0456653 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTTGATA AAACCTACGA TTCCGCAGCG GTAGAACCGA AGATCGCCAA AGCCTGGGAC 
GAGGCGGACG CCTTCCGTGC CGGTGCGAAT GCGAAGCCCG GGGCAGAGAC CTTCACGATC
GTGATCCCGC CGCCGAACGT GACGGGTTCG CTGCATATGG GTCATGCGCT CAACAATACG
CTGCAGGACC TCATGGTCCG CTTCGAACGC ATGCGCGGCA AGGACGTACT CTGGCAGCCG
GGTATGGACC ATGCGGGTAT CGCCACCCAG ATGGTCGTCG AGCGCCAGCT CAGGGAACGG
CAGCTTCCGG GCCGCCGCGA GATGGGACGC GAGGCCTTCA TTGAGAAGGT CTGGGAGTGG
AAGGCCGAAT CCGGTGGCCT GATCTTCAAT CAATTGAAGC GGCTCGGTGC CTCCTGCGAC
TGGTCGCGTG AGCGCTTCAC GATGGACGAG GGGCTTTCCG ACGCCGTCAT CGAGGTTTTC
GTCAGCCTTT ACAAGGAAGG CCTCATCTAT CGCGACAAGC GCCTGGTCAA CTGGGATCCG
AAGCTGCAAA CGGCGATTTC CGACATAGAG GTAGAGCAGG TCGAGATCAA TGGGCATCTC
TGGCACCTGC GCTACCCGCT CGAAGATGGC GTGACCTATG AGCACCCGGT TTCCTTCGAT
GAAGACGGCA ATGCGACGGA ATGGGAGACC CGCGACTATC TGGTCGTCGC GACCACGCGC
CCTGAGACCA TGCTCGGCGA TACCGGCGTT GCCGTTCATC CCGACGACGC GCGCTACAAA
GGGATTGTCG GCAAGCATGT CATTTTGCCG ATCGTCGGGC GCCGCATCCC GATCGTCGCC
GACGAGTATC CTGATCCGAC GACCGGCACC GGCGCGGTGA AGATGACGCC TGCCCACGAC
TTCAACGATT TCGACGTCGG CAAGCGCAGG GGGCTGCGCC AGGTCAATGT TCTGACAGCG
GACGGCCGGA TCACGATCAA GAACAACGAG GATTTCCTCG AGGGCCTCGA CCATCCGGCT
GCGCTCCATG GCGCCTGGGA CCAGCTGGAG GGCAAGGATC GCTTCGAGGC CCGTAAGCTC
ATCGTGGAGA TGCTCGAAGA GGCTGGCCTC GTCGACCATA TCGAACCACA CAAGCATATG
GTGCCCCATG GCGACCGCGG CGGCGTTCCC ATCGAGCCGC GCCTGACCGA GCAATGGTAT
GTCGACGCCA AGACCCTGGC GAAGCCTGCC ATTGCAGCCG TCAAGGAAGG CAGGACCAAC
TTCGTTCCGA AGAATTGGGA AAAGACCTAT TTCGAGTGGA TGGAGAACAT TCAGCCCTGG
TGTATTTCGC GCCAGCTCTG GTGGGGGCAT CAGATTCCGG CCTGGTACGG TCCCGACGGG
CAGATTTTCG TCGAGCGAAC CGAGGAGGAA GCACTGCATG CGGCGATTCA GCATTATATC
GCCCATGAAG GACCGATGAA GGCCTATGTC GAGGACTTGC TCGAAAACTT CAAGCCGGGT
GAGATCCTGA CGCGTGACGA GGACGTTCTC GACACCTGGT TCTCCTCCGC GCTCTGGCCC
TTCTCCACGC TCGGTTGGCC GAAGGAGACG CCGGAGCTCG ACAAATACTA TCAGACCGAT
GTGCTGGTAA CGGGCTTCGA CATCATCTTC TTCTGGGTCG CTCGGATGAT GATGATGGGC
CTTCATTTCA TGAAGGATGC GGACGGCACG CCGGTCGAGC CGTTCCACAC GGTTTATGTT
CATGCGCTCG TTCGGGACAA GAACGGTCAG AAGATGTCGA AGTCCAAGGG CAACGTCATC
GACCCCCTCG AGCTCATCGA CGAATACGGC GCCGACGCGC TGCGCTTCAC GCTGGCGATA
ATGGCGGCTC AGGGGCGCGA CGTGAAGCTC GATCCGGCCC GGATTGCGGG CTACCGCAAC
TTCGGAACCA AACTCTGGAA CGCAACCCGT TTTGCCGAGA TGAACGGTGC AACGAGCAGC
GAAGGCTTCA TCCCGGAGGC GGCTTCGCTC ACGACCAATC GCTGGATTCT GACGGAGCTG
TCGCGGACCA TCCGCGATGT GAGCGAAGCA ATCGAGGACT ACCGCTTCAA CGATGCGGCC
GGAACCCTCT ACCGCTTCGT CTGGAACCAG TTCTGCGACT GGTATCTCGA ACTCCTGAAG
CCCGTCTTCA ATGGCGACGA CGAGGCGGCC AAACGGGAGT CCCAGGCCTG CACTGCCTAT
GTTCTGGACG AAACCTACAA GCTCCTGCAT CCATTCATGC CCTTCATGAC GGAAGAGCTT
TGGGACAAGA CGAGCGGCCC AGGGCGCGAG CGCGCCACGC TTCTTTGTCA CGCCGAATGG
CCTGCGGCCT TCTATGCGGA TGATGCGGCG GCGGACGAGA TCAACTGGCT GATCGACCTC
GTCTCGGGCA TTCGTTCCGT TCGGGCCGAA ATGAACGTCC CCCCGGCGGC GATGGCCCCC
CTTGTGGTGG TCGGGGCGAA GGCGCAAACG CGCGAGCGTC TCGATCGGCA CGCTTCCGCC
ATAAAGCGTC TGGCGAGGGT GGAGAAGATC AAACCTGCTG CTGTGGCCCC GCGCGGCAGC
GCGCAGATCG TCATCGGCGA GGCGACGGCT TGCCTTCCGC TTGGCAGCCT CATCGATCTC
GCAGCCGAGA AACTGCGCCT GGAAAAGGCG ATCGCGAAAG TCGACGTCGA GCGGGAGCGG
ATCCTCGGCA AGCTGGCCAA CGAGAAATTC GTTGCCAACG CCAGGCCCGA GCTGGTGGAG
GCCGAACGCG AGCGGCTGGT CGAACTCGAC GCGCAGAAGG ACTCGCTCGG CATTGCTTTG
TCCCGCGTTT CAGAAGCGGG CTGA
 
Protein sequence
MLDKTYDSAA VEPKIAKAWD EADAFRAGAN AKPGAETFTI VIPPPNVTGS LHMGHALNNT 
LQDLMVRFER MRGKDVLWQP GMDHAGIATQ MVVERQLRER QLPGRREMGR EAFIEKVWEW
KAESGGLIFN QLKRLGASCD WSRERFTMDE GLSDAVIEVF VSLYKEGLIY RDKRLVNWDP
KLQTAISDIE VEQVEINGHL WHLRYPLEDG VTYEHPVSFD EDGNATEWET RDYLVVATTR
PETMLGDTGV AVHPDDARYK GIVGKHVILP IVGRRIPIVA DEYPDPTTGT GAVKMTPAHD
FNDFDVGKRR GLRQVNVLTA DGRITIKNNE DFLEGLDHPA ALHGAWDQLE GKDRFEARKL
IVEMLEEAGL VDHIEPHKHM VPHGDRGGVP IEPRLTEQWY VDAKTLAKPA IAAVKEGRTN
FVPKNWEKTY FEWMENIQPW CISRQLWWGH QIPAWYGPDG QIFVERTEEE ALHAAIQHYI
AHEGPMKAYV EDLLENFKPG EILTRDEDVL DTWFSSALWP FSTLGWPKET PELDKYYQTD
VLVTGFDIIF FWVARMMMMG LHFMKDADGT PVEPFHTVYV HALVRDKNGQ KMSKSKGNVI
DPLELIDEYG ADALRFTLAI MAAQGRDVKL DPARIAGYRN FGTKLWNATR FAEMNGATSS
EGFIPEAASL TTNRWILTEL SRTIRDVSEA IEDYRFNDAA GTLYRFVWNQ FCDWYLELLK
PVFNGDDEAA KRESQACTAY VLDETYKLLH PFMPFMTEEL WDKTSGPGRE RATLLCHAEW
PAAFYADDAA ADEINWLIDL VSGIRSVRAE MNVPPAAMAP LVVVGAKAQT RERLDRHASA
IKRLARVEKI KPAAVAPRGS AQIVIGEATA CLPLGSLIDL AAEKLRLEKA IAKVDVERER
ILGKLANEKF VANARPELVE AERERLVELD AQKDSLGIAL SRVSEAG