Gene Shewmr4_1130 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShewmr4_1130 
SymbolvalS 
ID4251798 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sp. MR-4 
KingdomBacteria 
Replicon accessionNC_008321 
Strand
Start bp1328854 
End bp1331730 
Gene Length2877 bp 
Protein Length958 aa 
Translation table11 
GC content50% 
IMG OID638117711 
Productvalyl-tRNA synthetase 
Protein accessionYP_733267 
Protein GI113969474 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0525] Valyl-tRNA synthetase 
TIGRFAM ID[TIGR00422] valyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00231601 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAAAAA CATACGATCC TCAGTCCATT GAGCAGACTC TTTACCAAAA CTGGGAAGAG 
CAAGGTTACT TTAAGCCCCA CGGCGATGCC TCACAGGGCA ACTATTGCAT CATGATCCCG
CCACCTAACG TGACGGGCAG CTTGCACATG GGCCACGCTT TCCAAGATAC CATCATGGAT
ACTCTGATCC GCTACCAACG TATGAAGGGT AAAAACACCC TGTGGCAAGT CGGTACTGAC
CATGCGGGTA TTGCGACTCA AATGCTGGTT GAGCGTAAGC TTGAAGCCGA AGAAGGCAAG
AGCCGCCATG ACCTTGGCCG CGATGCTTTT ATGGAAAAAG TATGGGAATG GAAAGCGCAA
TCTGGCGGCA CCATCACTAA GCAGCTGCGT CGTATGGGCG CCTCTGTGGA TTGGGACCGT
GAGCGCTTTA CCATGGACGA AGGCTTGTCT AAAGCCGTTC AAGAAGTGTT TGTCCGTCTC
TATGAAGATG ATTTGATTTA CCGTGGTAAA CGTTTAGTGA ATTGGGATCC TAAACTACAT
ACCGCGATTT CTGATCTCGA AGTGGAAAAC AAAGAAAAGC AAGGCCACAT GTGGCACCTA
CGTTATCCAT TAGCCGATGG CGAGCTGACT GCCGACGGTA AAGATTACCT CGAAGTCGCC
ACCACGCGTC CAGAAACTAT GCTGGGCGAT AGCGCGGTTG CAGTTCATCC AGACGATGAG
CGTTATCAAG CGCTGATCGG CAAATTTATT CTGCTGCCTA TCGTTAATCG TCGCATCCCT
ATCGTTGCCG ATGACTATGT AGACATGGCC TTTGGCACAG GCTGTGTGAA GATCACTCCT
GCCCACGACT TTAACGACTA TGAAGTCGGT AAACGTCACA AACTTCCTAT GTTCAACGTG
TTGACCTTAG ATGCGGCCAT TCGTGCATCC GCTGAAGTCG TCAATACCGA TGGCACCATC
AACACTAGCT TAGATGGCAG CCTGCCAGAG CGTTTCGCAG GCCTTGACCG TTTCAAAGCC
CGTGATGCTA TCGTGGCCGA GTTTGAAACC TTAGGTCTGC TCGAAAAAAT CGCGCCACAT
GGCCTAAAAG TGCCCTACGG TGACCGCTCT GGCGTGGTAA TCGAACCAAT GCTGACCGAT
CAATGGTACG TCGCCGTTGC GCCTATGGCG AAAACCGCTA TCGAAGCCGT TGAAAATGGC
GACATTAAGT TTGTGCCGCA GCAATACGAA AACATGTACT TCTCTTGGAT GCGAGACATT
CAAGATTGGT GTATCTCTCG CCAATTATGG TGGGGTCACC GCATCCCCGC TTGGTACGAT
GCGAACGGTA AGGTGTATGT TGGCCGTAAC GAAGCTGAAG TGCGTGCTAA GCACAATATC
GATGATTCAA TTGCCCTGCG CCAAGATGAA GACGTACTGG ATACTTGGTT CAGCTCAGCC
CTGTGGACAT TCTCAACCTT AGGCTGGCCT GACAATGTTG AAGATTTAAA AACCTTCCAC
CCGACCGACG TGTTGGTGAC GGGTTTTGAC ATCATCTTCT TCTGGGTTGC CCGTATGATC
ATGATGACCA TGCACTTCAT CAAAGATGAA GACGGTAAGC CACTGGTGCC ATTTAAAACC
GTTTATGTGA CTGGCCTTAT CCGCGATGAA GCGGGTAACA AGATGTCAAA ATCTAAGGGC
AACGTACTTG ACCCATTGGA TATGATTGAC GGTATCGATC TAGAATCCCT AGTTGAAAAA
CGCACCGGCA ACATGATGCA ACCTCAACTT GCCGCCAAGA TTGAAAAGAG CACCCGTAAA
GAATTTGAAA ACGGTATCGA AGCCCACGGT ACCGATGCCC TGCGCTTTAC CCTAGCGGCA
ATGGCCTCAA CTGGCCGTGA TATCAACTGG GATATGAAGC GTTTAGACGG TTACCGCAGC
TTCTGTAACA AGCTTTGGAA CGCGTCACGT TACGTGCTGA TGAATACCGA AGGCCAAGAT
TGCGGCCCGA ACTCACCGGA TTACCAAGGC GGCGAGATGG AACTGTCACT GGCGGATCGC
TGGATCATCG GTTTATTCAA CCAAACCGTG AAAACCTACG ACGACCACAT GGCAAACTAC
CGCTTCGACC TTGCGGCGAA CACGCTGTAC GAGTTCACTT GGAACCAGTT CTGCGATTGG
TATTTAGAGT TAACTAAACC AGTACTGCAA AACGGCAATG AAGCGCAAAT GCGTGGCACC
CGCCATACGC TGGTCAATGT GTTAGAAGCG ATGCAACGCT TGATGCACCC AATGATGCCA
TACATCACTG AAACCATCTG GCAACGCGTT AAACCCCTGA CTGGCGCCCA AGGCGATACC
ATCATGCTGG CACCATTCCC AAGCTACGAT GCCGCCAAAG TAGATGCGAC CGCGATGGCC
GATCTCGAGT GGGTTAAGCA AGTTATTGTT GCAGTGCGTA ATATCCGTGC CGAGCTCAAT
ATTGCACCGT CGAAGCCATT AAATGCCCTA CTACGTGGCG TGAGTGCCCA AGACCAAGGC
CGTGTTGAAG CGAACCAAGC CTTCTTCACA ACGCTTGCGC GCTTAGAGAG CATGACGATT
CTTGGCGAAG GTGAAACGGC GCCTATGTCG ACCACAGGCC TGATCGGCGA AATGGAATTA
TTAATTCCGA TGGCAGGTCT GGTGGATGTC GCGGCCGAAA TGGCTCGCAT CGACAAGCAG
CTTGAGAAGC TGACTCAAGA GATTGCTCGT ATCGAAGGTA AATTATCGAA TGAAGGCTTC
GTCGCGAAAG CACCGCCAGC CGTCATTGAT AAAGAGCGCG CCAAGATGGC CGATCTGAGC
CGTGATATGG ACAAACTAAA AGAGCAAAAA GCCGAGTTTG CTAAGCTAGA AGCCTAA
 
Protein sequence
MEKTYDPQSI EQTLYQNWEE QGYFKPHGDA SQGNYCIMIP PPNVTGSLHM GHAFQDTIMD 
TLIRYQRMKG KNTLWQVGTD HAGIATQMLV ERKLEAEEGK SRHDLGRDAF MEKVWEWKAQ
SGGTITKQLR RMGASVDWDR ERFTMDEGLS KAVQEVFVRL YEDDLIYRGK RLVNWDPKLH
TAISDLEVEN KEKQGHMWHL RYPLADGELT ADGKDYLEVA TTRPETMLGD SAVAVHPDDE
RYQALIGKFI LLPIVNRRIP IVADDYVDMA FGTGCVKITP AHDFNDYEVG KRHKLPMFNV
LTLDAAIRAS AEVVNTDGTI NTSLDGSLPE RFAGLDRFKA RDAIVAEFET LGLLEKIAPH
GLKVPYGDRS GVVIEPMLTD QWYVAVAPMA KTAIEAVENG DIKFVPQQYE NMYFSWMRDI
QDWCISRQLW WGHRIPAWYD ANGKVYVGRN EAEVRAKHNI DDSIALRQDE DVLDTWFSSA
LWTFSTLGWP DNVEDLKTFH PTDVLVTGFD IIFFWVARMI MMTMHFIKDE DGKPLVPFKT
VYVTGLIRDE AGNKMSKSKG NVLDPLDMID GIDLESLVEK RTGNMMQPQL AAKIEKSTRK
EFENGIEAHG TDALRFTLAA MASTGRDINW DMKRLDGYRS FCNKLWNASR YVLMNTEGQD
CGPNSPDYQG GEMELSLADR WIIGLFNQTV KTYDDHMANY RFDLAANTLY EFTWNQFCDW
YLELTKPVLQ NGNEAQMRGT RHTLVNVLEA MQRLMHPMMP YITETIWQRV KPLTGAQGDT
IMLAPFPSYD AAKVDATAMA DLEWVKQVIV AVRNIRAELN IAPSKPLNAL LRGVSAQDQG
RVEANQAFFT TLARLESMTI LGEGETAPMS TTGLIGEMEL LIPMAGLVDV AAEMARIDKQ
LEKLTQEIAR IEGKLSNEGF VAKAPPAVID KERAKMADLS RDMDKLKEQK AEFAKLEA