Gene Sama_0790 details

Gene Information

Locus tag: Sama_0790
Symbol: valS
ID: 4603042
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella amazonensis SB2B
Kingdom: Bacteria
Replicon accession: NC_008700
Strand:
Start bp: 973085
End bp: 975946
Gene Length: 2862 bp
Protein Length: 953 aa
Translation table: 11
GC content: 57%
IMG OID: 639780124
Product: valyl-tRNA synthetase
Protein accession: YP_926667
Protein GI: 119773927
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0525] Valyl-tRNA synthetase
TIGRFAM ID: [TIGR00422] valyl-tRNA synthetase
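
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record; it assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The variable names are illustrative.

```python
# Values copied from the Gene Information fields above.
start_bp = 973085
end_bp = 975946
reported_gene_length = 2862      # bp
reported_protein_length = 953    # aa

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length, gene_length % 3, protein_length)  # prints: 2862 0 953
print(gene_length == reported_gene_length)           # prints: True
print(protein_length == reported_protein_length)     # prints: True
```

Under these assumptions the reported gene length (2862 bp) and protein length (953 aa) agree with the start and end coordinates.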


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAAAAGA CATACAATCC ACAGTCCATC GAACAGGCCC TCTACCGCGT TTGGGAAGAA 
AAGGGTTACT TCAAGCCACA CGGTGATGCC AGCCAGGGCA ACTACTGCAT CATGATCCCA
CCGCCGAACG TCACCGGCAG CCTGCACATG GGTCACGCCT TCCAGGACAC CATCATGGAT
ACCCTCATCC GCTATCAGCG CATGAAGGGC AAAAACACCC TGTGGCAAGT AGGTACCGAC
CATGCTGGTA TCGCCACCCA GATGCTGGTT GAGCGTAAGC TGGAAGCCGA ACAGGGCAAG
AGCCGCCACG ATCTCGGCCG CGACGCCTTT ATGGAAAAGG TTTGGGAGTG GAAAGCCCAG
TCCGGCGGTA CCATCACCAG CCAGCTGCGC CGCATGGGCG CCTCCGTGGA TTGGGACCGT
GAGCGCTTCA CCATGGATGA AGGCCTGTCC AACGCCGTGC AGGAAGTGTT CGTGCGCCTG
TACGACGACA AACTGATTTA CCGCGGCAAG CGCCTCGTAA ACTGGGATCC CAAGCTGCAC
ACTGCCATTT CCGATCTGGA AGTGGAAAAC AAAGAAAAAG CGGGCCACAT GTGGCACTTC
CGCTATCCAC TGGCCGGCAC CGAGCTGACC GCCGATGGCA AAGACTATCT GGTAGTAGCC
ACTACCCGCC CTGAAACCAT GCTGGGCGAC AGCGCGGTTG CGGTACACCC CGAAGATGAG
CGTTATGCGT CACTGATTGG CAAAGAAATC ATTCTGCCTA TCGTGAACCG CCGCATCCCC
ATCATCGCCG ATGAATACGT GGATAAAGAC TTCGGTACCG GCTGCGTGAA AATCACCCCG
GCCCACGACT TCAACGACTA CGAAGTGGGC AAGCGCCACA GCCTGCCGAT GTTCAACATC
CTCACTCAGG ATGCCACCAT CCGTGCCCTG GCCGAGGTAC TGAACACCGA CGGCAGCCAC
AACAGCGAGC TGGATGCCAG CCTGCCTGAG CGCTATGCCG GCCTCGACCG CTTCAAGGCC
CGTGACGCCA TCGTGGCCGA GTTTGAGACT CTGGGTCTCT TGGAAAAAAT CGAGCCACAC
GCCCTCAAGG TGCCTTATGG CGATCGCTCC GGCGTCGTGA TTGAGCCGCT GCTTACTGAT
CAGTGGTACG TTGCGGTGCA GAAACTCGCT CAGCCTGCCA TCGAAGCGGT GGAAAACGGC
GACATCAAGT TTGTGCCTCA GCAATACGAA AACATGTACT TCTCCTGGAT GCGTGACATT
CAGGACTGGT GTATCTCCCG TCAGCTGTGG TGGGGTCACC GCATTCCTGC GTGGTACGAC
GAAGCCGGTA AGGTGTACGT TGGCCGCAGC GAAGATGAAG TGCGCCAGAA CCATAACCTC
GGCAGCGATG TGAAACTGCG TCAGGATGAC GATGTACTGG ACACCTGGTT CTCCTCTGCC
CTGTGGACTT TCTCAACCCT GGGCTGGCCG GAACAAACCC CAGAGCTCAA GACCTTCCAC
CCCACCGACG TACTGGTGAC AGGTTTTGAT ATCATCTTCT TCTGGGTTGC CCGGATGATC
ATGATGACCA TGTACTTCAT CAAAGACGAA GACGGCAAGC CGCAGGTACC ATTCAAGACG
GTTTACGTCA CCGGTCTTAT CCGCGACGAA GCTGGCAACA AGATGTCCAA GTCCAAGGGT
AACGTCCTCG ACCCGCTGGA TATGATTGAC GGTATCGACC TTGAATCTCT GGTTCAGAAG
CGTACCGGCA ACATGATGCA ACCACAGCTT GCCGCCAAGA TAGAAAAGAG CACCCGCAAG
GAGTTCGAAA ACGGCATCGA GCCACACGGC ACAGACGCGC TGCGCTTTAC CCTGGCGGCC
ATGGCCTCTA CCGGCCGTGA CATCAACTGG GACATGAAGC GCCTCGACGG TTACCGCAGC
TTCTGTAACA AGCTGTGGAA CGCCTCTCGC TACGTGCTGA TGAACACCGA AGAGCAGGAT
TGTGGCCAAG GCGGTGGTGA TATGAAGCTG TCGCTGGCCG ACCGCTGGGT CATCGGCAAG
TTCCAGGAAA CCGTCAAAGC CTTCGACGAG CACATCAATG CCTATCGTTT TGACCTGGCC
GCCAACACCC TGTACGAGTT CACCTGGAAC CAGTTCTGTG ACTGGTATCT GGAGCTGACC
AAGCCAGTAC TGCAAAACGG CTCTGAAGCC GAGCAGCGCG GCACCCGTCA TACCCTGGTG
ACTGTACTTG AGCAGCTGCT GCGCCTGATG CACCCCATGA TGCCGTACAT CACCGAGACC
ATCTGGGATC GCGTGAAGCC CTTGGCAGGT GTTGAGGGTG ACACCCTGAT GCTGATGTCC
TTCCCCGAGT TCGATGCCGC CAAGGTAGAT GCCAAGGCCA TGGCCGATCT CGAATGGGTC
AAGCAGGTGA TTGTGGCCGT GCGTAACATT CGCGCCGAGC TCAACATCGC GCCAAGCAAG
CCGTTGTCAG CCCTGCTGCG CGGTGTAAGC GATGAAGACA AGGCCCGCAT CGAAGCCAAC
CAGGCCTTCT TCGGCACGCT TGCCAAGCTT GAGAGCATGA CCATTCTCGC CGAAGGCGAA
GCCGCCCCCA TGGCCACCAC TCAGCTGATT GGTGAAATGG AACTGCTTAT TCCTATGGCA
GGCCTGATTG ACGTTGCCGC CGAGATGGCC CGTATCGACA AGCAGCTGGA GAAGCTGACT
GGCGAAGTGG CCCGCATCGA AGGCAAGCTG TCCAACCAAG GCTTTGTGGC CAAGGCCCCG
GCTGAAGTGA TTGAGAAAGA GCGCGCCAAG GCCGCCGACA TCAAGCGCGA CATGGACAAA
CTGACCGAGC AGAAAGCCGA GCTTGCGAAA CTTGAAGCCT GA
 
Protein sequence
MEKTYNPQSI EQALYRVWEE KGYFKPHGDA SQGNYCIMIP PPNVTGSLHM GHAFQDTIMD 
TLIRYQRMKG KNTLWQVGTD HAGIATQMLV ERKLEAEQGK SRHDLGRDAF MEKVWEWKAQ
SGGTITSQLR RMGASVDWDR ERFTMDEGLS NAVQEVFVRL YDDKLIYRGK RLVNWDPKLH
TAISDLEVEN KEKAGHMWHF RYPLAGTELT ADGKDYLVVA TTRPETMLGD SAVAVHPEDE
RYASLIGKEI ILPIVNRRIP IIADEYVDKD FGTGCVKITP AHDFNDYEVG KRHSLPMFNI
LTQDATIRAL AEVLNTDGSH NSELDASLPE RYAGLDRFKA RDAIVAEFET LGLLEKIEPH
ALKVPYGDRS GVVIEPLLTD QWYVAVQKLA QPAIEAVENG DIKFVPQQYE NMYFSWMRDI
QDWCISRQLW WGHRIPAWYD EAGKVYVGRS EDEVRQNHNL GSDVKLRQDD DVLDTWFSSA
LWTFSTLGWP EQTPELKTFH PTDVLVTGFD IIFFWVARMI MMTMYFIKDE DGKPQVPFKT
VYVTGLIRDE AGNKMSKSKG NVLDPLDMID GIDLESLVQK RTGNMMQPQL AAKIEKSTRK
EFENGIEPHG TDALRFTLAA MASTGRDINW DMKRLDGYRS FCNKLWNASR YVLMNTEEQD
CGQGGGDMKL SLADRWVIGK FQETVKAFDE HINAYRFDLA ANTLYEFTWN QFCDWYLELT
KPVLQNGSEA EQRGTRHTLV TVLEQLLRLM HPMMPYITET IWDRVKPLAG VEGDTLMLMS
FPEFDAAKVD AKAMADLEWV KQVIVAVRNI RAELNIAPSK PLSALLRGVS DEDKARIEAN
QAFFGTLAKL ESMTILAEGE AAPMATTQLI GEMELLIPMA GLIDVAAEMA RIDKQLEKLT
GEVARIEGKL SNQGFVAKAP AEVIEKERAK AADIKRDMDK LTEQKAELAK LEA
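
The gene and protein listings above are printed in blocks of 10 characters, so they need the whitespace removed before use. The sketch below is an illustration, not part of the original record: it strips the spacing and translates DNA with a hand-built codon table. The helper names `clean_sequence` and `translate` are hypothetical. The amino acid assignments used are those of the standard genetic code, which translation table 11 shares (the two tables differ only in which codons may act as start codons). Only the first and last lines of the gene sequence are used as sample input.

```python
from itertools import product

# Codon table in TCAG order; amino acid assignments of the standard code,
# which are the same in translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text):
    """Remove the block spacing and line breaks from a sequence listing."""
    return "".join(text.split()).upper()


def translate(dna):
    """Translate DNA codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence listed above.
first_line = "ATGGAAAAGA CATACAATCC ACAGTCCATC GAACAGGCCC TCTACCGCGT TTGGGAAGAA"
last_line = "CTGACCGAGC AGAAAGCCGA GCTTGCGAAA CTTGAAGCCT GA"

print(translate(clean_sequence(first_line)))  # prints: MEKTYNPQSIEQALYRVWEE
print(translate(clean_sequence(last_line)))   # prints: LTEQKAELAKLEA
```

Both outputs match the corresponding start and end of the protein sequence above. The last line can be translated on its own because the 47 preceding lines hold 60 bases each, so it begins on a codon boundary.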