Gene Shew_0923 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShew_0923 
SymbolvalS 
ID4922037 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella loihica PV-4 
KingdomBacteria 
Replicon accessionNC_009092 
Strand
Start bp1049573 
End bp1052452 
Gene Length2880 bp 
Protein Length959 aa 
Translation table11 
GC content57% 
IMG OID640162455 
Productvalyl-tRNA synthetase 
Protein accessionYP_001093053 
Protein GI127511856 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0525] Valyl-tRNA synthetase 
TIGRFAM ID[TIGR00422] valyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAAAGA CATACAACCC ACAGTCAATC GAACAGGCTC TGTACCAGAA CTGGGAAGAG 
AAAGGATACT TTAAGCCCCA CGGCGATGAG TCGAACGGCA ATTACTGCAT CATGATCCCG
CCACCAAACG TGACCGGTAG CCTGCACATG GGTCATGCCT TCCAAGACAC CATCATGGAT
ACCCTAATCC GTTACCAACG CATGAAGGGC AAGAACACCC TGTGGCAGGT AGGTACCGAC
CACGCGGGTA TCGCCACCCA GATGTTGGTG GAGCGCAAGG TAGAGGCCGA AGAGGGCAAG
AGCCGTCACG ACCTGGGCCG TGAAACCTTT ATCGACCGTA TCTGGGACTG GAAAAACCAA
TCTGGTGGCA CCATCACCAA GCAGCTGCGC CGCCTGGGCG CCTCGGTCGA TTGGGATCGT
GAACGCTTCA CCATGGATGA AGGCATGTCA GCCGCGGTAC AAGAGGTGTT TGTGCGTCTG
TACAACGACG ACCTCATCTA CCGTGGCAAG CGTCTGGTTA ACTGGGATCC TAAGCTACAC
ACAGCCATCT CAGACCTCGA AGTCGAAAAC AAAGAGAAGC AGGGCAGCAT GTGGCACTTC
CGCTACCCGC TGGCCGACGG CGCACTGACC GCCGACGGTA AAGACTACCT TGAAGTGGCT
ACCACGCGTC CAGAAACCAT GCTGGGCGAC AGCGCGGTTG CCGTACACCC AGACGATGAG
CGCTATCAGT CGCTGATCGG CAAGTTCATC CTGCTGCCAA TCGTTAACCG TCGTATCCCT
ATCGTCGCCG ACGACTATGT GGACATGGAG TTCGGTACCG GCTGTGTGAA GATCACCCCG
GCTCACGACT TCAACGACTA TGAGGTAGGT AAGCGTCACA ACCTGCCCAT GTTCAACATT
CTGACCATAG ACGCGGCCAT CCGCAGCCAG GCAGAGGTGG TTAACTCAGA TGGCACCGCC
AACGACGAGC TAGATGGCAG CCTACCAGAG CGCTTTGCCG GCTTGGACCG TTTCAAGGCC
CGTACCGCTA TCGTCGACGA GTTCGAGTCA CTCGGCCTGC TGGGCAAGAT TGACCCGCAC
GCCCTCAAGG TGCCTTACGG CGACCGCTCT GGCGTGGTGA TCGAGCCACT ACTGACAGAC
CAGTGGTATG TAGCCGTGGC CCCAATGGCC AAGACCGCCA TCGAAGCGGT TGAGAACGGT
GACATCAAGT TTGTGCCGCA GCAGTATGAA AACATGTACT TCTCTTGGAT GCGCGACATT
CAAGACTGGT GTATCTCGCG TCAGCTGTGG TGGGGTCACC GTATCCCGGC ATGGTACGAC
GAGGCCGGTA AGGTATATGT GGGCCGCGAC GAGGCCGAGG TACGCGCCAA GCATAATCTG
GATGATTCAG TGGTGCTGCG CCAAGACCCA GACGTGCTGG ATACCTGGTT CAGCTCGGCG
CTGTGGACCT TCTCAACCCT AGGCTGGCCA GACGATACCG AGGCCCTCAA GACCTTCCAC
CCAACCGATG TGTTGGTGAC AGGTTTCGAC ATCATCTTCT TCTGGGTGGC GCGCATGATC
ATGATGACCA TGCACTTCAT CAAGGATGAA GATGGCAAGC CACAGGTGCC GTTTAAGACG
GTGTATGTGA CCGGCCTTAT CCGTGACGAA CAGGGCAACA AGATGTCCAA GTCTAAGGGT
AACGTACTGG ATCCGCTGGA TATGATCGAC GGTATCGATC TCGAAGCCCT GGTTGAGAAG
CGCACCGGCA ACATGATGCA GCCTCAGCTG GCGGCTAAGA TCGAGAAGAG CACCCGTAAA
GAGTTTGCCG ACGGCATCGA GGCCCACGGC ACCGACGCCC TGCGTTTCAC CCTGGCAGCC
ATGGCCTCTA CCGGCCGTGA CATCAACTGG GACATGAAGC GTCTGGACGG TTACCGCAGC
TTCTGTAACA AGATCTGGAA CGCGTCGCGC TATGTGTTGA TGAACACAGA AGAGCAAGAT
TGTGGCCCTC AGTCTCCAAA CGGAAAAGCT GACGGCGAAA TGCAGCTGTC ACTGGCGGAT
CGCTGGATCG TCGGCCTGTT TAACCAGACG GTTAAGGCCT TTGACGAGCA CATGGAAAAC
TACCGTTTCG ACCTGGCGGC CAACACACTG TACGAGTTCA CCTGGAACCA GTTCTGTGAC
TGGTATCTTG AGCTGACCAA GCCAGTACTG CAAAACGGCA CCGAGGCCGA GCAGCGCGGT
ACCCGTCACA CCCTAGTGAC AGTACTGGAA GCCATGCAAC GTCTGCTGCA TCCTATGATG
CCATACCTGA CAGAGACCAT CTGGCAGCGC GTTAAGCCGC TGGCAGGCGT CGAGGGTGAC
ACCCTGATGC TGGCCGAGTT CCCTGTATAT CAGGCAAGCA AGGTAGATGA AGCCGCCATG
GCGGATCTTG AGTGGGTCAA GCAGGTGATC GTTGCCGTGC GTAACATCCG CGCCGAGCTG
AATATCGCCC CAAGCAAGCC GCTCAATGCC ATGCTACGCA GCGTCAGCGC GCAAGACAAG
GCTCGCGTCG AAGCTAACCA GACTTTCTTC GCCACCCTGG CCAAGCTGGA GTCGATGACG
ATTCTGGCGG ACGGTGACAC CGCGCCTATG TCGACCACTC AACTGGTTGG CGAGATGGAG
CTGTTGATCC CTATGGCAGG CCTTATCGAT GTGGCCGCCG AGATGGCGCG TATCGACAAG
CAGTTTGAAA AACTAGTGGG CGAAGCCAAG CGTATCGAAG GCAAGCTCAA TAACCAGGGC
TTCGTGGCCA AGGCGCCGGA AGCGGTTATT GAGAAGGAGC GCGCCAAGCT GGCCGAGTTC
CAACGGGATA TGGACAAGCT ATGCGAGCAG AAAGCCGAAC TGGCTAAGCT CGAAGGCTAA
 
Protein sequence
MEKTYNPQSI EQALYQNWEE KGYFKPHGDE SNGNYCIMIP PPNVTGSLHM GHAFQDTIMD 
TLIRYQRMKG KNTLWQVGTD HAGIATQMLV ERKVEAEEGK SRHDLGRETF IDRIWDWKNQ
SGGTITKQLR RLGASVDWDR ERFTMDEGMS AAVQEVFVRL YNDDLIYRGK RLVNWDPKLH
TAISDLEVEN KEKQGSMWHF RYPLADGALT ADGKDYLEVA TTRPETMLGD SAVAVHPDDE
RYQSLIGKFI LLPIVNRRIP IVADDYVDME FGTGCVKITP AHDFNDYEVG KRHNLPMFNI
LTIDAAIRSQ AEVVNSDGTA NDELDGSLPE RFAGLDRFKA RTAIVDEFES LGLLGKIDPH
ALKVPYGDRS GVVIEPLLTD QWYVAVAPMA KTAIEAVENG DIKFVPQQYE NMYFSWMRDI
QDWCISRQLW WGHRIPAWYD EAGKVYVGRD EAEVRAKHNL DDSVVLRQDP DVLDTWFSSA
LWTFSTLGWP DDTEALKTFH PTDVLVTGFD IIFFWVARMI MMTMHFIKDE DGKPQVPFKT
VYVTGLIRDE QGNKMSKSKG NVLDPLDMID GIDLEALVEK RTGNMMQPQL AAKIEKSTRK
EFADGIEAHG TDALRFTLAA MASTGRDINW DMKRLDGYRS FCNKIWNASR YVLMNTEEQD
CGPQSPNGKA DGEMQLSLAD RWIVGLFNQT VKAFDEHMEN YRFDLAANTL YEFTWNQFCD
WYLELTKPVL QNGTEAEQRG TRHTLVTVLE AMQRLLHPMM PYLTETIWQR VKPLAGVEGD
TLMLAEFPVY QASKVDEAAM ADLEWVKQVI VAVRNIRAEL NIAPSKPLNA MLRSVSAQDK
ARVEANQTFF ATLAKLESMT ILADGDTAPM STTQLVGEME LLIPMAGLID VAAEMARIDK
QFEKLVGEAK RIEGKLNNQG FVAKAPEAVI EKERAKLAEF QRDMDKLCEQ KAELAKLEG