Gene Ent638_3650 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEnt638_3650 
SymbolvalS 
ID5111898 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEnterobacter sp. 638 
KingdomBacteria 
Replicon accessionNC_009436 
Strand
Start bp3951382 
End bp3954237 
Gene Length2856 bp 
Protein Length951 aa 
Translation table11 
GC content55% 
IMG OID640493855 
Productvalyl-tRNA synthetase 
Protein accessionYP_001178358 
Protein GI146313284 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0525] Valyl-tRNA synthetase 
TIGRFAM ID[TIGR00422] valyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.227036 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAAAGA CATACAACCC ACGAGATATC GAACAGCCGC TTTACGAGCA CTGGGAAGAG 
CAGGGCTATT TCAAGCCTAA CGGCGATGAA AGCAAAGAGT CCTTCTGCAT CATGATCCCG
CCGCCGAACG TCACCGGCAG TTTGCATATG GGTCATGCTT TCCAGCAAAC CATCATGGAC
ACCATGATCC GCTACCAGCG CATGCAGGGG AAAAATACCC TGTGGCAGGC AGGTACTGAC
CACGCGGGTA TTGCGACCCA GATGGTGGTG GAGCGTAAAA TTGCCGCTGA AGAAGGTAAA
ACCCGCCACG ATTACGGTCG CGACGCCTTT ATCGACAAAA TTTGGCAGTG GAAAGCAGAA
TCTGGCGGCA CCATTACCCG TCAGATGCGC CGTCTCGGCA ACTCCGTGGA CTGGGAGCGC
GAGCGCTTCA CCATGGACGA AGGCCTTTCC AATGCCGTGA AAGAAGTGTT CGTCCGTCTG
TACAAAGAAG ATCTGATTTA CCGTGGCAAG CGCCTGGTAA ACTGGGATCC AAAACTGCGC
ACGGCAATTT CCGATTTGGA AGTGGAAAAC CGCGAGTCGA AAGGCTCGAT GTGGCACATC
CGTTATCCGC TGGCCGATGG TGCGAAAACC GCAGACGGTA AAGATTACCT GGTCGTGGCG
ACGACCCGTC CGGAAACCCT GCTGGGCGAT ACCGGCGTGG CTGTTAACCC GGAAGATCCG
CGCTACAAAG ATCTGATTGG CAAGTTCGTC ATGCTGCCGC TGGTGAATCG TCGCATCCCG
ATTCTGGGCG ACGAACACGC TGATATGGAA AAAGGCACCG GCTGTGTGAA AATCACCCCG
GCGCACGACT TCAACGACTA TGAAGTTGGC CGTCGTCACG CCCTGCCGAT GATCAACATC
TTTACCTTTG ACGGTGACAT CCGCGAAAGC GCAGAAGTGT ACGACACCAA AGGCGAAGAA
TCTGACGTTT ACCCAAGCGA TATTCCAGCA GAATTCCAGA AGCTGGAACG TTTTGCCGCG
CGTAAAGCGG TGGTTGCCGC CATTGACGCG CTTGGCCTGC TGGAAGACGT TAAGCCACAC
GATCTGACCG TTCCTTACGG CGACCGTGGC GGCGTGGTTA TCGAACCGAT GCTGACCGAC
CAGTGGTACG TACGCGCTGA CGTGCTGGCA AAACCTGCGG TTGAAGCGGT TGAAAACGGC
GATATTCAGT TCGTGCCGAA GCAGTACGAA AACATGTACT TCTCCTGGAT GCGTGATATT
CAGGACTGGT GTATTTCTCG TCAGCTGTGG TGGGGTCACC GTATCCCGGC GTGGTACGAC
AACGACGGCA ACGTGTTTGT CGGCCGCACC GAAGAAGAAG TGCGTCAGGA AAATAACCTC
AGTGCAGACG TTGCGCTGCG TCAGGATGAC GACGTTCTCG ATACCTGGTT CTCTTCCGCA
TTGTGGACCT TCTCTACGCT CGGCTGGCCA GAGAACACCG ACGCGCTGCG TCAGTTCCAC
CCAACCAGCG TGATGGTGTC CGGCTTCGAC ATCATCTTCT TCTGGATCGC CCGCATGATC
ATGATGACCA TGCACTTCAT CAAAGATGAA AACGGCAAAC CTCAGGTTCC GTTTAAAACC
GTCTACATGA CCGGTTTGAT TCGTGACGAT GAAGGCCAGA AGATGTCTAA ATCCAAGGGC
AACGTTATCG ACCCACTGGA TATGGTTGAC GGTATCTCTC TGCAAGATCT GCTCGAGAAA
CGTACCGGCA ACATGATGCA GCCGCAGCTG GCGGAAAAAA TCGCTAAGCG TACCGAGAAG
CAATTCCCGG ACGGCATCGA GCCGCACGGC ACCGACGCCC TGCGTTTCAC CCTGGCGGCG
CTGGCTTCTA CCGGTCGCGA CATCAACTGG GACATGAAAC GTCTGGAAGG TTACCGCAAC
TTCTGTAACA AACTGTGGAA CGCCAGCCGC TTTGTGCTGA TGAACACTGA AGATCAGGAT
TGCGGCTTCA ACGGCGGTGA AATGACGCTG TCGCTGGCGG ATCGCTGGAT CCTCGCGGAA
TTCAACCAGA CCATCAAAGC ATACCGTGAA GCGCTGGACA CTTACCGCTT TGATATTGCG
GCAGGTATCC TGTACGAATT CACCTGGAAC CAGTTCTGCG ACTGGTATCT GGAGCTGACC
AAGCCGGTGA TGAACGGCGG CAATGAAGCG GAACTACGCG GAACGCGCAA CACGCTGATC
ACCGTTCTGG AAGGTTTGCT GCGCCTGGCG CATCCGATCA TTCCATTCAT TACCGAAACC
ATCTGGCAGC GCGTGAAGGT TATCAAAGGC ATTACTGCCG ATACCATCAT GCTCCAGCCG
TTCCCGGAGT TTGATGCCGC ACAGGTGGAT GAAGCCGCGG CTTCTGATAC TGAATGGCTG
AAACAAGCGA TCGTTGCCGT GCGTAATATT CGTGCGGAAA TGTACATCTC CCCGGGCAAA
CCGCTGGAAC TCCTGCTGCG CGGTTGCAGT GATGCGGCGG TTCGTCGTGT TAATGAGAAC
AGCAGCTTCT TGCAGAACAT GGCGCGTCTG GAAAGCATTA CCGTGCTGCC AGCCAATGAA
AAAGGTCCGG TTTCCGTGAC CAAAATCATC GATGGCGCCG AGCTGCTGAT CCCAATGGCT
GGTCTTATCG ACAAGGACAC TGAGCTGGCG CGTCTGGCGA AAGAAGTGAC CAAAGTCGAG
ATCGAAATTG GCAAAATCGA AAGCAAGCTG TCTAACGAAG GTTTCGTGGC ACGTGCACCA
GAAGCGGTTA TCGCGAAAGA GCGCGAACGT CTGGTCGCTT TCGCTGATGC AAAAGCGAAA
CTGATCGAAC AGCAAGCGGT TATCGCTGCG CTGTAA
 
Protein sequence
MEKTYNPRDI EQPLYEHWEE QGYFKPNGDE SKESFCIMIP PPNVTGSLHM GHAFQQTIMD 
TMIRYQRMQG KNTLWQAGTD HAGIATQMVV ERKIAAEEGK TRHDYGRDAF IDKIWQWKAE
SGGTITRQMR RLGNSVDWER ERFTMDEGLS NAVKEVFVRL YKEDLIYRGK RLVNWDPKLR
TAISDLEVEN RESKGSMWHI RYPLADGAKT ADGKDYLVVA TTRPETLLGD TGVAVNPEDP
RYKDLIGKFV MLPLVNRRIP ILGDEHADME KGTGCVKITP AHDFNDYEVG RRHALPMINI
FTFDGDIRES AEVYDTKGEE SDVYPSDIPA EFQKLERFAA RKAVVAAIDA LGLLEDVKPH
DLTVPYGDRG GVVIEPMLTD QWYVRADVLA KPAVEAVENG DIQFVPKQYE NMYFSWMRDI
QDWCISRQLW WGHRIPAWYD NDGNVFVGRT EEEVRQENNL SADVALRQDD DVLDTWFSSA
LWTFSTLGWP ENTDALRQFH PTSVMVSGFD IIFFWIARMI MMTMHFIKDE NGKPQVPFKT
VYMTGLIRDD EGQKMSKSKG NVIDPLDMVD GISLQDLLEK RTGNMMQPQL AEKIAKRTEK
QFPDGIEPHG TDALRFTLAA LASTGRDINW DMKRLEGYRN FCNKLWNASR FVLMNTEDQD
CGFNGGEMTL SLADRWILAE FNQTIKAYRE ALDTYRFDIA AGILYEFTWN QFCDWYLELT
KPVMNGGNEA ELRGTRNTLI TVLEGLLRLA HPIIPFITET IWQRVKVIKG ITADTIMLQP
FPEFDAAQVD EAAASDTEWL KQAIVAVRNI RAEMYISPGK PLELLLRGCS DAAVRRVNEN
SSFLQNMARL ESITVLPANE KGPVSVTKII DGAELLIPMA GLIDKDTELA RLAKEVTKVE
IEIGKIESKL SNEGFVARAP EAVIAKERER LVAFADAKAK LIEQQAVIAA L