Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tcr_1503 |
Symbol | valS |
ID | 3761723 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thiomicrospira crunogena XCL-2 |
Kingdom | Bacteria |
Replicon accession | NC_007520 |
Strand | + |
Start bp | 1627978 |
End bp | 1630752 |
Gene Length | 2775 bp |
Protein Length | 924 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 637786234 |
Product | valyl-tRNA synthetase |
Protein accession | YP_391769 |
Protein GI | 78485844 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0525] Valyl-tRNA synthetase |
TIGRFAM ID | [TIGR00422] valyl-tRNA synthetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0000330515 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAAAAC ATTTCGATCC TAATTCTATT GAAACAAAGT GGTATCAAAC TTGGGAAAAA AGCGGCTACT TTAAGCCTCA AACCAGTACA ACTGGCGAAC ATTACAGCAT AATGATCCCT CCGCCTAATG TGACCGGAAG CCTTCATATG GGGCACGCTT TCCAAGATAC GATCATGGAT ACTTTAATTC GTTATCACCG TATGCGTGGC GACGAAACAC TTTGGCAACC TGGAACCGAC CATGCTGGTA TCGCTACCCA AATGGTGGTG GAACGTCAAT TGGCCGCGAA AGGGTTAAGT CGTCATGATT TAGGGCGTGA CGCTTTTACC CAAAAAATTT GGGAATGGAA GGCCGAATCA GGCGGTACGA TTACACAACA ACTTCGTCGT TTAGGCGCAT CGCCTGACTG GAGTCGCGAA CGGTTTACCA TGGATGACGG TTTATCTGAT GCGGTTAAAG AAGTATTTGT TCAGCTGCAT GAAGAAGGCC TGATTTACCG AGGCAAGCGT TTGGTGAACT GGGATCCGGT TCTGCATACG GCTGTTTCCG ATTTGGAAGT CCTTTCAGAA GAAGAAATGG GCAGCTTATG GCATATGCGT TACCCGTTAT CCGATGGATC AGGGCATTTA ATCGTCGCCA CCACCCGTCC AGAAACCATG TTTGGCGATC AAGCCGTTGC TGTTCATCCG GATGATGAAC GCTATCAACA CTTAATAGGT CAAACCATTA CCCTTCCTCT TGTCGGTCGC GAAATTCCAA TCATCGCAGA TGATTATGTG GAAATGGATT TTGGAACAGG TTGCGTCAAA ATCACCCCTG CGCATGACTT TAATGACTAT GAAATGGGGA AACGTCATAA TCTTCCGATG CTGAATGTCA TGACCATTGA TGCGGCCATG AATGAAGAAG TGCCTGAAAA GTATCAAGGG TTGGATCGTT TTGAGGCCAG AAAACAAGTC ATTGCGGATC TAGAAGTTCA AGATTTAATG GAAAAAATTG TACCGCATAA ATTAATGGTG CCTCGCGGTG ATCGTTCGCA TGCCGTCATC GAACCTTTTT TAACGGATCA ATGGTATGTT GCGGTAAACG AGCTTGCGAA ACCGGCCATT GATGCGGTTA AAAACGGAGA CATTGAATTT GTTCCTAAAA ACTGGGAAAA CACTTATTTT GAATGGATGA ATAATCTTCA AGATTGGTGT ATCTCGCGAC AAATTTGGTG GGGCCATCGA ATCCCAGCTT GGTATGATGA AGGAGGCAAT GTTTATGTTG CTCGCTCTGA AGAAGAAGTC CGTGCTAAAT ATAACCTAGA TACGAGCCAT TCTCTCCAAC AGGATGACGA TGTGTTAGAC ACGTGGTTCA GTTCAGCGTT ATGGACGTTT TCAACTCTAG GATGGCCTGA AAAAACACCC GAACTTGAAA AATTCCATCC GACATCCGTA CTCGTTACCG GGTTTGACAT CATTTTCTTC TGGGTGGCAA GAATGATTAT GATGACACTT AAATTTACGG GTGAAGTACC GTTCAAACAA GTTTATGTTC ATGGTCTTGT TCGTGACAGT GAAGGCCAAA AGATGTCGAA GTCAAAAGGA AATGTACTCG ACCCAATTGA CTTGATCGAT GGTATTGATC TTGAAAGCCT GGTTGCCAAA AGAACCACTG GCATGATGCA GCCAGAAAAA GCCGCTCAAA TTGAAAAGAC AACACGCAAA CACTTTGGCG AAGGTATTGA GTCATATGGT ACAGATGCAC TGCGTTTCAC CTTTGCATCA CTGGCTTCTA CCGGACGAGA TATCCGTTTT GATTTAAACC GATGCGAAGG CTATCGAAAC TTCTGTAATA AGTTGTGGAA TGCAACGCGT TATGTACTTA TGAACACAGA AGAGAAAGAC ACCGGTACAG ATGAAACGCT TGATACCGAA CTGTCACTAG CGGATAAGTG GATTATTTCA AAACTGCAAA ACGTAGAAAT GGACGTTGCC AAGCACTTTG ATCAATATCG TTTTGATCTT GCAGCGCATA CCCTATATGA ATTCACATGG AATGAATATT GTGATTGGTA TTTAGAGTTG GTAAAACCCA TTTTAAATTC AAAAACAGCG ACAGAAGCTC AACAACGAGG TACTCGCCAA ACGCTGGTAA GAGTTTTAGA AACAATTTTG CGTTTATTAC ACCCTATTAC ACCCTACATC ACTGAAGAAG CTTGGCACAG TGTTGCATCA CTCGCTGGGA AAACCGGCGA TACAATCATG TTACAACCCT ATCCACAACC GAATGAAGCT CTGATTGATA CGGCATCGGA AAAAGAGCTG GAATGGGTTA AACACGTTAT CATGGGCGTT CGTAAAATTC GTTCGGAGAT GGATATTGCG CCAAGTAAAG CCTTACCTAT CCTTCTAACC AATCTGCAAG AGCAAGATAA AGTATGGCTG GAAAACAACC GCGTTTTCTT ACAAACCTTG GCGAAACTGG ACACTATAAC GTTATTGGAA AATGAAACAG AAGCGCCAGA ATCCGCCGTT GCATTGGTTG GCGAAATGAA AATTTTAATT CCAATGGCCG GCCTCATTGA TAAAGAAGCC GAACTGTCGC GCCTATCAAA AGAGATCAAA CGACTCGAAG GTGAAGTAAA ACGTTTCACA GGTAAATTAT CAAATGAAAG CTTTGTCTCT AAAGCACCCG AAGCCGTGGT TGAAAAAGAA AAGCAAAAGT TACAAGACAC CGAAATTGCA CTAAAAAACT TAAAGGATCA ATATGAAAAA ATTAGTCAAA TCTAA
|
Protein sequence | MEKHFDPNSI ETKWYQTWEK SGYFKPQTST TGEHYSIMIP PPNVTGSLHM GHAFQDTIMD TLIRYHRMRG DETLWQPGTD HAGIATQMVV ERQLAAKGLS RHDLGRDAFT QKIWEWKAES GGTITQQLRR LGASPDWSRE RFTMDDGLSD AVKEVFVQLH EEGLIYRGKR LVNWDPVLHT AVSDLEVLSE EEMGSLWHMR YPLSDGSGHL IVATTRPETM FGDQAVAVHP DDERYQHLIG QTITLPLVGR EIPIIADDYV EMDFGTGCVK ITPAHDFNDY EMGKRHNLPM LNVMTIDAAM NEEVPEKYQG LDRFEARKQV IADLEVQDLM EKIVPHKLMV PRGDRSHAVI EPFLTDQWYV AVNELAKPAI DAVKNGDIEF VPKNWENTYF EWMNNLQDWC ISRQIWWGHR IPAWYDEGGN VYVARSEEEV RAKYNLDTSH SLQQDDDVLD TWFSSALWTF STLGWPEKTP ELEKFHPTSV LVTGFDIIFF WVARMIMMTL KFTGEVPFKQ VYVHGLVRDS EGQKMSKSKG NVLDPIDLID GIDLESLVAK RTTGMMQPEK AAQIEKTTRK HFGEGIESYG TDALRFTFAS LASTGRDIRF DLNRCEGYRN FCNKLWNATR YVLMNTEEKD TGTDETLDTE LSLADKWIIS KLQNVEMDVA KHFDQYRFDL AAHTLYEFTW NEYCDWYLEL VKPILNSKTA TEAQQRGTRQ TLVRVLETIL RLLHPITPYI TEEAWHSVAS LAGKTGDTIM LQPYPQPNEA LIDTASEKEL EWVKHVIMGV RKIRSEMDIA PSKALPILLT NLQEQDKVWL ENNRVFLQTL AKLDTITLLE NETEAPESAV ALVGEMKILI PMAGLIDKEA ELSRLSKEIK RLEGEVKRFT GKLSNESFVS KAPEAVVEKE KQKLQDTEIA LKNLKDQYEK ISQI
|
| |