Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cphamn1_2036 |
Symbol | valS |
ID | 6375729 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides BS1 |
Kingdom | Bacteria |
Replicon accession | NC_010831 |
Strand | - |
Start bp | 2193396 |
End bp | 2196152 |
Gene Length | 2757 bp |
Protein Length | 918 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 642684527 |
Product | valyl-tRNA synthetase |
Protein accession | YP_001960427 |
Protein GI | 189500957 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0525] Valyl-tRNA synthetase |
TIGRFAM ID | [TIGR00422] valyl-tRNA synthetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.895366 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGACAT TGTCGGAAGG CGATGTCTGC ATTTACGTAT GGCATCATAA GCGAGTAATG AGTTCAAACC TCGAAAAGAC CTATAATCAT CATGATGTGG AGGGCCGCTG GAACAGCGCG CACTGGGAGT CTCTCGGTAC CTTCGATGCG GAGAGTTCAC GAGTGATCGA AGGGGACAAG GCATCCTATA CGGTGCTGAT GCCGCCGCCT AACGTTACCG GCAGCCTGAC CCTCGGCCAT GTTCTTAACC ACACCCTGCA GGATATATTT ATTCGTTATC AGCGCATGTC AGGACGAGAA GCGCTCTGGC TGCCGGGTAC CGATCATGCC GGTATCGCGA CACAGACCGT GGTTGAAAAA AAGCTTCGTA AAGACGGGGT GTCGCGTTAC GACCTGGGCC GCAAGGAGTT TCTTGAACAT GTATGGAAAT GGCGGGATGA GTACGGCGGG CTTATTCTCA AACAGCTTCG GACTCTTGGT ATTTCCTGTG ACTGGCGGCG TAATCTTTTC ACCATGGACG AAGGCGCATC CAGTGCGGTT CGTCACGCTT TTGTCTCTCT CTACAGGGAT GGTTTGATCT ATCGGGGAAA ACGGATCATC AACTGGTGCC CTGTCTCGCA GACAGCGCTT TCCGATGAAG AGGTGATCAT GAAACCCCGG AAGGACAAGC TGGTATTTGT CCGTTATGCC CTGGCCAAAA GACCGGGAGA GTATATCACG ATAGCCACTG TCAGGCCGGA AACCATTCTG GCGGATGTTG CCATCGCGGT CAATCCGGAA GATCCGAGGT ATCGTGATCT CGTTGGAGAG CTGGCCGTCG TCCCGATTGC CGGACGGCAT ATTCCCGTCA TTGCCGATGA GTATGTCGAT ATTGATTTCG GGACAGGTGC GTTGAAAATC ACGCCGGCAC ACGATCCGAA TGACTTTGAG GTCGCGGGCC GTCATAACCT TCCTGTTCTC TCGGTTGTCG GGCGTGACGG CAAAATGATC GATGAATTCG GGTACGCGGG TCTTGACCGT TTTGAGGCGA GGGAGAGGAT TACAGAGGAT CTTGAAAGCA GGGGTAATCT CGTCAAGGTC GAGGAGTACG AGCATAATGT GGGATACTCT GAAAGAGCGG ACGTTGTCGT GGAACCCTAT CTTTCCGAGC AGTGGTTTGT CACCATGAAG CCTCTTGCCG AAAAAGCCCT TGAGGTGGTC AATGACGGCC AGATACGGTT TCATCCGTCA CACTGGATCA GCACCTATCG GCACTGGATG GAAAATATCC GGGACTGGTG TATTTCACGC CAGCTCTGGT GGGGACACAG GATACCCGCA TGGTATGACC AGGAAGGAAA AATCTGGGTT GCGGAATCTT ATGAAGATGC CTGCCATCTT GCCGGAACAG ATAAGCTTGT TCAGGATGAG GATGTACTGG ATACCTGGTT TTCATCGTGG CTCTGGCCGC TGACGACACT GGGCTGGACC GGCAAGGATG AGGACAATAA GGATATTCAG GCATTTTATC CGACCGATAC TCTTGTGACA GGGCCGGATA TCATTTTCTT CTGGGTTGCC CGGATGATCA TGGCCGGTTT GTATTTCAAG GGGGACGTAC CGTTCAGGGA TGTTTACTTT ACCAGCATCA TCAGGGACAT GAAAGGCAGA AAACTTTCCA AATCCCTCGG CAATTCCCCC GATCCCCTGA AAGTTATCGA TGCCTATGGG ACCGATGCGC TTCGCTTTAC CATTGTCTAC ATTGCTCCTC TTGGACAGGA TGTGCTGTTT GGTGAAGAGA AATGCGAGTT CGGCCGGAAT TTTGCCACCA AGATATGGAA CGCCGCCCGG CTTGTGTTCA TGCAGCGGGA AAAAGTGTTC GGCTCTTCTG ACGCTTTCAG CGATCTCTAT CAGGGTTTTA CTCCTGAGTC CGCCTCTCTG TCCGATGCCC AGCAATGGAT AACGGGTCGT TACCAGGAAA TGCTCGGTTC CTATCACAAA GCGTTCGCGC AGTTCAGAAT GAACGATATC ACCAGGATTG TCTATGACTT TTTCCGTGGC GATTATTGTG ACTGGTACCT GGAAACCCTG AAAATCGAGT TGGACGGAGA GCGCGACGAG ACGAAAAACC GTCATGCGGT ATGTCTTGCC GTCTACCTTT TAGAGGGTAT ACTGAAGGTG CTCCATCCTG TCATGCCGTT TATCACGGAA GAAATCTGGC ATCACATTCT CGAACGTGAC AGCAGTGAAA GCATCGCTCA TGCAGCCATG CCTCAGGCAG TGGAGGGATG GTCGGACGAT ATGCCGAAGC ACTTCTCTTC CGTTCAGAAA ATCATATCGG AAACACGAAG CCTTCGTGCG GTTTTCGGTG TTCCGCACGA TATGAAAGCA AGCCTGGTTG TCAGTGCTCC TGAAGAGTCT GCAAGGAGGA TAGTTACCGC CAACGCTCAC ATCGTGACCA GAATGACCGA CTGCAGCGTC ACTATTGAAG ATGGAGATCT GCCGCGTCCG CCCCATTCGG CAAGTTCCGT GGTTGACGGG AATGAACTTT TTATGCCTCT TGAAGGGTTG ATTTCGTTTG AAAAAGAGAT TGCCCGACTG GAGAAAGAGG CCGACAATAT TTCTTCATAT GTTACACGCA TGGAGAAAAA ACTTTCCAAT AAAGGTTTTG TCGATAACGC CCCGGATGAG GTGATTGAAG GTGAGCGTTC AAAGCTGGCC GATGCGAAAA GCAATCTCGG CAAAGTGCAG GCCAGTCTGG AAGTTCTCTG CAACTGA
|
Protein sequence | MQTLSEGDVC IYVWHHKRVM SSNLEKTYNH HDVEGRWNSA HWESLGTFDA ESSRVIEGDK ASYTVLMPPP NVTGSLTLGH VLNHTLQDIF IRYQRMSGRE ALWLPGTDHA GIATQTVVEK KLRKDGVSRY DLGRKEFLEH VWKWRDEYGG LILKQLRTLG ISCDWRRNLF TMDEGASSAV RHAFVSLYRD GLIYRGKRII NWCPVSQTAL SDEEVIMKPR KDKLVFVRYA LAKRPGEYIT IATVRPETIL ADVAIAVNPE DPRYRDLVGE LAVVPIAGRH IPVIADEYVD IDFGTGALKI TPAHDPNDFE VAGRHNLPVL SVVGRDGKMI DEFGYAGLDR FEARERITED LESRGNLVKV EEYEHNVGYS ERADVVVEPY LSEQWFVTMK PLAEKALEVV NDGQIRFHPS HWISTYRHWM ENIRDWCISR QLWWGHRIPA WYDQEGKIWV AESYEDACHL AGTDKLVQDE DVLDTWFSSW LWPLTTLGWT GKDEDNKDIQ AFYPTDTLVT GPDIIFFWVA RMIMAGLYFK GDVPFRDVYF TSIIRDMKGR KLSKSLGNSP DPLKVIDAYG TDALRFTIVY IAPLGQDVLF GEEKCEFGRN FATKIWNAAR LVFMQREKVF GSSDAFSDLY QGFTPESASL SDAQQWITGR YQEMLGSYHK AFAQFRMNDI TRIVYDFFRG DYCDWYLETL KIELDGERDE TKNRHAVCLA VYLLEGILKV LHPVMPFITE EIWHHILERD SSESIAHAAM PQAVEGWSDD MPKHFSSVQK IISETRSLRA VFGVPHDMKA SLVVSAPEES ARRIVTANAH IVTRMTDCSV TIEDGDLPRP PHSASSVVDG NELFMPLEGL ISFEKEIARL EKEADNISSY VTRMEKKLSN KGFVDNAPDE VIEGERSKLA DAKSNLGKVQ ASLEVLCN
|
| |