Gene Information |
Locus tag | Jann_3477 |
Symbol | valS |
ID | 3935951 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Jannaschia sp. CCS1 |
Kingdom | Bacteria |
Replicon accession | NC_007802 |
Strand | - |
Start bp | 3528059 |
End bp | 3531148 |
Gene Length | 3090 bp |
Protein Length | 1029 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637905851 |
Product | valyl-tRNA synthetase |
Protein accession | YP_511419 |
Protein GI | 89055968 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0525] Valyl-tRNA synthetase |
TIGRFAM ID | [TIGR00422] valyl-tRNA synthetase |
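The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
# Coordinates copied from the record above (assumed 1-based, inclusive).
START_BP = 3528059
END_BP = 3531148


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 3090 1029
```

Both values match the Gene Length and Protein Length fields of the record.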
| |
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0964875 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAAAGA CCTTCAACGC GGCCGAAGCC GAAGCCCGCA TCTACGCCAA ATGGGACCGG TCCGGTGCCT TCAAGGCCGG TGCCAATGCC AAGCTGGGCG CGGAGCCCTT TTCCATCATG ATCCCGCCCC CCAATGTGAC CGGCGTCCTG CACGTCGGCC ACGCCTTCAA CAACACGCTG CAAGACGTCC TGACCCGCTG GCACCGGATG CGTGGCTTCG ACACGCTCTG GCAACCCGGC ACGGATCATG CGGGCATCGC CACGCAGATG GTGGTGGAGC GTGAACTGGC GAAGTCCCAG ATCCAGCGCA AGAACCTGAC GCGCGAGGAA TTCCTGGCCC ATGTGTGGGA GTGGAAGGCC AAATCCGGCG GCACCATCCG CGAGCAACTC AAGCGCCTGG GCGCGTCGTG TGATTGGTCG CGCGAAGCCT TCACCATGTC CGGCGCCCCC GGCGCGCCTG AGGGGGAGGA GGGCAATTTC CACGACGCCG TCATCAAGGT CTTCGTGGAG ATGTACAACA AAGGCTACAT CTATCGCGGC AAACGGCTGG TGAACTGGGA TCCGCATTTT GAGACGGCGA TTTCCGATCT GGAGGTCGAG AATATCGACC AGCCGGGCCA CATGTGGCAC TTCAAATACC CCCTCGCAGG TGGCGCAACC TACGAATACG TCGAGACGGA CGAGGACGGG GTCGAGACCC TGCGCGAGAC CCGCGACTAC ATCTCCATCG CCACCACGCG GCCGGAAACC ATGCTCGGCG ACGGCGCGGT CGCGGTCCAC CCCGATGATA CACGCTATGC AGCCATCGTC GGCCAGCTCT GCGAAATTCC TGTCGGTCCC AAGGAACACC GCCGCCTGAT CCCGATCATC ACCGATGACT ACCCCGATCC CTCATTCGGC TCGGGCGCGG TGAAGATCAC CGGCGCCCAT GACTTCAACG ATTACGAGGT CGCCAAGCGC GGCAACATTC CCTGCTATCG CCTGCTCGAC ACCAAAGGCG CCTTGCGCGA TGACGGCGCG CCCTATGCCG AGGCTGCCGC CATTGCCCAA GCCGTCGCCA ACGGAGACGC CACTCTGGGC GACATGGACG TCGACGCCCT CAACCTCGTC CCCGACCACC TGCGCGGCCT CGACCGGTTC ATGGCGCGGG AGCGCATTGT CGAAGAGATC ACCGCCGACG GCCTCGCGGT CATGACCACA TCCGACGACC CGCGCTTGGG GGCGAAGCCC AAGAAGAAAA GCGGCGAGGA CGCGCAAGCG GAGGAGCCGC CCGAGACCGA AAAACCACTG GTGCCGTTGG TGGAGGCGAA GCCGATCACC CAGCCCTTCG GTGACCGCTC CAAGGTCGTG ATCGAGCCGA TGCTGACCGA TCAATGGTTC GTCGACACCG CCAAAATCGT CGAACCCGCC CTCAATGCTG TCCGCTCCGG CCGCACGAAA ATCATCCCCG AACAGCACCG CAAGGTGTAC TTCCACTGGC TGGAGAATAT CGAGCCGTGG ACAATCTCCC GCCAACTCTG GTGGGGTCAT CAGATCCCGG TCTGGTATGC CGATGAAATG GAGAACGGCG AAGTAGTGAA TGCCGGACCG ATGTTCTGTG CCGCGACAGA GCAAGAAGCA CTTCAAGCAG CTCAAAGCCA CTATGGTCCG AAGCGAGTGG TCTTGCCCGA GAGCGTTGTC CAGCAATCGG GCGGTCGGCT TAAAATGGAC GAGATGAAGA CTGCCAAGGG CAACGTCGAG ATTAGCCTTA ACCTTGAGAA GTCCAACTAC GTGAAACTGC GTCGTGACCC GGACGTCCTC 
GACACGTGGT TCTCCTCCGG CCTCTGGCCC ATCGGCACCC TCGGCTGGCC CGAGCAAACC CCGGAACTCG ACCGTTACTT CCCGACCTCC ACGCTCATCT CCGGCTTTGA CATCATCTTC TTCTGGGTCG CCCGGATGAT GATGATGCAG CTGGCCGTTG TGGACGACGT CCCCTTCCGT GACGTCTACG TCCACGCCCT CGTCCGCGAC GAGAAGGGGC GCAAGATGTC GAAATCCATC GGCAACGTCC TCGATCCCTT GGAACTCATC GACGAATATG GCGCCGATGC CCTGCGCTTC ACGCTCACCG CCATGGCCGC CATGGGCCGC GACCTGAAAC TCTCCACCGA CCGCATCCAG GGCTACCGCA ACTTCGGCAC CAAACTCTGG AACGCCGCCC GGTTCGCGGA GATGAACGAA TGCGTGCCCG ACCCGGCCTT CGACCCCAAA TCCCCCACCC AGACCGTCAA CAAGTGGATC ATCACGGAAA CCGCCCGTGC CCGCATCGCC CATGATGAGG CCCTTGAAAA CTACCGTTTC AATGACGCGG CGGGGGGGCT CTACCAATTC GTCTGGGGCA AGGTCTGCGA CTGGTATCTG GAATTTGCCA AGCCGCTTTT CGCCTCCGGT GACGACGCCG TCATTGCGGA AACCCGCGCC ACGATGGCCT GGGTGATCGA CCAATGCCTG ATCCTGCTGC ACCCGACCAT GCCCTTCATC ACCGAGGAAT TGTGGAGCGA AATCGCCACC CGTGACACCC TGCTGGTCCA TGCCGACTGG CCGACCTACG GCGCGGAATT CACGGACATC GCCGCAGAAA AAGAGATGTC CTGGGTCATC GCCCTGATCG AATCCATCCG CTCGGCCCGC CAACAGATGC ACGTCCCCGC AGGCCTGAAA GTCCAGCTTC TGCAATCGGA TCTGGACGCG GCGGGTCAGG CGGCTTTCGA CACCAACCAA GCCATGATCA CCCGCCTCGC GCGCCTCTCG GAGGTCACGC CAACCGACGC CTTCCCCAAA GGCACCGTTA CCATCGCCGT GGATGGCGGC ACCTTCGGCC TGCCCATCGC CGATCTGATC GACGTGGACG AGGAAAAGGC CCGTCTGGAA AAGACCCTCG GCAAACTCGC CAAGGAACTC GGCGGGTTGC GTGGGCGTCT GAACAACCCC AAATTCGCCG AAAGCGCACC CGCGGAGGTG GTGGAGGAAA CCAAAGCCAA CCTCAAAGCC CGGGAAGAGG AAGAGGCGCG CCTGAAACAG GCCCTCGCGC GGTTGGCCGA GGTCGGTTAG
|
Protein sequence | MEKTFNAAEA EARIYAKWDR SGAFKAGANA KLGAEPFSIM IPPPNVTGVL HVGHAFNNTL QDVLTRWHRM RGFDTLWQPG TDHAGIATQM VVERELAKSQ IQRKNLTREE FLAHVWEWKA KSGGTIREQL KRLGASCDWS REAFTMSGAP GAPEGEEGNF HDAVIKVFVE MYNKGYIYRG KRLVNWDPHF ETAISDLEVE NIDQPGHMWH FKYPLAGGAT YEYVETDEDG VETLRETRDY ISIATTRPET MLGDGAVAVH PDDTRYAAIV GQLCEIPVGP KEHRRLIPII TDDYPDPSFG SGAVKITGAH DFNDYEVAKR GNIPCYRLLD TKGALRDDGA PYAEAAAIAQ AVANGDATLG DMDVDALNLV PDHLRGLDRF MARERIVEEI TADGLAVMTT SDDPRLGAKP KKKSGEDAQA EEPPETEKPL VPLVEAKPIT QPFGDRSKVV IEPMLTDQWF VDTAKIVEPA LNAVRSGRTK IIPEQHRKVY FHWLENIEPW TISRQLWWGH QIPVWYADEM ENGEVVNAGP MFCAATEQEA LQAAQSHYGP KRVVLPESVV QQSGGRLKMD EMKTAKGNVE ISLNLEKSNY VKLRRDPDVL DTWFSSGLWP IGTLGWPEQT PELDRYFPTS TLISGFDIIF FWVARMMMMQ LAVVDDVPFR DVYVHALVRD EKGRKMSKSI GNVLDPLELI DEYGADALRF TLTAMAAMGR DLKLSTDRIQ GYRNFGTKLW NAARFAEMNE CVPDPAFDPK SPTQTVNKWI ITETARARIA HDEALENYRF NDAAGGLYQF VWGKVCDWYL EFAKPLFASG DDAVIAETRA TMAWVIDQCL ILLHPTMPFI TEELWSEIAT RDTLLVHADW PTYGAEFTDI AAEKEMSWVI ALIESIRSAR QQMHVPAGLK VQLLQSDLDA AGQAAFDTNQ AMITRLARLS EVTPTDAFPK GTVTIAVDGG TFGLPIADLI DVDEEKARLE KTLGKLAKEL GGLRGRLNNP KFAESAPAEV VEETKANLKA REEEEARLKQ ALARLAEVG
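The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch, not the database's own pipeline: it builds the codon table by hand (translation table 11 assigns the same amino acids to codons as the standard code; its alternative start codons are not handled here, which does not matter for this gene because it starts with ATG). It assumes the listed gene sequence is already the coding strand, and it will raise a `KeyError` on ambiguous bases such as N. The name `translate` is illustrative.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGGAAAAGA CCTTCAACGC GGCCGAAGCC"))  # prints: MEKTFNAAEA
```

The output matches the first ten residues of the protein sequence, and the last twelve bases of the gene (`GAGGTCGGTTAG`) translate to `EVG` followed by a stop, matching the end of the protein.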