Gene Jann_3477 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagJann_3477 
SymbolvalS 
ID3935951 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameJannaschia sp. CCS1 
KingdomBacteria 
Replicon accessionNC_007802 
Strand
Start bp3528059 
End bp3531148 
Gene Length3090 bp 
Protein Length1029 aa 
Translation table11 
GC content62% 
IMG OID637905851 
Productvalyl-tRNA synthetase 
Protein accessionYP_511419 
Protein GI89055968 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0525] Valyl-tRNA synthetase 
TIGRFAM ID[TIGR00422] valyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0964875 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAAAGA CCTTCAACGC GGCCGAAGCC GAAGCCCGCA TCTACGCCAA ATGGGACCGG 
TCCGGTGCCT TCAAGGCCGG TGCCAATGCC AAGCTGGGCG CGGAGCCCTT TTCCATCATG
ATCCCGCCCC CCAATGTGAC CGGCGTCCTG CACGTCGGCC ACGCCTTCAA CAACACGCTG
CAAGACGTCC TGACCCGCTG GCACCGGATG CGTGGCTTCG ACACGCTCTG GCAACCCGGC
ACGGATCATG CGGGCATCGC CACGCAGATG GTGGTGGAGC GTGAACTGGC GAAGTCCCAG
ATCCAGCGCA AGAACCTGAC GCGCGAGGAA TTCCTGGCCC ATGTGTGGGA GTGGAAGGCC
AAATCCGGCG GCACCATCCG CGAGCAACTC AAGCGCCTGG GCGCGTCGTG TGATTGGTCG
CGCGAAGCCT TCACCATGTC CGGCGCCCCC GGCGCGCCTG AGGGGGAGGA GGGCAATTTC
CACGACGCCG TCATCAAGGT CTTCGTGGAG ATGTACAACA AAGGCTACAT CTATCGCGGC
AAACGGCTGG TGAACTGGGA TCCGCATTTT GAGACGGCGA TTTCCGATCT GGAGGTCGAG
AATATCGACC AGCCGGGCCA CATGTGGCAC TTCAAATACC CCCTCGCAGG TGGCGCAACC
TACGAATACG TCGAGACGGA CGAGGACGGG GTCGAGACCC TGCGCGAGAC CCGCGACTAC
ATCTCCATCG CCACCACGCG GCCGGAAACC ATGCTCGGCG ACGGCGCGGT CGCGGTCCAC
CCCGATGATA CACGCTATGC AGCCATCGTC GGCCAGCTCT GCGAAATTCC TGTCGGTCCC
AAGGAACACC GCCGCCTGAT CCCGATCATC ACCGATGACT ACCCCGATCC CTCATTCGGC
TCGGGCGCGG TGAAGATCAC CGGCGCCCAT GACTTCAACG ATTACGAGGT CGCCAAGCGC
GGCAACATTC CCTGCTATCG CCTGCTCGAC ACCAAAGGCG CCTTGCGCGA TGACGGCGCG
CCCTATGCCG AGGCTGCCGC CATTGCCCAA GCCGTCGCCA ACGGAGACGC CACTCTGGGC
GACATGGACG TCGACGCCCT CAACCTCGTC CCCGACCACC TGCGCGGCCT CGACCGGTTC
ATGGCGCGGG AGCGCATTGT CGAAGAGATC ACCGCCGACG GCCTCGCGGT CATGACCACA
TCCGACGACC CGCGCTTGGG GGCGAAGCCC AAGAAGAAAA GCGGCGAGGA CGCGCAAGCG
GAGGAGCCGC CCGAGACCGA AAAACCACTG GTGCCGTTGG TGGAGGCGAA GCCGATCACC
CAGCCCTTCG GTGACCGCTC CAAGGTCGTG ATCGAGCCGA TGCTGACCGA TCAATGGTTC
GTCGACACCG CCAAAATCGT CGAACCCGCC CTCAATGCTG TCCGCTCCGG CCGCACGAAA
ATCATCCCCG AACAGCACCG CAAGGTGTAC TTCCACTGGC TGGAGAATAT CGAGCCGTGG
ACAATCTCCC GCCAACTCTG GTGGGGTCAT CAGATCCCGG TCTGGTATGC CGATGAAATG
GAGAACGGCG AAGTAGTGAA TGCCGGACCG ATGTTCTGTG CCGCGACAGA GCAAGAAGCA
CTTCAAGCAG CTCAAAGCCA CTATGGTCCG AAGCGAGTGG TCTTGCCCGA GAGCGTTGTC
CAGCAATCGG GCGGTCGGCT TAAAATGGAC GAGATGAAGA CTGCCAAGGG CAACGTCGAG
ATTAGCCTTA ACCTTGAGAA GTCCAACTAC GTGAAACTGC GTCGTGACCC GGACGTCCTC
GACACGTGGT TCTCCTCCGG CCTCTGGCCC ATCGGCACCC TCGGCTGGCC CGAGCAAACC
CCGGAACTCG ACCGTTACTT CCCGACCTCC ACGCTCATCT CCGGCTTTGA CATCATCTTC
TTCTGGGTCG CCCGGATGAT GATGATGCAG CTGGCCGTTG TGGACGACGT CCCCTTCCGT
GACGTCTACG TCCACGCCCT CGTCCGCGAC GAGAAGGGGC GCAAGATGTC GAAATCCATC
GGCAACGTCC TCGATCCCTT GGAACTCATC GACGAATATG GCGCCGATGC CCTGCGCTTC
ACGCTCACCG CCATGGCCGC CATGGGCCGC GACCTGAAAC TCTCCACCGA CCGCATCCAG
GGCTACCGCA ACTTCGGCAC CAAACTCTGG AACGCCGCCC GGTTCGCGGA GATGAACGAA
TGCGTGCCCG ACCCGGCCTT CGACCCCAAA TCCCCCACCC AGACCGTCAA CAAGTGGATC
ATCACGGAAA CCGCCCGTGC CCGCATCGCC CATGATGAGG CCCTTGAAAA CTACCGTTTC
AATGACGCGG CGGGGGGGCT CTACCAATTC GTCTGGGGCA AGGTCTGCGA CTGGTATCTG
GAATTTGCCA AGCCGCTTTT CGCCTCCGGT GACGACGCCG TCATTGCGGA AACCCGCGCC
ACGATGGCCT GGGTGATCGA CCAATGCCTG ATCCTGCTGC ACCCGACCAT GCCCTTCATC
ACCGAGGAAT TGTGGAGCGA AATCGCCACC CGTGACACCC TGCTGGTCCA TGCCGACTGG
CCGACCTACG GCGCGGAATT CACGGACATC GCCGCAGAAA AAGAGATGTC CTGGGTCATC
GCCCTGATCG AATCCATCCG CTCGGCCCGC CAACAGATGC ACGTCCCCGC AGGCCTGAAA
GTCCAGCTTC TGCAATCGGA TCTGGACGCG GCGGGTCAGG CGGCTTTCGA CACCAACCAA
GCCATGATCA CCCGCCTCGC GCGCCTCTCG GAGGTCACGC CAACCGACGC CTTCCCCAAA
GGCACCGTTA CCATCGCCGT GGATGGCGGC ACCTTCGGCC TGCCCATCGC CGATCTGATC
GACGTGGACG AGGAAAAGGC CCGTCTGGAA AAGACCCTCG GCAAACTCGC CAAGGAACTC
GGCGGGTTGC GTGGGCGTCT GAACAACCCC AAATTCGCCG AAAGCGCACC CGCGGAGGTG
GTGGAGGAAA CCAAAGCCAA CCTCAAAGCC CGGGAAGAGG AAGAGGCGCG CCTGAAACAG
GCCCTCGCGC GGTTGGCCGA GGTCGGTTAG
 
Protein sequence
MEKTFNAAEA EARIYAKWDR SGAFKAGANA KLGAEPFSIM IPPPNVTGVL HVGHAFNNTL 
QDVLTRWHRM RGFDTLWQPG TDHAGIATQM VVERELAKSQ IQRKNLTREE FLAHVWEWKA
KSGGTIREQL KRLGASCDWS REAFTMSGAP GAPEGEEGNF HDAVIKVFVE MYNKGYIYRG
KRLVNWDPHF ETAISDLEVE NIDQPGHMWH FKYPLAGGAT YEYVETDEDG VETLRETRDY
ISIATTRPET MLGDGAVAVH PDDTRYAAIV GQLCEIPVGP KEHRRLIPII TDDYPDPSFG
SGAVKITGAH DFNDYEVAKR GNIPCYRLLD TKGALRDDGA PYAEAAAIAQ AVANGDATLG
DMDVDALNLV PDHLRGLDRF MARERIVEEI TADGLAVMTT SDDPRLGAKP KKKSGEDAQA
EEPPETEKPL VPLVEAKPIT QPFGDRSKVV IEPMLTDQWF VDTAKIVEPA LNAVRSGRTK
IIPEQHRKVY FHWLENIEPW TISRQLWWGH QIPVWYADEM ENGEVVNAGP MFCAATEQEA
LQAAQSHYGP KRVVLPESVV QQSGGRLKMD EMKTAKGNVE ISLNLEKSNY VKLRRDPDVL
DTWFSSGLWP IGTLGWPEQT PELDRYFPTS TLISGFDIIF FWVARMMMMQ LAVVDDVPFR
DVYVHALVRD EKGRKMSKSI GNVLDPLELI DEYGADALRF TLTAMAAMGR DLKLSTDRIQ
GYRNFGTKLW NAARFAEMNE CVPDPAFDPK SPTQTVNKWI ITETARARIA HDEALENYRF
NDAAGGLYQF VWGKVCDWYL EFAKPLFASG DDAVIAETRA TMAWVIDQCL ILLHPTMPFI
TEELWSEIAT RDTLLVHADW PTYGAEFTDI AAEKEMSWVI ALIESIRSAR QQMHVPAGLK
VQLLQSDLDA AGQAAFDTNQ AMITRLARLS EVTPTDAFPK GTVTIAVDGG TFGLPIADLI
DVDEEKARLE KTLGKLAKEL GGLRGRLNNP KFAESAPAEV VEETKANLKA REEEEARLKQ
ALARLAEVG