Gene Nham_1828 details

Gene Information

Locus tag: Nham_1828
Symbol: valS
ID: 4033005
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrobacter hamburgensis X14
Kingdom: Bacteria
Replicon accession: NC_007964
Strand:
Start bp: 2035123
End bp: 2037987
Gene Length: 2865 bp
Protein Length: 954 aa
Translation table: 11
GC content: 63%
IMG OID: 637970300
Product: valyl-tRNA synthetase
Protein accession: YP_577102
Protein GI: 92117373
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0525] Valyl-tRNA synthetase
TIGRFAM ID: [TIGR00422] valyl-tRNA synthetase
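
The coordinate and length fields above are consistent with one another, which can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; neither is stated in the record, and the function name `cds_lengths` is illustrative only.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1  # inclusive range, so add 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # the last codon is the stop, not a residue
    return gene_len, protein_len


print(cds_lengths(2035123, 2037987))  # (2865, 954)
```

Under those assumptions the result matches the Gene Length (2865 bp) and Protein Length (954 aa) fields.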


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.242328
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATCGAAA AGACCTACCA GCCCGCTGAT ATTGAAAGCC GCATGTCCCG TATCTGGGAA 
GAGGCCGGCG CGTTCAAGGC GGGGCGGGCC GAGCGCCGCG ACGCCGAACC GTTCACGATC
GTTATCCCGC CGCCGAACGT CACCGGCTCG TTGCATATGG GTCACGCGCT CAACAACACG
TTGCAGGACG TGCTGTGCCG CTTCGAGCGG ATGCGCGGCC GCGACGTGCT GTGGCAGCCC
GGCACCGATC ACGCCGGCAT TGCGACCCAG ATGGTGGTGG AGCGGCAGTT GATGGAGCGG
AAGGAACCCG GCCGCCGCGA GATGGGTCGT GCCAGGTTTC TCGAGCGGGT TTGGCAGTGG
AAGGCTGAAA GCGGCGGCGT CATCGTCAAC CAGTTGAAGC GGCTCGGCGC GTCCTGCGAT
TGGTCGCGTG AGCGCTTCAC CATGGACGAA GGCCTGTCGC GTGCGGTTGC CAAGGTGTTC
GTCGAACTGC ACCGCGCCGG CTTGATCTAC AAGGACAAGC GGCTGGTCAA CTGGGACCCG
AAGCTGCTCA CCGCGATTTC CGATCTCGAG GTGAAACAGA TCGAGGTCAA GGGCAGCCTG
TGGCATCTGC GCTATCCGAT CGAGGGCAAG GGCTTCAATC CGGACGATCC GTCGACCTAT
ATCGTCGTCG CCACCACGCG GCCGGAAACC ATGCTGGGAG ACACGGCCGT CGCCGTGCAT
CCGGATAACG AGAAGCTTGA GCACCTGATC GGCAGCAACG TCGTGCTGCC ACTGGTCGGC
CGCCTCATTC CGATCATCGG CGACGACTAT GCCGATCCGG AAAAGGGCAC CGGCGCGGTC
AAAATCACCC CGGCCCACGA TTTCAACGAC TTCGAGGTCG GAAAGCGCCA CCGCCTGCCG
CAGATCGGCG TGCTCGATCG GGAAGGGCGG TTGACGCTAT CCGACAACGA GGATTATCTG
CGCGGCCTGC CCGAAGGCGC GCTGATGCTG GCCGAGGAGC TTCACGGCAC CGACCGTTTC
GCCGCGCGCA AGGCGATCGT CGCACGCCTC GAAGATTTCG GCTTCCTCGC CAAGGTCGAG
CCGCACGCGC ACATGGTGCC GCACGGCGAC CGCTCCGGTG CTGTCATCGA ACCCTACCTG
ACGGATCAAT GGTACGTCGA CGCGAAAGAA CTCGCGCAGC CGGCGATGGC GGCCGTGCGT
TCGGGTGATA CGACGTTCGT GCCGAAGAAC TGGGAGAAAA CCTACTTCGA GTGGATGGAA
AACATCCAGC CGTGGTGCAT CTCGCGCCAG CTCTGGTGGG GTCACCAGAT TCCGGCCTGG
TATGGTCCGG ACGGCAAGGT GTTTGTCGCG GAGACAGAGG ACGAAGCGGT CGGCCATGCG
CTCGGCTATT ACGCAGAGCA GGGTGTCATC TCCGTAGACG AGGGCGCCGA AATGGCGCGC
GATCCGGCAA AGCGTGAGGG CTTCATCACG CGAGACGAAG ACGTGCTCGA CACCTGGTTC
TCCTCGGGGC TGTGGCCGTT CTCGACGCTT GGCTGGCCGG ACGAGACGCC GGAGTTGAAG
CGTTACTATC CGACCAACGC GCTGGTCACC GGTTTCGACA TCATCTTCTT CTGGGTCGCC
CGGATGATGA TGATGGGCAT CTACTTCATG AGGGAGGTGC CGTTCTCGAC GGTCTACATC
CACGCGCTCG TCCGCGACGA GAACGGCGCC AAGATGTCGA AGTCGAAGGG CAACGTCATC
GATCCCCTGC ATCTGGTCGA CAAATACGGC GCCGATGCGT TGCGCTTCAC GCTCGCGGCG
ATGGCGGCGC AGGGCCGCGA CATCAAGCTG TCGCCGCAGC GGGTCGAAGG TTATCGTAAT
TTCGCGACCA AGCTCTGGAA CGCCTGCCGT TTCGCGGAAA TGAACGATTG CGTCGTTCCT
GCGCAGTTCG ATCCGACCGT CGCGAAAGAA ACGCTCAACC GATGGATCGT TCATGAGACC
GCGCGTGCCA CGTGCGAGAT CACCGAGGCT ATCGAAGCCT ATCGTTTCAA TGACGCCGCC
GGCGCGATCT ATCGCTTCGT CTGGAATGTC TACTGCGACT GGTATCTAGA GCTCGCAAAG
CCGGTGATGA TGGGTGAGGA CGGTCCCGCC AAGTCCGAGA CCCGTGCCAT GGTCGCCTGG
GCTCGCGATG AAATCCTGAA GCTGTTGCAC CCGTTCATGC CGTTCCTCAC CGAGGAATTA
TGGACGGCGA CATCGACACG GACGCAACTG CTGACGCTGA CGCCATGGCC GATCAAGGTC
GGTCTCACCC GCGAGCAGCA TGCGTCGATC GCGGCCGCCG CCGCCGATCC CTTCGCCGCG
CCAGAGCCGC TCGCCGATCC ATTGGAGCCG GCGTTTCGCG ACGATGCGGC GGAGGCCGAG
ATCGGCTGGG TGGTCGATCT GGTCACCGCC ATCCGCTCGG TCCGCGCGGA GATGAATATT
CCGCCCGCAA CGCTGACGCC GCTGGTGCTC GCAGGCGTAT CCGCCGAGAG CGAGGCGCGG
GCGCAGCGCT GGAGCGACGT CGTCAAGCGA ATGTCCCGGC TCGCAGACAT CTCATTCGCA
GACCATGTGC CACCAGGCGC CGTGCAACTG CTGATCCGCG GCGAAGTCGC CGCGCTGCCG
TTGAAAGGCA TCGTCGATGT TGCCGCCCAG CGCGTGCGTT TGGGGAAGGA AATCGCCAAG
GCCGATGCCG ACATCGATCG CGTCGATTTC AAACTCGCCA ACGAGAAATT CCTCGCCAAC
GCGCCCGAAG AGATCGTCGA GGAAGAAAAA GACAAGCGGG AAGCCGCCGT TGCGCGCAAA
GCGAAATTCG TCGAGGCGCT GGAGCGTTTG AAGGCTGCTG AGTGA
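
The gene sequence is printed in space-separated blocks of 10 bases, so the spacing has to be stripped before any analysis. The following is a minimal sketch of how a GC percentage like the one in the Gene Information section could be recomputed from this listing. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the exact rounding used by the database is not known.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row of the gene sequence, copied from the listing above.
first_row = "ATGATCGAAA AGACCTACCA GCCCGCTGAT ATTGAAAGCC GCATGTCCCG TATCTGGGAA"
seq = clean_sequence(first_row)
print(len(seq))  # 60
print(gc_percent(seq))
```

Passing the whole listing through `clean_sequence` and then `gc_percent` gives the value for the complete gene.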
 
Protein sequence
MIEKTYQPAD IESRMSRIWE EAGAFKAGRA ERRDAEPFTI VIPPPNVTGS LHMGHALNNT 
LQDVLCRFER MRGRDVLWQP GTDHAGIATQ MVVERQLMER KEPGRREMGR ARFLERVWQW
KAESGGVIVN QLKRLGASCD WSRERFTMDE GLSRAVAKVF VELHRAGLIY KDKRLVNWDP
KLLTAISDLE VKQIEVKGSL WHLRYPIEGK GFNPDDPSTY IVVATTRPET MLGDTAVAVH
PDNEKLEHLI GSNVVLPLVG RLIPIIGDDY ADPEKGTGAV KITPAHDFND FEVGKRHRLP
QIGVLDREGR LTLSDNEDYL RGLPEGALML AEELHGTDRF AARKAIVARL EDFGFLAKVE
PHAHMVPHGD RSGAVIEPYL TDQWYVDAKE LAQPAMAAVR SGDTTFVPKN WEKTYFEWME
NIQPWCISRQ LWWGHQIPAW YGPDGKVFVA ETEDEAVGHA LGYYAEQGVI SVDEGAEMAR
DPAKREGFIT RDEDVLDTWF SSGLWPFSTL GWPDETPELK RYYPTNALVT GFDIIFFWVA
RMMMMGIYFM REVPFSTVYI HALVRDENGA KMSKSKGNVI DPLHLVDKYG ADALRFTLAA
MAAQGRDIKL SPQRVEGYRN FATKLWNACR FAEMNDCVVP AQFDPTVAKE TLNRWIVHET
ARATCEITEA IEAYRFNDAA GAIYRFVWNV YCDWYLELAK PVMMGEDGPA KSETRAMVAW
ARDEILKLLH PFMPFLTEEL WTATSTRTQL LTLTPWPIKV GLTREQHASI AAAAADPFAA
PEPLADPLEP AFRDDAAEAE IGWVVDLVTA IRSVRAEMNI PPATLTPLVL AGVSAESEAR
AQRWSDVVKR MSRLADISFA DHVPPGAVQL LIRGEVAALP LKGIVDVAAQ RVRLGKEIAK
ADADIDRVDF KLANEKFLAN APEEIVEEEK DKREAAVARK AKFVEALERL KAAE
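
The protein sequence can be cross-checked against the gene sequence by translating codon by codon. The sketch below builds the codon table from the compact 64-letter amino acid string for the standard genetic code. Translation table 11 assigns the same amino acids to codons as the standard code and differs in which codons may act as start codons; since this gene begins with ATG, that difference does not matter here. The function name `translate` is illustrative, and the sketch does no handling of ambiguous bases.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()  # drop the block spacing
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
print(translate("ATGATCGAAA AGACCTACCA GCCCGCTGAT"))  # MIEKTYQPAD
```

The output matches the first block of the protein sequence, MIEKTYQPAD.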