Gene Information |
Locus tag | Acid345_0132 |
Symbol | |
ID | 4071720 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 139622 |
End bp | 142411 |
Gene Length | 2790 bp |
Protein Length | 929 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637982132 |
Product | valyl-tRNA synthetase |
Protein accession | YP_589211 |
Protein GI | 94967163 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0525] Valyl-tRNA synthetase |
TIGRFAM ID | [TIGR00422] valyl-tRNA synthetase |
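
The coordinate, gene length, and protein length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(139622, 142411)
print(length)                     # → 2790
print(protein_length_aa(length))  # → 929
```

Both results agree with the Gene Length and Protein Length rows of this record.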
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0431753 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGCACG AGCTTCCTAA GGCATACGAA CCATCTGCGA TCGAGCATCG CTGGGCCGAG TACTGGGTCC AGGAAAAGCT GTATCACGTC GAAACCCCCG CCGAGAACGA CCACACGCCG ACGTTCACAT TGTTGCTGCC GCCGCCAAAC GTAACCGGGC GACTGCACAT GGGGCACATG CTGAACCACA CGGAGATGGA CATCATCATC CGATGGCGGC GCATGCGTGG CGAGCGGACG CTGTGGCTAC CGGGCACGGA CCATGCGGGC ATCGCAACGC AAATGATGGT GGAGCGACAA CTGGCGACGG AGGGAAAGAA CCGTCGCGAG ATTGGACGTG AGAAGTTCCT GGAACGCGTT TGGGAGTGGA AGAAGGAGTA TGGCGGGGCA ATCACGTCGC AGATGCGTCG GATTGGCGAC TCGGTGGATT GGGACCGCGA ATACTTCACG ATGGACGACC ATCTTTCGGT CGCGGTGAGG GAAGCGTTTG TTCGGCTGTA TGAGCAAGGG CTGGTGTATC GCGGCAAGTA CATTGTGAAT TGGTGTCCGC GTTGCGGGAC GGCGATTTCG GACCTGGAAG TAGCGCACGA AGAGACACAG GGAAAGCTGT GGGAGATTCG GTACCCGGTT GAGGGTACGG ATGAGTCGAT CGTGGTGGCG ACGACGCGGC CGGAGACGAT GCTGGGCGAT ACCGCGGTTG CGGTGAATCC GAAGGACGAG CGTTACACGC ACCTGCACGG CAAGATGGTG CGACTGCCGT TGATGGACCG GCTGATTCCG ATCATCCTCG ACGAGCTCGC GCAGCCAGAG TTTGGGACGG GCGCGGTGAA GGTGACGCCG GCGCACGATC CAAACGATTT CCAGGCGGGA CTGCGGCACA ACCTGCCGCA GATTGATGTA ATGGACGAGC ACGCCGTGAT GAACGAGAAC GCCGGCGGCT ATCAAGGGCT CGATCGCTAC GAAGCGCGTG AGCAGATCGT GAACGACCTG CAGGCACAGG GGTTCCTGGT TGGGATCAAG GACCACACGC TGGCGCTGGG GAAATGCAGC CGCTGCAAAA CGATCGTGGA GCCGCGGCTA TCGACGCAGT GGTTCGTGGC GGTTAACAAG AAGCCAAATC ACGGCGGCAT GAGTTGGGCC GAGGCGGCGA TTGCGGCGGT GGAGCAGGGA CACATTCGCT TCACGCCGGA GAATTACAAA CCGATCTTCC TGCAATGGAT GCGCAACATC TACGACTGGT GCATTTCGCG GCAGTTGTGG TGGGGGCATC GCATTCCGGC GTGGTACTGC GAGGAGTGCA AGGAAGTCAC GGTCGCGCGC ACCACGCCCG AGGAGTGTTC GAAGTGCGGC GGCACGAAGC TGGATCAAGA CAATGACGTG CTCGATACTT GGTTCTCGTC GGGCATGTTG CCGTTTACGA CCTTGGGGTG GCCGGAGAAG ACGCGCGACC TCGAGGTGTT TTATCCGACG TCGCTGCTGC TGACGGCATT CGACATCCTG TTCTTCTGGG TGGCGAGGAT GATCATGATG GGTTGCTACT TCATGAGCGG GCCGAACCGT CCGACGGAGA TCGCGGGCGG CAAGGAGAAC GAGCTCAAGG AGAGCATTCC GTTTAAAGAG GTCTATATCC ACTCGCTGGT GCGCGACGCC GAGCGGCAGA AGATGTCGAA GACGAAGGGC AACGTGGTGG ACCCGATCGA CGTGCTGAAC AAATTCGGCA CGGATGCGGT GCGCTTCACG CTGGCTTCAA TGGCGGCGCC GGGGACGGAC ATTGCGTTCA GCGAGAGCCG TACGGAGAGC 
TATCGGTCGT TTGCCAACAA GATTTGGAAC GCGGCGCGCT TCATCTTTAT GAACGTGGAC CGTGCGGCGG AGAAGGGCGT GTGGTCGCTC GAGGAATTCG CGAAGACACA GCCGAAGGGC GAGGGATTGC CCGGATTCTC GACCGAGACG CTGGAAGATC GCTGGATTCT TTCGCGCTTC AATAAGGTGG CGCGCGAGGT GAGCGAGGCA CTGGAGACCT ACCGGTTCCA CGAGGCTTCG CACGTGGTTT ATCACTTCTT CTGGGGCGAG TTCTGCGACT GGTATATCGA ACTGACGAAG CCGCGGCTGG AAGCGGACGA GGCGACGGCG CGGAAGACAT TCGCGAATCT GCTGGCGGTA TTTGAGGGCG CGCTGCGGTT GTTGTCGCCG TTTATGCCGT TCATCACCGA GGAGATTTGG CACGCGATTT ATGACGGCAA GCCACCGTTG AAGTCGATCT CGCTGGCGGA GTATCCGAAG GCGAATGTGG CGCAGATCAG CGATGAAGCT GAGACGGAGA TGGCGATCTT GCAGGACCTG ATCCAGGCAG TGCGCAATAT TCGATCGGAG ATTGCGGATA TCAAGGCGCA GCCGAAAATC AAGGCAGGGA TTGAAGTGTT TGCGACGGCG GAGATCCAGC AGTTGGTGGA GCGGAACCGC GGGGCTTTGG AGCGGCTGGC AAATGTGTCG GAAGTGAAGT TTGTTGGAGA GTCGCTGGCG AAGGCTTCAT TGGCGCGCAG CACGGCACGG TTCGAAGTAC GCGTGGTGTA TGAGCAGAAA GTGGATGTCG CAGCCGAGCG CGAACGGCTG TCGAAAGAGT TGAAGAAATT GGAAGGCGAG TTCGCGAACA ACCAGCGGCA ATTGGGGAAT GAGAACTTCC TGCAGAAGGC CCCGGCGAAA GTGGTGGAAG GATTGCGGAC CCGTGAAGGG GAGTTGAAGG TGCTGATTGA GAAGGCGCAG TCGGCGTTGA AGGGGTTGGA AGGAAAATAG
|
Protein sequence | MAHELPKAYE PSAIEHRWAE YWVQEKLYHV ETPAENDHTP TFTLLLPPPN VTGRLHMGHM LNHTEMDIII RWRRMRGERT LWLPGTDHAG IATQMMVERQ LATEGKNRRE IGREKFLERV WEWKKEYGGA ITSQMRRIGD SVDWDREYFT MDDHLSVAVR EAFVRLYEQG LVYRGKYIVN WCPRCGTAIS DLEVAHEETQ GKLWEIRYPV EGTDESIVVA TTRPETMLGD TAVAVNPKDE RYTHLHGKMV RLPLMDRLIP IILDELAQPE FGTGAVKVTP AHDPNDFQAG LRHNLPQIDV MDEHAVMNEN AGGYQGLDRY EAREQIVNDL QAQGFLVGIK DHTLALGKCS RCKTIVEPRL STQWFVAVNK KPNHGGMSWA EAAIAAVEQG HIRFTPENYK PIFLQWMRNI YDWCISRQLW WGHRIPAWYC EECKEVTVAR TTPEECSKCG GTKLDQDNDV LDTWFSSGML PFTTLGWPEK TRDLEVFYPT SLLLTAFDIL FFWVARMIMM GCYFMSGPNR PTEIAGGKEN ELKESIPFKE VYIHSLVRDA ERQKMSKTKG NVVDPIDVLN KFGTDAVRFT LASMAAPGTD IAFSESRTES YRSFANKIWN AARFIFMNVD RAAEKGVWSL EEFAKTQPKG EGLPGFSTET LEDRWILSRF NKVAREVSEA LETYRFHEAS HVVYHFFWGE FCDWYIELTK PRLEADEATA RKTFANLLAV FEGALRLLSP FMPFITEEIW HAIYDGKPPL KSISLAEYPK ANVAQISDEA ETEMAILQDL IQAVRNIRSE IADIKAQPKI KAGIEVFATA EIQQLVERNR GALERLANVS EVKFVGESLA KASLARSTAR FEVRVVYEQK VDVAAERERL SKELKKLEGE FANNQRQLGN ENFLQKAPAK VVEGLRTREG ELKVLIEKAQ SALKGLEGK
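
The gene and protein sequences above are printed in space-separated blocks of ten characters. As a rough sketch of how the two relate, the code below strips the spacing, translates the nucleotide sequence codon by codon, and computes the GC fraction. It assumes the amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in its permitted start codons, which this sketch does not model). All function names are hypothetical, and only the first 30 bases of the record are used in the example.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon.

    Ambiguous bases such as N are not handled and raise KeyError.
    """
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from this record.
prefix = "ATGGCGCACG AGCTTCCTAA GGCATACGAA"
print(translate(prefix))  # → MAHELPKAYE
```

The translated prefix matches the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would be one way to cross-check the Protein Length and GC content fields, provided the copied sequence is intact.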