Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_0907 |
Symbol | |
ID | 9244752 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 1111215 |
End bp | 1114415 |
Gene Length | 3201 bp |
Protein Length | 1066 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | isoleucyl-tRNA synthetase |
Protein accession | YP_003678857 |
Protein GI | 297559883 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.236278 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCCAACG ACCCCCGGCC CGAATCCCGC GCGCTGCCCC AGCTGCCCGC GCAGATCGAC CTGCCCGCCA TGGAGCGCGA GATCCTCGGC CGCTGGTCGA AGGAGAACGT CTTCCAGCGC TCGCTCGACC AGACCCGCGG CGGCCCCAAC TGGGTCTTCT ACGAGGGCCC GCCCACCGCG AACGGCCAGC CCGGCGTGCA CCACGTCGAG GCCCGCGCCT TCAAGGACGT CTTCCCGCGC TTTCGCACCA TGCGCGGCTA CCACGTGGAC CGCAAGGCGG GCTGGGACTG CCACGGCCTG CCCGTCGAGG TCGCCGTGGA GAAGGAGCTG GGCCTGTCCG GCAAGAAGGA CATCGAGTCC TTCGGCATCG CCGAGTTCAA CGACCGCTGC CGCGAGTCCG TCCTGCGCAA CGTGGACGCC TTCACCTCCA TGACCGAGCG CATGGGCTAC TGGGTGAACA TGGACGACGC CTACCGCACC ATGGACCCCC AGTACGTGGA GTCGGTCTGG TGGGCGCTCA AGCAGATCTG GGACAAGGGG CTGCTGGTCC GCGACTACCG GATCAGCCCC TACTGCCCGC GCTGCGGCAC CACGCTGTCC GACCACGAGC TCGCCCAGGG GTACGAGACC GTCACCGACC CGTCGGTGTA CGTGCGCTTC CCGGTGACCT CGGGCCCGCT GGCCGCCGCC GAGCACCCCA CCTCGCTGCT GGTGTGGACG ACCACCCCCT GGACGCTGGT CTCCAACACC GCCGTGGCCG TGCACCCGGA CGTGGAGTAC GTGGTGGCCA CCGACGGCCA GGAGCGCCTG GTCGTGGCGC GTCCGCTGTT CGAGAAGGTC CTCGGCGAGG GCTGGGAGCT CACCGGCGAG AGCTTCGAGG GCGACGAGAT GGAGCGCTGG ACCTACGAGC GCCCCTTCGA CCTGGTGCCC TTCGACGAGC CCGCCCACTA CGTCGTGCTC GCCGACTACG TCACGGTCGA GGACGGCACC GGCCTGGTGC ACCAGTCGCC CGCCTTCGGC GCCGACGACA TGGTGGTGTG CCGCGCCTAC GGGCTGCCCG TGGTCAACCC GGTCCGCCCC GACGGCACCT TCGAGGCCGA CCTGCCGCTG GTGGGCGGGG TGTTCTTCAA GGAGGCCGAC AAGACGCTGG TGCGCGACCT GCGCGACCGC GGCCTGCTCT TCCGCCACGT CAGCTACGAG CACTCCTACC CGCACTGCTG GCGCTGCCAC ACGGCGCTGC TGTACTACGC GGTGCCGTCC TGGTACATCC GCACCACCGC GGTCAAGGAC GAGCTCATCG CGCAGAACCA GGCCACCCAC TGGGTGCCCG AGAACGTCAA GGAGGGCCGC TTCGGCGAGT GGCTGCGCGG CAACGTCGAC TGGGCACTCT CGCGCAACCG CTACTGGGGC ACCCCCCTGC CGATCTGGGA GTTCCCCGAC GGGCGCCAGA TCTGCGTGGG CTCCCTGGAG GAGCTGGGCC GCCTCAGCGG GCGGGACCTG TCCTCCCTGG ATCCGCACCG CCCCTACGTC GACGACATCG TCATCCCCGA CCCGGACGCC GACCCCTCCC TGCCCGAGGA GGAGCGGGTG GCCCGCCGCG TCCCCGAGGT CATCGACGCC TGGTTCGACT CCGGGTCCAT GCCCTTCGCC CAGTGGGGCG CCCCGCACCA GAACGAGGAG ACCTTCCGGG AGAACTTCCC GGCGCAGTAC ATCTCCGAGG CCATCGACCA GACCCGCGGC TGGTTCTACT CGCTGCTGGC GGTGAGCACC CTGGTGTTCG GCCGCAACTC GTTCGAGAAC GTGGTCGTGC TCGGCCACAT CCTCGCCGAG GACGGACGCA AGATGAGCAA GCACCTGGGC AACGTCATGG AGCCCATCGC GGTGATGGAC CGGCACGGCG CGGACGCCCT GCGCTGGTTC ATGCTGGCCA GCGGCTCGCC GTGGACCGCC CGCCGGGTGG GCCACGCCGC CCTGGAGGAG ATCGTCCGCA AGGTGCTGCT GACCTACCAC AGCACCGTGT CGTTCTTCAC CCTGTACGCG AACGCCGGTG AGGGCTGGGA CCACTCGCTG CTGGACTCGG CGCCCGCGCC GCAGGACCGG CCGCTGCTGG ACCGGTGGCT GCTCTCGGAG CTCAACGAGG TCGTCCGCGA CGTCACCGAG GCGATGGACA CCTTCGACAC CACGGCCGCC GGGCGCCGCC TGACCGCGTT CGTGGACGAC GTGTCCAACT GGTACGTGCG CCGTTCCCGC CGCCGGTTCT GGGGCGGGGC CGCCACCCCC GAGGGCGCCG CGGCGTTCGC GACGCTCTTC GAGGCCCTGG AGACCGTCAC CCTGCTGATG GCGCCGATCG TGCCGTTCCT GACCGACCAC GTGTGGTCGG CGCTGCGCCG CCCGGGCGCC CCGGACTCGG TGCACCTGGC CTCCTGGCCC GAGGTGCGCG AGGACCTGAT CGACCCCGAG CTGTCCCGGA ACATGGCGCT GACCCGCCGT CTGGTGGAGC TGGGCCGCTC CGCCCGGGTG GACTCGGCCG TGCGCACCCG CCAGCCGCTG GCCCGCGCCC TGGTGGGCGC CCCGGGCTTC GCGGACCTGC CCGAGCAGCT GCGCGGCCAG ATCGCGGACG AGCTGAACGT GGCCTCGCTG GACTCGCTGT CCTCGGTGGG CGGCGACCTG GTGGACTTCA CCGTGAAGCC GAACTTCCGT GCGCTGGGCA AGCGGTTCGC CAAGCGCACC CCGCTGGTGG CCAAGGCCGT CCAGGCCGCA GACCCGGCCG AACTGGTGCG GCAGGTGCGC GCCACCGGCT GGGCGCGGGT GCACGTCGAG GACGAGCCGG TGGAGGTGAG CGCCGACGAG CTGCTGGTGA CCGAGCAGCC CCGCGAGGGC TGGGCGGTGG CCTCGGAGTC GGGTGAGACC GTCGCCCTGG ACCTGGAGCT CACGCCGGAG CTGCGCCGCG CGGGTCTCGC CCGCGAGATG GTCCGGATGC TCCAGGAAGC CCGCAAGCGG AGCGGGCTGG AGGTGTCGGA CCGCATCGAG GTGTGGTGGA CGGTCACGGA CGAGGCGACC GAGCTGGCGC TCGCCGAGCA CGGCCAGGCG ATCGCCGCCG AGGTGCTCGC CGACTCGTTC GTCGCCGGTG AGCCGGGCCG GGAGCTGCAC ACCGCCTCCT CGGAGGAGTT CGGTGTGACC TTCGGGTTCC GCAAGGCCTG A
|
Protein sequence | MANDPRPESR ALPQLPAQID LPAMEREILG RWSKENVFQR SLDQTRGGPN WVFYEGPPTA NGQPGVHHVE ARAFKDVFPR FRTMRGYHVD RKAGWDCHGL PVEVAVEKEL GLSGKKDIES FGIAEFNDRC RESVLRNVDA FTSMTERMGY WVNMDDAYRT MDPQYVESVW WALKQIWDKG LLVRDYRISP YCPRCGTTLS DHELAQGYET VTDPSVYVRF PVTSGPLAAA EHPTSLLVWT TTPWTLVSNT AVAVHPDVEY VVATDGQERL VVARPLFEKV LGEGWELTGE SFEGDEMERW TYERPFDLVP FDEPAHYVVL ADYVTVEDGT GLVHQSPAFG ADDMVVCRAY GLPVVNPVRP DGTFEADLPL VGGVFFKEAD KTLVRDLRDR GLLFRHVSYE HSYPHCWRCH TALLYYAVPS WYIRTTAVKD ELIAQNQATH WVPENVKEGR FGEWLRGNVD WALSRNRYWG TPLPIWEFPD GRQICVGSLE ELGRLSGRDL SSLDPHRPYV DDIVIPDPDA DPSLPEEERV ARRVPEVIDA WFDSGSMPFA QWGAPHQNEE TFRENFPAQY ISEAIDQTRG WFYSLLAVST LVFGRNSFEN VVVLGHILAE DGRKMSKHLG NVMEPIAVMD RHGADALRWF MLASGSPWTA RRVGHAALEE IVRKVLLTYH STVSFFTLYA NAGEGWDHSL LDSAPAPQDR PLLDRWLLSE LNEVVRDVTE AMDTFDTTAA GRRLTAFVDD VSNWYVRRSR RRFWGGAATP EGAAAFATLF EALETVTLLM APIVPFLTDH VWSALRRPGA PDSVHLASWP EVREDLIDPE LSRNMALTRR LVELGRSARV DSAVRTRQPL ARALVGAPGF ADLPEQLRGQ IADELNVASL DSLSSVGGDL VDFTVKPNFR ALGKRFAKRT PLVAKAVQAA DPAELVRQVR ATGWARVHVE DEPVEVSADE LLVTEQPREG WAVASESGET VALDLELTPE LRRAGLAREM VRMLQEARKR SGLEVSDRIE VWWTVTDEAT ELALAEHGQA IAAEVLADSF VAGEPGRELH TASSEEFGVT FGFRKA
|
| |