Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_3174 |
Symbol | |
ID | 9247031 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 3799957 |
End bp | 3801174 |
Gene Length | 1218 bp |
Protein Length | 405 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | tryptophan synthase, beta subunit |
Protein accession | YP_003681088 |
Protein GI | 297562114 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.546685 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCACG ACCAACCCCT GCCCGACCAG CTCGGGCACT ACGGCCGCTT CGGCGGCCGC TTCGCGCCCG AAGCCCTCGT CGCCGCCCTC GACGAGGTGG CCGCGGAGTG GGAGAAGGCC AAGCAGGACC CCCAGTACCA GGCCGAACTC GCCGAACTGC TCAAGGACTA CACCGGGCGC CCCAGCGCCC TGAGCGAGGC CCGCAACTTC TCCGAGCACT GCGGCGGCGC GCGGATCCTG CTCAAGCGCG AGGACCTCAA CCACACCGGA TCCCACAAGA TCAACAACGT CCTCGGCCAG GCCCTGCTCA CCAGGCGCAT GGGCAAGACC CGCGTCATCG CCGAGACCGG AGCCGGACAG CACGGCGTGG CCACCGCCAC AGCCTGCGCC CTGCTCGGCC TGGAGTGCGT GATCTACATG GGCGAGGAGG ACACCCGCCG CCAGGCGCTC AACGTCGCCC GCATGCGCAT GCTCGGCGCC GAGGTCGTCC CCGTCACCAT CGGCAGCCGC ACCCTCAAGG ACGCCATCAA CGAGGCCTTC CGCGACTGGG TGGCCAACGT CGACCGCACC CACTACCTGT TCGGCACCGT CGCCGGACCG CACCCCTTCC CCAAGCTCGT GCGCGACCTG CACTTCGTCG TCGGCCAGGA GGCCCGCGAA CAGGTCCTGG AGCGCGTCGG CAGGCTCCCC GACGCGGTCG CCGCGTGCGT GGGCGGCGGC TCCAACGCCA TGGCCGTCTT CGCGGCCTTC ATCCCCGACG AGGAGGTCGC CCTGTACGGC TTCGAGGCCG GTGGGGAGGG GGCGCGGACC ACCCGCACCG CCGCCTCCAT CACGGCGGGC AGCCCCGGCG TCTTCCACGG GGCGCGCACC TTCGTGCTCC AGGACGAGTA CGGCCAGACC CTGCCCAGCC ACTCCATCTC CGCCGGACTC GACTACCCGG CGGTGGGCCC CGAGCACGCC TACCTCGCCG ACACCGGCCG CGCCACCTAC GAGCCGGTCA CCGACGCCGA GGCGATGGAG GCCTTCCGGC TGCTGTGCCG CACCGAGGGC ATCATCCCCG CCATCGAGAG CGCGCACGCC CTGGCCGGCG CCCGCAAGCT CGGCGAGCGC CTCGGCCCGG ACGCCGTCAT CCTGGTGAAC CTCTCCGGGC GCGGCGACAA GGACGTTGAC ACCGCGGCCG CCTACTTCGG CCTCGTCGAC CCGGAGGGAC AGGCGTGA
|
Protein sequence | MSHDQPLPDQ LGHYGRFGGR FAPEALVAAL DEVAAEWEKA KQDPQYQAEL AELLKDYTGR PSALSEARNF SEHCGGARIL LKREDLNHTG SHKINNVLGQ ALLTRRMGKT RVIAETGAGQ HGVATATACA LLGLECVIYM GEEDTRRQAL NVARMRMLGA EVVPVTIGSR TLKDAINEAF RDWVANVDRT HYLFGTVAGP HPFPKLVRDL HFVVGQEARE QVLERVGRLP DAVAACVGGG SNAMAVFAAF IPDEEVALYG FEAGGEGART TRTAASITAG SPGVFHGART FVLQDEYGQT LPSHSISAGL DYPAVGPEHA YLADTGRATY EPVTDAEAME AFRLLCRTEG IIPAIESAHA LAGARKLGER LGPDAVILVN LSGRGDKDVD TAAAYFGLVD PEGQA
|
| |