Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_1032 |
Symbol | |
ID | 9244878 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 1271587 |
End bp | 1272723 |
Gene Length | 1137 bp |
Protein Length | 378 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | |
Product | thiazolinyl imide reductase |
Protein accession | YP_003678981 |
Protein GI | 297560007 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00459922 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0434723 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGCCCGC GACCCCTGCG CGTGCTGCTG TGCGGCACCG GTTTCGGCCG CTTCTACGCC GACGCGCTGG CGCGCATGCC CGACACCTTC GAACTCGCCG GGATCCTCTC CCGGGGCGGC GAGTTCTCCC GCCGCTACGC CGAGCGGGCC GGGGTGCCCC TGTACACCGG CGTCGACCAG GTGCCCGGCA GCGTCGACGC GGCCTGCGTG GTCGTGGGCG CGGCGGTCTC CGGCGGGGCC GGGGGCGAGC TGGCCCGCGC CCTGCTGGCC AGAGGCGTGC ACGTCCTCCA GGAGCACCCC GTCCACGAGG ACGAACTCAC CGCGGCCCTG CGCGCCGCCC GCGGCGGGGG AGCGGTCTAC CACCTCAACC CCTTCTACCG GCACGTGGCG CCGATCCGGC GCTTCCTGGA GGCGGCCCGT GTCCTGCGCG AGCGGGGACC GGTGCTGTTC GCCGACGCCA CCTCCGCCGT CCAGGTCCTG TACCCGCTGC TGGACGTGCT CGCGCGCGCC CTGGGCGGGC TGCGCCCCTG GGCGCTGACC GAACCGCTCG CGACGGACCC CGAGGTCGCC TCCCTGACCG CGCACGCCCA GCCCTACCGG GTGCTCCAGG GCAGCGTCGC GGGGGTCCCC CTGACCCTGC GGGTGCAGAA CCAGATGGAT CCCTCCGACG GGGACAACCA CGCCCTGCTC TGGCACCGGC TGGCCCTGGG CACCGAACAC GGGGTGCTCA CCCTGGCCGA CACCCACGGC CCCGTGCTCT GGCAGCCCCG GCTGTACGCG CCCCGCGACG CCCACGGCCG CCTGCGCTCC ACCGGTCCGG ACACCGGGCA CCTGGCCGAA CCGGCCACCA CGGTGCTGGA GGGAACGGAC GCGCCGCCCT ACCGGGAGGT CTTCGACCGC CTGTGGCCCG AAGCGCTCAC CCGTGCCCTG GCCGACTTCG CCGCCGACGT GCGCGCCGGG GCGAACCCGC TGGTGCGGGG GCAGTTCGAC CTCGGTGTGT GCCGGATGTG GCACGAGACC ACCGGCCGCC TGGGCCGGCC CGAGGTCATC CGCCCCGGTC CGCCCACGGC GCACCTGGGC GCCGCCGACC TGGGACCGGC TCCGCGCGAC GGGGACGCCC GGGAGGCGGC GCCGTGA
|
Protein sequence | MSPRPLRVLL CGTGFGRFYA DALARMPDTF ELAGILSRGG EFSRRYAERA GVPLYTGVDQ VPGSVDAACV VVGAAVSGGA GGELARALLA RGVHVLQEHP VHEDELTAAL RAARGGGAVY HLNPFYRHVA PIRRFLEAAR VLRERGPVLF ADATSAVQVL YPLLDVLARA LGGLRPWALT EPLATDPEVA SLTAHAQPYR VLQGSVAGVP LTLRVQNQMD PSDGDNHALL WHRLALGTEH GVLTLADTHG PVLWQPRLYA PRDAHGRLRS TGPDTGHLAE PATTVLEGTD APPYREVFDR LWPEALTRAL ADFAADVRAG ANPLVRGQFD LGVCRMWHET TGRLGRPEVI RPGPPTAHLG AADLGPAPRD GDAREAAP
|
| |