Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_3971 |
Symbol | |
ID | 9247842 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 4749604 |
End bp | 4751352 |
Gene Length | 1749 bp |
Protein Length | 582 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | |
Product | polysaccharide biosynthesis protein |
Protein accession | YP_003681874 |
Protein GI | 297562900 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.137474 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGGGGCGC GTCCCCTCGT GGCGCGCGTG ACCGCCGCGG CGGAGCGGGA GCGGGCGGGA CTGCGCCGCG TGCTGCGCGG CGGGGCGGTC AACATGGCGG GGGCCGTGGT CGGCGCCGCG CTGAACCTCG CCGTGATCGT GACGATCACT CGGGCGTTCT CCCAGGAGAC GGCGGGGCTG CTGTTCTCGG CGACGTCAGT GTTCCTGATC GCCGCGGTGG TGGCCAACCT GGGCGCGTCG GACGGGTTGG TGTACTTCAT CGCCCGGATG CGCGTGTTCG GCGAACCCGG GGGCGTGCCC CGGCTGCTGC GGACGGCCGC GGCCCCGGCC GTGCTGGCGG CGTGCGCGCT GGCGGTGCTG CTGGTGGTGT GCGCCGGTCC GGTGGCGCGG GGCCTGGGCG GCGGCGAGGC GGAGGTCTAC CTGCGGCTGC TGGCGGTGTT CCTGCCGTTC GCGGTGCTGG CGGACACGGC CCTGGCCGCG ACGCGGGCCC ACCACGACAT GGCCGGGACC GTGCTGGTGG ACAAGGTGGG CCGCCCGCTG GCCCAGCTGG CCCTGGTGAC GGGGGTCGCG CTGTCGGGGG CGGCGGGGCT GTTGGCGCTG GCGTGGGCCG GGCCGTACCT GCCCGCGGCC GTGGTGGCGT GGTTCTGGTT GGGACGTGTC GTGCGCCGGG CCTTTCCCGA GGCCGGTGGG GCCTCCGGGG ATGCGGACGG ATCCGGGGGT GCGCACGGTT CCAGGGGTGC GGATGCCCTC GGGAAGACGC ACGGCTCCGG GGATGCGGAT GCCCTCGGGA AAGCGGCGCC CGTGGGGAAG CGGGGGACTT ATGGTGCCCC CAGTGCCACT GGGACGGCTG TGGATTCGGA GATCGGCCGC GCCCGTGAGG AGTCCGCGAC CACGGAGACG GCCGTGGCTC CGGAGGCCGT GGAGGAGAGC GTGGAGCGGG TGGAGGCGCG GACGTTCTGG GCCTTCTCCC TGCCGCGTGC GGTGGCCGCG GTCGCCCAGA TGGGCGTGCA GCGCGGCGGC GTGGTCCTCG TGGCGCTCCT GGGCGGGCTG ACCGGCGCGG CGGTGTTCAC GGCGGCCACG CGGGTCATGG TGGTCGGCCA GTTCGGCACG CAGGCGGTGC TGTACGCGGC CCAGCCCAGG TTCGCCGAAC AGCTGGCGAC GGGCGACCAC GCCGGGGTCC GGGCCCTCTA CCAGGCGGGA ACGGCGTGGC TGGTGTGCCT TCTGTGGCCT TTGTACCTGT CGGTTCTGGT GTTCGCCCCC CAGGTGATGC GGCTGTTCGG GCCGGAGTAC GCGGCGGGGG CGACGGCGCT GGTCGTGGTG TGTGCGGGCC AACTGACGGC CGCCGCGCTG GGGATGAGCG ACCTGGTCCT GACGATGACC GGGCTGACCC GGCTCAACCT GGTCAACAAC GTGCTGTCGC TGGCGGCGAA CGTGCTGGTG TGCGTGCTGC TCGTACCCGT GGCCGGAGCG ACCGGGGCGG CCGTGGCCCT GGTGGCCGCG ATGCTGGTGC GCAAGCTGCT CCCGCTGTGG CAGTTGCGGT CCCACGTGGT GCTGCACCCG TTCAGCCGCC CGGTGCTGGC CGCGACCGCC AGCGCGCTGA CGTGGTTCGG CGTCCTGCCG CTGCTGCTGG AGGCACTGCT GGGCGGCGGG ATCGCGACGC TGGCGATGGC GGTGGCCTCG GGAGCGGTGG GGCACCTGGT GACGGTGTGG TCGCTGCGCG GACTGCTGGG CCTGGACCCG CGCCGGTGA
|
Protein sequence | MGARPLVARV TAAAERERAG LRRVLRGGAV NMAGAVVGAA LNLAVIVTIT RAFSQETAGL LFSATSVFLI AAVVANLGAS DGLVYFIARM RVFGEPGGVP RLLRTAAAPA VLAACALAVL LVVCAGPVAR GLGGGEAEVY LRLLAVFLPF AVLADTALAA TRAHHDMAGT VLVDKVGRPL AQLALVTGVA LSGAAGLLAL AWAGPYLPAA VVAWFWLGRV VRRAFPEAGG ASGDADGSGG AHGSRGADAL GKTHGSGDAD ALGKAAPVGK RGTYGAPSAT GTAVDSEIGR AREESATTET AVAPEAVEES VERVEARTFW AFSLPRAVAA VAQMGVQRGG VVLVALLGGL TGAAVFTAAT RVMVVGQFGT QAVLYAAQPR FAEQLATGDH AGVRALYQAG TAWLVCLLWP LYLSVLVFAP QVMRLFGPEY AAGATALVVV CAGQLTAAAL GMSDLVLTMT GLTRLNLVNN VLSLAANVLV CVLLVPVAGA TGAAVALVAA MLVRKLLPLW QLRSHVVLHP FSRPVLAATA SALTWFGVLP LLLEALLGGG IATLAMAVAS GAVGHLVTVW SLRGLLGLDP RR
|
| |