Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_5208 |
Symbol | |
ID | 9249101 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014211 |
Strand | - |
Start bp | 358087 |
End bp | 359364 |
Gene Length | 1278 bp |
Protein Length | 425 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | |
Product | glycosyl transferase group 1 |
Protein accession | YP_003683094 |
Protein GI | 297564121 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.330402 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.673241 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAAGATCG CGATGGTCGC CGAACACGCC AACCCCCTCC CCGCCCACAG GGGCGAGCCC GCCTGCCCCG CCAGCCTGCA CGTGTGCGCC CTGTCCCGGC AGCTGGCCAA GCGGGGCCAC CGGGTGACGG TCTACGCCCG CCGCAGTGAC CCCGACCAGC CCGACGGCCG CACCCGCATG GCGCGCGGCG TCTCCGTCGC CTACCTGGAC GCCGGCCCGG CCCGGCCGCT GTCCCCCGAG GAGCACGCCG AGCACACCGG CGCCTTCGGC AGCGCCCTGG CCTCCGTCCT GGACGAGGAC AGCCCCGACG TCCTGCACGC GATCGGCTGG ACCAGCGGCC TGGCCGCCCT GCACGCGCAG GCGCACAGCG AGAGCGACCA GACCGGCACG CCCCTCGTGC AGACCTTCCA CTCCCTCAAC GCCAGCGAGC AGCGCTCCGG CCTCGGCCAC CACCCCGAGC GCGCCCGCAT GGAGACCATC CTCGCCTCGC GCGCCGACCG CGTGCTGGTC AACTCCACCG ACCAGCAGGT CGAGCTGGCC CGCCTGGGCG TCCCCCGCCA CCACGTCAAC GTCGTGCCCT TCGGTGTGGA CCCCGACCAC TTCAGCGTGG AGGGCAGCGC CTCCGCCGAG CACTGGCACT CCCGGCGCGA GGAGCGCGCC CGCCTGGTCT CGGTCACCTC CCTGACCGAG GCCGGCGGCG CCGACCGGCT CGTGGAGGCC ATGACCCGCC TCCCCGAGGC GGAGCTGCTG CTCGTCTCCA CCGCCGAGGA CCTGGACGTG GCCCTGGACG AGAACGCCCG CCGGATCGAG CTCCTGGCCA AGGAGGCCGG GGTGAACGAC CGCGTCCACC TGGCCGGGCC CGTGGAGCGC AAGGAGCTGC CGCGCCTGCT GCGCTCCGCG GACGTGTACG TGTCCGCCGC CTCCTACGAC CCCTACGGCG GGGCCGTGCT GGAGGCCATG GCGTGCGGCC TGCCCGTGGT GGCCACCGCC ACCGGCGCCA CCCCGGGGGC CGTCCTGCAC CGCACCAGCG GCGTGCTGAT GCGCTTCGGC CGCCCCGACG AGGTCGTGCG CTCCGTGCGC GCGGTCCTCA ACACCCCGAC CATGAGCACC GCGTACGGCA TCGCCGCCGT GGACCGGGCC CGCTCCCGGT TCACCTGGCA GCGGATCGCC GTCGAGACGG AGCTCGCCTA CGAGCGCTCC CGCCCGCAGC AGACGGAACA GGACCGCGCC GACGAGGACG AGACGGACGG TCTGCTACTG TCCGGGACCG CGCACTGA
|
Protein sequence | MKIAMVAEHA NPLPAHRGEP ACPASLHVCA LSRQLAKRGH RVTVYARRSD PDQPDGRTRM ARGVSVAYLD AGPARPLSPE EHAEHTGAFG SALASVLDED SPDVLHAIGW TSGLAALHAQ AHSESDQTGT PLVQTFHSLN ASEQRSGLGH HPERARMETI LASRADRVLV NSTDQQVELA RLGVPRHHVN VVPFGVDPDH FSVEGSASAE HWHSRREERA RLVSVTSLTE AGGADRLVEA MTRLPEAELL LVSTAEDLDV ALDENARRIE LLAKEAGVND RVHLAGPVER KELPRLLRSA DVYVSAASYD PYGGAVLEAM ACGLPVVATA TGATPGAVLH RTSGVLMRFG RPDEVVRSVR AVLNTPTMST AYGIAAVDRA RSRFTWQRIA VETELAYERS RPQQTEQDRA DEDETDGLLL SGTAH
|
| |