Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_0580 |
Symbol | |
ID | 9244422 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 721827 |
End bp | 723026 |
Gene Length | 1200 bp |
Protein Length | 399 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | |
Product | glycosyl transferase group 1 |
Protein accession | YP_003678533 |
Protein GI | 297559559 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.55279 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCGAAC GCCCGCTGCG CGTCCTGATC GCCAGCGACA CCTATCCCCC CGACGTCAAC GGGGCCGGAT ACTTCACCCA CCGCCTGGCC GAGGGACTGG CCGGACGGGG GCACCGGGTC CACGTGGTGT GCCCCTCCGA GCGGGGCGAA CCCCACGTCA CGGTCAACGG CGCGGTGACC GAGCACCGCC TGCGCTCGGC CCCCATCCCC TTCATGCGCG CCGCCGTCCC GCTGGGGATG GGCGGGCACA TCGCCAGGGT CATCGAGCGC CTGGACCCCG ACGTCGTGCA CGCGCAGAGC CACTTCCCGC TCAGCCGCTC GGCGATGCGC AGGGGCCGTG CCGCAGGCGT CCCCGTCGTG CTCACCAACC ACTTCATGCC GGACAACCTG TACGCGCACG CCCGGATCCC CGCGCCGATG CAGGAACTCG CGGGCCTCCT GGCCTGGAGG GACATGGTCC GGGTGGCGGG GGAGGCAGAC CACGTGACCA CGCCGACCCC GCGGGCGGCC CGGCTCCTGC GGGAGAAGGG GTTCACGCGT GAGGTCGAGG CCATCTCGTG CGGGATCGAC CTGGAGCGGT TCCGCCCCCA CGAGGACCCC GCCGCGGCCC GGCGCCGGTT CGGTCTGCCC GACCGGGACA CGATCGTGTT CGTGGGCAGG CTGGACGCGG AGAAGAGGAT CGACGACACC ATCCGGGCGC TCGCGCGGAT CGTGCCCGAA CGCGACGCCC AGCTGGCCCT GGCCGGGACC GGTCAGCGCG AGGAGGAGCT GCGCGCGCTG GCCGCGGAGC TGGGGGTGGC GGACCGGGTG TTCTTCCTGG GGTTCGTCCC GGACGAGGAC CTGCCCCTGG TCTACGCGGC GGGGGACGCC TTCGCCATCG CGGGTGTGGC CGAGTTGCAG AGCATCGCCA CCCTGGAGGC GATGTCCACC GGTCTGCCGG TGGTGGCCGC GGACGCGATG GCGCTGCCGC ACCTGGTGGA GGAGGGGCGC AACGGGTTCC TCTTCCCGCC GGGCGACCCC GTGCGCCTGG CGGACCGGCT GCTCCTCGTG CTCGGCCCCG GTCGGCGCGC GCTCGGCGCG GCCAGCCGTG AGCTCGCGCG GCGGCACGAC CACCACCGGT CCCTGGAGCG GTTCGAGGAG GTCTACACCC GTCTGCGGGG CACGGCTCCC GCGCGGGTGG AGGGGCTGGT CGCCGCCTGA
|
Protein sequence | MAERPLRVLI ASDTYPPDVN GAGYFTHRLA EGLAGRGHRV HVVCPSERGE PHVTVNGAVT EHRLRSAPIP FMRAAVPLGM GGHIARVIER LDPDVVHAQS HFPLSRSAMR RGRAAGVPVV LTNHFMPDNL YAHARIPAPM QELAGLLAWR DMVRVAGEAD HVTTPTPRAA RLLREKGFTR EVEAISCGID LERFRPHEDP AAARRRFGLP DRDTIVFVGR LDAEKRIDDT IRALARIVPE RDAQLALAGT GQREEELRAL AAELGVADRV FFLGFVPDED LPLVYAAGDA FAIAGVAELQ SIATLEAMST GLPVVAADAM ALPHLVEEGR NGFLFPPGDP VRLADRLLLV LGPGRRALGA ASRELARRHD HHRSLERFEE VYTRLRGTAP ARVEGLVAA
|
| |