Gene Information |
Locus tag | Ndas_1573 |
Symbol | |
ID | 9245423 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 1924899 |
End bp | 1926161 |
Gene Length | 1263 bp |
Protein Length | 420 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | |
Product | glycosyl transferase group 1 |
Protein accession | YP_003679508 |
Protein GI | 297560534 |
COG category | |
COG ID | |
TIGRFAM ID | |
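The coordinate and length fields above are mutually consistent, which can be checked with a small sketch. It assumes the start and end positions are 1-based and inclusive (the record does not say so explicitly, but the reported gene length matches that convention) and that the CDS is unspliced and ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(1924899, 1926161)
print(length)                     # 1263
print(protein_length_aa(length))  # 420
```

Both values agree with the Gene Length and Protein Length rows of this record.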
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.539919 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0199703 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGATCG CCATGGTCTC CGAACACGCC AGCCCGCTGG CGGCGATCAC CGGAGAGGAC GCGGGCGGCC AGAACGTCCA CGTGGCCGAG CTGGCCGCGG CGCTCGCCGC GCGCGGACAC GAGGTCGTGG TCCACACGCG CCGCACCGAC GCCGAGCGGC CCGACAGCGT CTCCCTGGGC CCGGGCGTGC GCGTGGAGCA CGTGCGCGCC GGACCGGCGG CCCCGATCAG CAAGGACGAG CTGCCCCAGT ACATGCCCGA GTTCGCGCAG CGGCTGCGGG CGGCCTGGCG CGTCCAACGG CCCGACGTCG TCCACGCCCA CTTCTGGATG AGCGGTTTCG CCTCCCTGCG GGCGGCCGGC GCCCTGGGCC TGCCCGTGTT GCAGACCTTC CACGCCCTGG GCACCGTCAA GCGCCGCCAC CAGGGCACGG ACGACACCAG CCCCGCCGAG CGCGTCCCGA CCGAACGCGC CGTGGCCGGT CAGTGCGACA TGGTCGTGGC CACCTCCACC GAGGAGCGGC GCGAACTCGC CGAGTGGGGG ATCCCGCCCG CGCGCGTGGC CGTGGTGCCC TGCGGCGTGG ACACCTCGCG CTTCACCGCC GAGGGTCCCG CCGCCGAGCG CGGCGACCGC CCGCGCCTGC TCAGCCTGGG CCGCCTGGTC AGGCGCAAGG GCGTGGACAC GGTGATCCGC GCGCTGGCCG AGGTGCCCGA GGCCGAACTG GTCATCGCCG GAGGAACCGC CCGCGAACGC CTGTGGACCC AGCCCGAGGC GGTACGGCTG CGCATGGCCG CCGAGCGCGC GGGCGTGGAC GACAGGGTCC GCTTCCTGGG CTGCGTGGAC CGCGCTGAGG TCCCCGCCCT GCTCAGGTCG GCCGACGTGG CCGTCAACGT GCCCTGGTAC GAGCCGTTCG GGATCTCCAC CGTCGAGGCG ATGGCCTGCG GTGTCCCCGT GGTCGCCTCC CGCGTGGGCG GCCACGTCGA CACCGTCGTG CACGGGGAGA CCGGGCTCCT CGTCCCGGCC AGGTCGCCCG AGCGGCTGGG CCGCGCCGTG CGCTGGCTGC TCTCGGACGA GGCGACCAGG ATCTCCTTCG CCGACGCCGC GGCCGAGCGG GCGCGCGACC ACTACTCGTG GGCGGAGGTG GCCCGGCGCA CCGAGGAGTG CTACCTGCAC GTGACCACGA CCGCGACCGG TGCGGCCGTC CCCGCGCCCC GCGCGGGGCT CAGCACGCCG GTCGCCACCA CCACCGGGGG AGGCGAGGAG TGA
|
Protein sequence | MRIAMVSEHA SPLAAITGED AGGQNVHVAE LAAALAARGH EVVVHTRRTD AERPDSVSLG PGVRVEHVRA GPAAPISKDE LPQYMPEFAQ RLRAAWRVQR PDVVHAHFWM SGFASLRAAG ALGLPVLQTF HALGTVKRRH QGTDDTSPAE RVPTERAVAG QCDMVVATST EERRELAEWG IPPARVAVVP CGVDTSRFTA EGPAAERGDR PRLLSLGRLV RRKGVDTVIR ALAEVPEAEL VIAGGTARER LWTQPEAVRL RMAAERAGVD DRVRFLGCVD RAEVPALLRS ADVAVNVPWY EPFGISTVEA MACGVPVVAS RVGGHVDTVV HGETGLLVPA RSPERLGRAV RWLLSDEATR ISFADAAAER ARDHYSWAEV ARRTEECYLH VTTTATGAAV PAPRAGLSTP VATTTGGGEE