Gene Information |
Locus tag | Ndas_3853 |
Symbol | |
ID | 9247724 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 4625698 |
End bp | 4626852 |
Gene Length | 1155 bp |
Protein Length | 384 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | |
Product | glycosyl transferase group 1 |
Protein accession | YP_003681756 |
Protein GI | 297562782 |
COG category | |
COG ID | |
TIGRFAM ID | |
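The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in Protein Length. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


# Values copied from the record above.
length = gene_length_bp(4625698, 4626852)
print(length)                     # 1155
print(protein_length_aa(length))  # 384
```

Both results agree with the Gene Length (1155 bp) and Protein Length (384 aa) fields of this record.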
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.208611 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.427246 |
Fosmid hitchhiking | No
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGCACGCGC TCGTGGCCAC GGTGGTGCAC CACCCGGAGG ACGCACGGAT CCTGCACCGG CAGATCCGCG CCCTGTTGGA CGCCGGACAC AGCGTGACCT ATGTGGCGCC GTTCCGCGAG TGCGGGGTGA CCCCCTGGTC GGAACTGCGC TCGGTGGACG TGCCGCGCTC CTCGGGGCGC GAGCGGCTCG CCTCGCTGCG CGCCGCCCGC GCGGTGCTCG CCGAGCAGGC GCCCCTGGCC GACCTGCTGC TCTTCCACGA CCCCGAACTC CTGATGGCCC TGCCGTCCAG ACGCCCGGTG ACGGTGTGGG ACGTGCACGA GGACACGGCG GCGGCCCTGC TCACCAAGGC GTGGGTGCCC CGGGCGCTGC GGCGTCCGCT GGGCACGGTG GTGCGCTCCT TCGAGCGGCA CGCGGAGCGG CGGATGCGGC TGATGCTGGC CGAGGAGGGG TACCGCTCCC GGTTCCGCCT GGAGCACCCG GTGGTGCCCA ACACCACCGA GGTGCCGGAG TTCCCGGCGC GCGAGCCGGG CGACGACCGG ATCGTGTACC TAGGCCAGGT GTCCGAGGCG CGCGGCGCGC GCGAACTGGT GGAGCTGGGG CGCATGCTGC GCCCGCACGG CGTGCGCCTG GAGGTGATCG GCGGGGCCGA CGCCGGGGTG CGGCCGCTGC TGCGCGAGGC CCAGCAGGAG GACGTCCTGC ACTGGTACGG GTTCGTGCCC AACGACCGGG CGCTGCGGAT CTGCGCGGGC GCCATGGCGG GGCTGAGCCT GCTGCACGAC ACGCCCAACT ACCGGCACTC GATGCCGACC AAGGTCGTGG AGTACATGGC GCACGGCCTG CCGGTGGTGA CCACGCCCAA CCCGATGGCA CAGGAGCTGG TGACCGGCCG TCCGGAGGGC CCGTCGGGCC TGGTGGTGCC GTTCGGGGAC GTGTCGGCCG CGGCGGAGTC GGTGCTGCGG CTGCGCCGGG ACGCGGAGCT GCGCCGGAAC CTGGCGCGCA CCGGGCACCG GATCGCGCGG ACCTCCTTCC ACTGGCCGGT CCAGGCGCGC CTGTTCGTCA AGCGCCTGGA GGCGTGGGCG GACGAGGCCT CCGGCGGGCC GCTGGCCGTC GTGCCGCCCC CGCGGAGTCG CCAGCGCACT CCCGTGCGTG ACTGA
Protein sequence | MHALVATVVH HPEDARILHR QIRALLDAGH SVTYVAPFRE CGVTPWSELR SVDVPRSSGR ERLASLRAAR AVLAEQAPLA DLLLFHDPEL LMALPSRRPV TVWDVHEDTA AALLTKAWVP RALRRPLGTV VRSFERHAER RMRLMLAEEG YRSRFRLEHP VVPNTTEVPE FPAREPGDDR IVYLGQVSEA RGARELVELG RMLRPHGVRL EVIGGADAGV RPLLREAQQE DVLHWYGFVP NDRALRICAG AMAGLSLLHD TPNYRHSMPT KVVEYMAHGL PVVTTPNPMA QELVTGRPEG PSGLVVPFGD VSAAAESVLR LRRDAELRRN LARTGHRIAR TSFHWPVQAR LFVKRLEAWA DEASGGPLAV VPPPRSRQRT PVRD
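The sequence fields can be checked against the GC content and Translation table entries of the record. The sketch below is one way to do this in plain Python, assuming the sequences are pasted as shown (blocks of ten characters separated by spaces). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical. The codon table is written out by hand and matches the amino acid assignments of translation table 11. Alternative start codons are not handled, which does not matter here because the gene begins with ATG.

```python
from itertools import product

# Codon table in NCBI base order (T, C, A, G); '*' marks a stop codon.
# Translation table 11 uses these same amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between ten-character blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    if len(s) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(s), 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the Gene sequence field above.
first_blocks = "ATGCACGCGC TCGTGGCCAC GGTGGTGCAC"
print(translate(first_blocks))    # MHALVATVVH
print(gc_fraction("ATGCACGCGC"))  # 0.7
```

The translated prefix matches the first block of the Protein sequence field. Passing the complete Gene sequence value to `translate` and `gc_fraction` gives results that can be compared with the full Protein sequence and the reported GC content.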