Gene Information |
Locus tag | Ndas_4786 |
Symbol | |
ID | 9248669 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 5672740 |
End bp | 5673870 |
Gene Length | 1131 bp |
Protein Length | 376 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | |
Product | Peptidoglycan-binding domain 1 protein |
Protein accession | YP_003682676 |
Protein GI | 297563702 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0569189 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | GTGAGCGCCG ACACCGCGCT CCTCCCCGAG GAGGAGCGGG AGGAGGAGCG GCCGCCCCGC CGCTGGTTCG GTCCGCTGTG GCTGGGCGCG GCGCTGCTCA CCCTCGTGCT CCTGTGCGTC GTGGCGTGGG CGGCCGTGCG AGGACTGCCG GGGGGTTCCG CGGACGGTGA GGCTCCCGAC CCCGGACCGA CGGCGCGGAT CGAACGCACC ACGCTGCTGC GCGAGGAGAA CCTCGACGGC GCCCTGGGGT ACGCGGGCGG GGGCACGTTC TTCGCCCGCT CCGACGGCGT GGTCACCCGG CTGCCCGGGG TGGGCTCCGA ACTGACCGCG GGCGACCTCG CCTGGGAGAT CGACGGCAGG CCCACGGTCC TGCTGCGCGG GGACAGGCCC GCCTACCGGC CGCTGGAACC GGGGAGCAGC GGCGAGGACG TGCGCCAGTT CGAGCGGGCG CTGGCCGAAC TGGGCTACTC GGGCTTCACC GTGGACGACG AGTACACGTG GCTGACCGCC GAGGCCGTCC GTCGCTGGCA GGACGACACC GAGGGCATGG AGGTCACCGG CGAGGTGCAC CCCTCCCAGA TCTGGTACAC GTCCGGCGCG GTGCGGGTGA CCGGGCACGA GGTGGACGTC AGCGCGGGCG TCGCGCCCGC GACGGCGCTG CTGACCACCA GTTCCACCCG CCAGGTCGTG CGTGTGGACC TGGCCGTCGG CGACCGCGAC CTGCTCACCG AGGACGCCGG GGTCACCGTC CAACTGCCCG GCGGGGAGAG CGTCGCCGGA GTGGTGGAGA GCGTCGGCAC CGTCGCCAGC GTCGAGGAGG GCGAGGAGGG CGCGGGCGGG GGCGGCGACC CCACCGTGGA GGTGGTCATC GACCTGGAGG AGGACCCCGC GGGCTTCCTC GACCAGGCAC CGGTCACCGT GGTCGCCCGC GGGGAGTCGC GGGAGGACGT GCTGGCCGCG CCGGTGGGCG CGCTGATCGC GCTGCCCGGG GACCGCTACG GACTGTCCGT GGTGGACGCG GACGGCACCG TGCGCGACGT GCCGGTGGAG ACGGGCTGGT TCTCGGACGG CCGGGTGGAG GTCAGCGGCG AGGGGATCGG CGAGGGCACC GAGGTGGTGG TCCCCGAATG A
Protein sequence | MSADTALLPE EEREEERPPR RWFGPLWLGA ALLTLVLLCV VAWAAVRGLP GGSADGEAPD PGPTARIERT TLLREENLDG ALGYAGGGTF FARSDGVVTR LPGVGSELTA GDLAWEIDGR PTVLLRGDRP AYRPLEPGSS GEDVRQFERA LAELGYSGFT VDDEYTWLTA EAVRRWQDDT EGMEVTGEVH PSQIWYTSGA VRVTGHEVDV SAGVAPATAL LTTSSTRQVV RVDLAVGDRD LLTEDAGVTV QLPGGESVAG VVESVGTVAS VEEGEEGAGG GGDPTVEVVI DLEEDPAGFL DQAPVTVVAR GESREDVLAA PVGALIALPG DRYGLSVVDA DGTVRDVPVE TGWFSDGRVE VSGEGIGEGT EVVVPE
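
The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below is illustrative only and is not part of the record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon that is not counted in Protein Length (the gene sequence above ends in TGA). The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Protein length in aa, assuming an unspliced CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of seq.

    Spaces and other separators (such as the 10-base grouping used in
    this record) are ignored.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T characters")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


if __name__ == "__main__":
    length = gene_length(5672740, 5673870)
    print(length)                  # 1131, matching Gene Length
    print(protein_length(length))  # 376, matching Protein Length
```

Passing the full Gene sequence string to gc_percent gives a value that can be compared with the GC content field, which is reported here rounded to a whole percent.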