Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_0746 |
Symbol | |
ID | 9244588 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 914651 |
End bp | 915763 |
Gene Length | 1113 bp |
Protein Length | 370 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | |
Product | protein of unknown function DUF218 |
Protein accession | YP_003678697 |
Protein GI | 297559723 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.770006 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCGAACGG CGAACGACGG CCGCGGTGAC GAGCCGTGGG ACGACCCGAC CACCGCTCCC TCGTCCGACG ACGAGGCCAC CCAGGTCTTC TCCCGGGGGG CCTACCTCCC CGCGTCCTCC GGTGCCGAGG GCGCCGAGAG CACCCGCGCG TTCTCCGTGG ATGACGCCAC CCCCGGCTTC TCCGCCGACG ACGACGCCAC CCGCGCGTTC TCCGACGACG AGACCACGCG GGAGTTCCGG CGCGACGAGG TCCTGACGCC CGGGGCGACC GCCCCGCAGG GCGTGCCCGC CGACCTCGCG GCGGCCGACG ACGCCGTCTT CACCCGCCGC TTCGTCCGCG AGCGTCCGCG GCCCCGCGCC CGCGTCTCCC CGCCTCCGGC GCCCGCACCC CCCGAGGCGG GGTCCCCGGA GGAGGAGTTC CCGCGCGCCC CGGAGACGGC GCCCCGCCGC CGCCCCGAGC CGCCGTCCTT CTCCCGGCCG CGCCGCCGCG GTCGGCGCCG CTTCCGCCTG CGCTGGCTGG TCAGCCTGCT GCTGGTCACC GCCCTGGCCG TCCCGGTCGG CACCTGGGCC TGGGTCTGGT ACGTGGCCCG CCAGGACGAG CGCACCCCCT CCGACGCCAT CGTCGTGCTC GGCGCCAGCC AGTACAACGG CGTGCCCTCG CCCGTCTTCG AGGCCCGGCT GCGCCAAGCC CAGGTCCTCT ACCTGGAGGG CGTGGCCCCG GTCATCGTCA CCGTCGGCGG CAAGCTGCCC GGCGACAACT TCACCGAGGC CGCCTCCGGA CGCAACTGGC TGATCGAGGT GGGCGTCCCC GGCGACCAGG TCATCGCGGT GGAGGAGGGG AGTGACACCC TCCAGAGCAT CGAAGCCGTC GCCGGGGTCT TCGAGGCCAA CTCCTGGGAC ACCGCCATCC TCGTCTCCGA CCCCTGGCAC AGCCTGCGCT CCGAACGCAT GGCCGCCGCC CACGGCATCG AGGCGGGCAC CTCGCCCTCG CGGTCGGGGC CCGCGGTCAT CGAGCGCCGC ACCCAGCTGT GGTACATCAC CAGGGAGACC GCGGCGCTCT GGTACTACTG GATCTTCAAC GACAGCAGCG ACATCCAGGT CGATGCCGCC TGA
|
Protein sequence | MRTANDGRGD EPWDDPTTAP SSDDEATQVF SRGAYLPASS GAEGAESTRA FSVDDATPGF SADDDATRAF SDDETTREFR RDEVLTPGAT APQGVPADLA AADDAVFTRR FVRERPRPRA RVSPPPAPAP PEAGSPEEEF PRAPETAPRR RPEPPSFSRP RRRGRRRFRL RWLVSLLLVT ALAVPVGTWA WVWYVARQDE RTPSDAIVVL GASQYNGVPS PVFEARLRQA QVLYLEGVAP VIVTVGGKLP GDNFTEAASG RNWLIEVGVP GDQVIAVEEG SDTLQSIEAV AGVFEANSWD TAILVSDPWH SLRSERMAAA HGIEAGTSPS RSGPAVIERR TQLWYITRET AALWYYWIFN DSSDIQVDAA
|
| |