Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_4476 |
Symbol | |
ID | 9248355 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 5314356 |
End bp | 5315744 |
Gene Length | 1389 bp |
Protein Length | 462 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | DEAD/DEAH box helicase domain protein |
Protein accession | YP_003682371 |
Protein GI | 297563397 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTTATC CGCACCGCAC CGACACCAGC CGCGGTCGTC CCTCCCGGGG ATACGGCCGT CAGCGCCCCC GCACGCCCGG ACACCCCCGG TCCGGCGGTC CTTCCGGGGA GTTCGCCCTC CCGGTCAGCA TCACCCCAGC GCTCCCGCCG GTGGAGTCCT TCGAGGCGCT GGACATGCCC GCACCGTTGA AGCGGACCCT GACCCGGCAG GGGCTGACGG TCCCGTCCGA GATCCAGGCG GCCACCCTGC CGAACTCCCT GGCCGGACGC GACGTCCTGG GCCGCAGCCG GACCGGTTCG GGCAAGACCC TGGCCTTCGG ACTGGCTCTG CTCGCCCAGC TCGACGGGCG CAAGGCCGAG CCCCGGCGCC CCCTGGCCGT GGTGCTCGCA CCCACCCGTG AGCTGGCCCA GCAGGTGGCC GACGCACTGG CGCCCTACGC GCGGTCGGTG GGGGTGCACG CGACGACCGT GGTGGGCGGC ATGCCCATCA GCCGCCAGTC CCGCGCCCTG CGTCAGGGCG TGGAGCTGGT CGTGGCCACC CCGGGCCGCC TGCGCGACCT GATGGAGCGC GGCGACTGTG TTCTGGACCA GGTTGAGGTC ACCGTCCTGG ACGAGGCCGA CCAGATGACC GACATGGGCT TCATGCCGCA GGTCACCGCG ATTCTCCAGC AGGTTCCGGC CGACGGTCGA CGCATGCTGT TCTCGGCCAC CCTGGACCGC AACGTCGACA CCCTGGTCCG CCGGTTCCTG AACGACCCGG TGGTCCACTC GGTCGACCTG TCCGAGGGCG CGGTCTCCAC CATGGAGCAC CACGTCATGC ACGTGTCGCA CGCCGACAAG CAGGACGTCG TGACCCGGAT CGCGGCCCGC GACGGTCGGG TGATCATGTT CCTGGACACC AAGCGCGCCG TGGACCGGAT GGCCGAGCAC CTGCTGGCCA ACGGGGTCCT CGCCGCGCCG CTGCACGGCG GGCGGTCCCA GCCCCAGCGC ACCCGCACCC TCGACCAGTT CAAGAGGGGC GCCGTGACCG CGCTGGTCGC CACCAACGTC GCCGCGCGCG GCATCCACGT GGACGGGCTG GACCTGGTGG TCAACATCGA CCCGCCCACC GACCACAAGG ACTACCTGCA CCGGGGCGGA CGCACCGCGC GCGCGGGCGA GGCGGGCAGC GTGGTCACGC TCGTGCTGCC CAGCCAGCGG CGCGACATGA CCCGGCTGAT GAGCCGGGCC CGGATCGACG CGCACACCGC GCGGGTGGAC GCCCAGGACG TGGAGCTGAC CAGGGTGACC GGAGCGCGTG AGCCCTCGGG CGTACCGGTC GCCGTGCCTG CCGGGGGCTC GCGGGAGCGG CGGCACCGTG GTGCCGCGGC ACGCCGACCG CGCCGCTGA
|
Protein sequence | MSYPHRTDTS RGRPSRGYGR QRPRTPGHPR SGGPSGEFAL PVSITPALPP VESFEALDMP APLKRTLTRQ GLTVPSEIQA ATLPNSLAGR DVLGRSRTGS GKTLAFGLAL LAQLDGRKAE PRRPLAVVLA PTRELAQQVA DALAPYARSV GVHATTVVGG MPISRQSRAL RQGVELVVAT PGRLRDLMER GDCVLDQVEV TVLDEADQMT DMGFMPQVTA ILQQVPADGR RMLFSATLDR NVDTLVRRFL NDPVVHSVDL SEGAVSTMEH HVMHVSHADK QDVVTRIAAR DGRVIMFLDT KRAVDRMAEH LLANGVLAAP LHGGRSQPQR TRTLDQFKRG AVTALVATNV AARGIHVDGL DLVVNIDPPT DHKDYLHRGG RTARAGEAGS VVTLVLPSQR RDMTRLMSRA RIDAHTARVD AQDVELTRVT GAREPSGVPV AVPAGGSRER RHRGAAARRP RR
|
| |