Gene Information |
Locus tag | Ndas_4684 |
Symbol | |
ID | 9248566 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 5556877 |
End bp | 5558487 |
Gene Length | 1611 bp |
Protein Length | 536 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | |
Product | ABC transporter related protein |
Protein accession | YP_003682576 |
Protein GI | 297563602 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
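The coordinate and length fields above are mutually consistent, and the check can be sketched in a few lines. This is a minimal illustration, not part of the record: it assumes the start and end positions are 1-based and inclusive, and that the reported protein length excludes the terminal stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers introduced here.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the "Start bp" and "End bp" fields of this record.
length_bp = gene_length(5556877, 5558487)
print(length_bp)                  # 1611, matches "Gene Length"
print(protein_length(length_bp))  # 536, matches "Protein Length"
```

The strand field does not enter this calculation: a feature on the minus strand spans the same number of bases as one on the plus strand.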
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTTCCG CCTTCTCCGT CGTCTGCACG GGTCTGTCCT TCGCCTGGCC CGACGGCACG CCCGTCCTGA CCGGTCTGGA CGCCGCCTTC GGCACCGGGC GGACCGGGCT GGTCGGCCGC AACGGCAGCG GCAAGTCCAC CCTGCTCCGC CTCGTCGCGG GGCGGCTGAC CCCGGCGTCG GGCACGGTGG CCGTGGACGG CGACGTCGGC TACCTGGACC AGGGACTGAC CCTGGACACC GGCCGCACCG TCGCCGAACT GCTGGGCGTC GACCGGGCCC GCACCGCGCT GCACGCCATC GAGGCGGGCG AGGCCACCGA GGCGAACTTC GCCGCCGTCG GCGAGGACTG GGACGTGGAG GAACGCGTCC TGGCCCAGCT CGAACGGTTC GGCGTCGCCC TGACGGGGGA CGCCCCGCTG GACCGCCCCG TCGGCACCCT CTCCGGAGGG GAGGCCGTGC TGGTCGCGCT CGCGGGGCTG GCGCTGCGCC GCCCCGCCGT CACCCTGCTG GACGAGCCCA CCAACAACCT GGACCGCCGC GCCCGGGAGC GGCTCCACGA CGCGGTCGCG TCCTGGCAGG GGGTGCTCGT CGTGGTCAGC CACGACCGCG AACTCCTGGA ACGCGTCGAC CAGATCGCCG AACTCCGGGA AGGGAGGATG CGCGTGTGGG GCGGCAACTA CTCCGCCTAC ACCGAGCAGC TCGCCGCCGA GCAGGAGGCG GCCCGGCGCA TGGTGCGCGT GGCCGAGGCC GACGTGCGCC GCGAGAAGCG CCAGCTCGCC GACGCCCAGG TCAAGCTGGC CAGGCGGGTG CGTTACGGCA AGAAGATGTA CGACACCAAG CGCGAGCCCC GCGTGGTCAT GAAGCAGCGC AAGCGCGACG CCCAGGTGGC GGCGGGCAAG CACCGCGTCA TGCAGGAGGC CAAGCTGGTC GGGGCCAGGG AGGAGCTGGG CCGGGCGGAG GACGCCGTCC GCGACGACGA CGCCATCCGC GTCGCCCTGC CCGGCACCGG GGTCCCGGCC GGACACGGCG TCCTGGAGCT GGGCACTCCG GTCGGCGGGC GGCTGTACCT GCGCGGGCCC GAACGGGTGG CGCTGGTCGG GCCCAACGGG TCCGGCAAGA CCACCCTGCT GCGGGCCGTG GTCGGCCGGG GGCGCCACCC GCAGGCCGAG GTGGTGCACG CCGCCGACCG GATCGGCTAC CTGCCGCAGC GGCTGGACGT GCTGGACGAC GGCCTGAGCG TGCTCGACAA CGTGCGCGCG GCGGCGCCCT CCGCCACGCC GCACCGGGTG CGGGCCGGTC TGGCCCGCTT CCTCATCCGC GGGGACCGGG TGGAGCAGAG GGCGGGCGAC CTCTCCGGCG GCGAGCGGTT CCGGGTGGCG CTGGCCCGGC TGCTGCTGGC GGACGAGCCT CCCCGGCTGC TGGTGCTGGA CGAGCCCACC AACAGCCTCG ACCTGGACAG CCAGCGGCAG CTGGCCGAGG CCCTCGCGGA CTACCGGGGC GCCCTGCTGG TGGCCAGCCA CGACCACGCG TTCCTGCGCG AGGTCGGAGT GGGCCGCTGG TGGCGGACCG AGCGCGGGCA CGCCCCGGTG GAGGTGTCCG GGGCCGACTG A
|
Protein sequence | MSSAFSVVCT GLSFAWPDGT PVLTGLDAAF GTGRTGLVGR NGSGKSTLLR LVAGRLTPAS GTVAVDGDVG YLDQGLTLDT GRTVAELLGV DRARTALHAI EAGEATEANF AAVGEDWDVE ERVLAQLERF GVALTGDAPL DRPVGTLSGG EAVLVALAGL ALRRPAVTLL DEPTNNLDRR ARERLHDAVA SWQGVLVVVS HDRELLERVD QIAELREGRM RVWGGNYSAY TEQLAAEQEA ARRMVRVAEA DVRREKRQLA DAQVKLARRV RYGKKMYDTK REPRVVMKQR KRDAQVAAGK HRVMQEAKLV GAREELGRAE DAVRDDDAIR VALPGTGVPA GHGVLELGTP VGGRLYLRGP ERVALVGPNG SGKTTLLRAV VGRGRHPQAE VVHAADRIGY LPQRLDVLDD GLSVLDNVRA AAPSATPHRV RAGLARFLIR GDRVEQRAGD LSGGERFRVA LARLLLADEP PRLLVLDEPT NSLDLDSQRQ LAEALADYRG ALLVASHDHA FLREVGVGRW WRTERGHAPV EVSGAD
|
| |
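The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch using only the Python standard library; the helper names (`ungroup`, `gc_fraction`, `looks_like_cds`) are hypothetical and the default start-codon set is an assumption covering only the common bacterial starts, not the full list for translation table 11. The gene sequence as printed begins with ATG and ends with TGA, so it appears to be given in coding orientation even though the record lists the minus strand.

```python
# Stop codons used by translation table 11 (bacterial, archaeal, plant plastid).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def ungroup(seq: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C (0.0 to 1.0)."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def looks_like_cds(dna: str, start_codons=("ATG", "GTG", "TTG")) -> bool:
    """Rough sanity check: whole codons, a plausible start, and a terminal stop.

    The default start_codons tuple is an illustrative assumption.
    """
    return (
        len(dna) % 3 == 0
        and dna[:3] in start_codons
        and dna[-3:] in STOP_CODONS
    )


# First two blocks of the gene sequence from this record, for illustration only.
head = ungroup("ATGTCTTCCG CCTTCTCCGT")
print(head)
print(len(head))
```

Applied to the full "Gene sequence" field, `len(ungroup(...))` can be compared with the reported gene length and `gc_fraction(...)` with the reported GC content.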