Gene Information |
Locus tag | Ndas_4977 |
Symbol | |
ID | 9248866 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014211 |
Strand | + |
Start bp | 117226 |
End bp | 118845 |
Gene Length | 1620 bp |
Protein Length | 539 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | |
Product | membrane-flanked domain protein |
Protein accession | YP_003682865 |
Protein GI | 297563892 |
COG category | |
COG ID | |
TIGRFAM ID | |
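The coordinate and length fields above are mutually consistent, which a short check makes explicit. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 117226
end_bp = 118845

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1620 539
```

Both results match the Gene Length (1620 bp) and Protein Length (539 aa) fields listed in the record.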
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0396765 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGCTGCCCC CGTTCGGGTG GCGGCCCCAG CCGTCCCGGC CCCCGCGGCC GGAGGACGAC GTCACCACGG GCGTGTTCCG TGCGCACTGG ATGAGCGTCC CGCTCCAGGT CGTCGCGGTG ACCCTGGTGT TCGTGGCGCT GGTCGGCCCC ATCCTCAACA CCTTCGGCTT CGCCTGGGTC GTGATGATGG CCGTGGCCGT CGTCTTCGGC ACGCTCGCCT ACGCTCTGCC CGCGTGGTGG TACGCCACCT TCGGGCTGCG CGAGGACCAC CTGGTCGTCC ACAGCGGCCT GGTCAGGCGC AGCTCGCGCG AGGTGCCGCT CAGCCGCCTC CAGGCGGTGG ACGTGGTCCG GCCCCTGTTC CTCCAGGTGT TCGGCATGGC CGAGCTGCGC ATCGAGCTGG CGGGAGGCGA CGGGAGCGAC ATCCGGCTGC GCTGCCTGCC CCGCGTCCTG GCCGAGCGCC TGCGCGTGGC GGTCCTCGCA CACGCCGCCG GACTCCCCGG GCGCTCGCCC GAGGCGCCGG AGTGGCCGTT CTACCGGCTG CCCTTCCTGC TGCTGCTCGG CGCGCTGACC TTCCGCGTGC CGGTCCTGCT GTCCTTCATC GCGCTGCTCA TGCTCGCCAC CGCGGGCGCG GTCTTCGCGG AAGCGGGCGT GGTGGGCGCG CTCCTCCCGC TCACCCTGGG GCTGCTGCGC TACTTCTGGG GGCCCCTGGC CCGCTACACC GACTACTACG CCTCCCTCTC CTCCGACGGT CTGCGCCTGC GCTACGGCAT GTTCCAGCGG CGCATGCAGA CGGTCCCGCC GGGGCGCGTG CAGGCCGTGC GCGTGGTCGA GCCCCTGCTC TGGCGGGCCC TGGGCGTGGC CCGGGTCGAG GCCAACGTGG CCGGTTACGC GGGCGCGCGC CAGGCCGACT CGTCCACCCT GCTGCCCGTG GCGCCGCGCC GCACGGCCTT CGCCCTGGTC AACGAGCTGT TCCCGGAGAG CGAGGCGGCC CATGTGCCGC TCGTGCCCAA CGACAGGCCG GTGCCGAACC TGATGGGCGT GGACGAGCAC CTCTTCGTCA GCACGCGCGG CCTGTTCTGC CTGGTGACCG AGATCGTCCC GGTGGAACGC GTCCAGACGG CGCGCCTGGT GGCCGGGCCG CTGGCCCGCC TGACGGGACG TGTCGCGGTG GACGTGGACA CGCCCCCGGG GCCGGTCCGC GCCAGGGCCC ACGGGCGCCG GGCCCGGGAG GCGCGCCGCT TCCTGGACGC GCTGACCGAG TACGGGCGCC GGGCCCGGGT GCCCGCGGCG GGCACGGAGC GCTGGGCGAC CCGAGCCACC CTCACCCGCC GGACCGGGGC CGCGGGGCGG CCCCCCTTCG AGGAGGCCGG GCCCGGAGAA CCCGCGTCCG CCGGGGCGGC GTCGGCGCAC ATCGAGGACG CGCAGGACCC GACGCGCGTC GAGGCCCCGG CTGCGGACGC GCCGCCGCCG GGCGGCCGTC CGCGTTTCCT GGCGCCGGGG GCCGAAGGTC CCGAATCCGA GGATTGGGGA CCAGGGGTCG CGGAAGCCGG GAAAGACGGC ACCGCCGACA CCTCCGCCGA GGACGGTACG GAGGGGACCC GGGACACCCC CGAACGCTGA
Protein sequence | MLPPFGWRPQ PSRPPRPEDD VTTGVFRAHW MSVPLQVVAV TLVFVALVGP ILNTFGFAWV VMMAVAVVFG TLAYALPAWW YATFGLREDH LVVHSGLVRR SSREVPLSRL QAVDVVRPLF LQVFGMAELR IELAGGDGSD IRLRCLPRVL AERLRVAVLA HAAGLPGRSP EAPEWPFYRL PFLLLLGALT FRVPVLLSFI ALLMLATAGA VFAEAGVVGA LLPLTLGLLR YFWGPLARYT DYYASLSSDG LRLRYGMFQR RMQTVPPGRV QAVRVVEPLL WRALGVARVE ANVAGYAGAR QADSSTLLPV APRRTAFALV NELFPESEAA HVPLVPNDRP VPNLMGVDEH LFVSTRGLFC LVTEIVPVER VQTARLVAGP LARLTGRVAV DVDTPPGPVR ARAHGRRARE ARRFLDALTE YGRRARVPAA GTERWATRAT LTRRTGAAGR PPFEEAGPGE PASAGAASAH IEDAQDPTRV EAPAADAPPP GGRPRFLAPG AEGPESEDWG PGVAEAGKDG TADTSAEDGT EGTRDTPER
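The gene and protein sequences are printed in space-separated blocks of ten characters, so they need normalising before any computation. The sketch below shows one way to strip the spacing, compute a GC fraction, and translate codons. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical and not part of the record. The codon table is built from the standard amino-acid ordering, which translation table 11 shares for all 64 codons (the two tables differ in their permitted start codons, which this sketch does not model). Only the first three blocks of the gene sequence are used as input, so the GC value printed here describes that prefix rather than the whole gene.

```python
# Hypothetical helpers for working with the block-formatted sequences.

def clean(seq: str) -> str:
    """Remove the spaces between 10-character blocks and upper-case the result."""
    return "".join(seq.split()).upper()

def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return sum(base in "GC" for base in s) / len(s)

# Codon table in TCAG order; amino-acid assignments are identical for
# the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon.

    Assumes only A, C, G, T are present; an ambiguous base raises KeyError.
    """
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        amino_acid = CODON_TABLE[s[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)

# First three blocks of the gene sequence from the record.
prefix = "ATGCTGCCCC CGTTCGGGTG GCGGCCCCAG"
print(translate(prefix))               # MLPPFGWRPQ
print(round(gc_fraction(prefix), 3))   # 0.767
```

The translated prefix matches the first block of the protein sequence in the record. Passing the full gene sequence string to the same helpers would give the whole-gene GC fraction and the complete translation.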