Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_2503 |
Symbol | |
ID | 9246353 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 2966655 |
End bp | 2968265 |
Gene Length | 1611 bp |
Protein Length | 536 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | |
Product | protein of unknown function DUF112 transmembrane |
Protein accession | YP_003680428 |
Protein GI | 297561454 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.835777 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATTCCC TGACACCCAT GATCAACGGG TTCGGTGTCG TCCTCGAACC GGTCAACCTG CTCTACTGCC TGATCGGCGT CGTGGTCGGC ATGCTGGTCG GGGTCCTGCC CGGGCTCGGG CCCGCGGCCA CGATCGCGAT CCTGCTCCCC CTGACCTTCG GCCTCGAACC GGTGACCGCG ATCATCATGC TCGCGGGCAT CTTCTACGGC ACCCAGTACG GGGGGACGAT CACCTCGGTC CTGCTGCGCC TGCCCGGCGA GGCGTCCTCG GTGGTGACGG TCTTCGACGG CCACATGCTG GCCCGCCAGG GCCGGGCCGG GACGGCGCTG GGCATCGCGG CCGTGGGCTC GTTCGTGGGC GGGACCGTGT CGATCGTGGC CCTGTCCCTG GTCGCGCCCC TGGTGGCGAG CTTCGCCCTG GACTTCGGCC CGCCCGAGTA CACCGCGCTG GCGCTGCTGG GCATCCTGCT GGTGTCCACC GTCGGCAACG GCAGCCGGAT CAAGGCGGTC ATCGCCGCCG GCGTGGGCCT GCTGCTGGCC ACGGTCGGGC TGGACACCTT CACCGGCGCC GAACGCTTCA CCTTCGACTC CATGGCGCTG TCCGACGGGA TCGACTTCGT GCCGATCGCG ATGGGCCTGT TCGGCATCGG GGAGATCCTG CACAGCCTGG AGGAACGCCA CCGGGCGCCG AAGAAGCCCC TCAAGGTCAC CAACACCTGG CCCTCGCGCA AGGACCTGCG CCAGTCGTCG GGCGCGATCG GGCGGGGTTC GCTCATCGGC TTCGCGCTGG GCATCCTGCC CGGCGGAGGC GCCACCCTGT CCTCCCTGGC GGCCTACGCG ATGGAGAAGC GGCGCTCACG CGACCCCGAG CGCTTCGGCA GGGGCGCGGT GGAGGGCGTC GCCGCTCCCG AGACGGCCAA CAACGCCGCC GCCACCTCCT CGTTCATCCC GCTGCTGACC CTGGGCATCC CGGCGAACGC GACGATGGCG ATCATCTTCG GCGCGCTGCT CATCCAGGGT GTGCCGCCGG GACCGGAGCT GGTGACCCAG GAGCCGGAGC TGTTCTGGGG CGTCATCAAC TCGATGTACA TCGGCAACAT CCTGCTGCTG ATCATGAGCA TTCCGCTGGT GGGGCTGTTC GTGCGGATCC TGCGGGTGCG CCCGACGATC CTGGCGCCCA TCACGGTGCT GATCACGCTG GTGGGCGTGT ACACGGTGCG CAACAACGTG TTCGACATCG TGCTGGTGGT GGTCTTCGGA CTGTTGGGCT ACCTGATGAA GAAGTTCGGC TTCGACCCGG GGCCGCTGGT GCTGGCCTTC GTCCTGGGTT CGCTGCTGGA GAGCTCGCTG CGCCGGTCAC TGCTGCTCTT CGACGGGGAC CCCACGGGGT TCCTGACCCG GCCGATCTCG GGAACGCTGT TGCTGCTGTT GGCGGTGGTG ATCGTGCTGC CGCTGACGCG CGCTCTGTGG CGGTGGTACC GGGGCCGGGT CGATGGCAGT AGCAGTGGCA GTGGCAGTGC CAGTGGTGGT GCCAGTGGCA GTGGTGACGG CGGCGGTAGT GGCGCTGGTG AGGGTAGGAG CGAGGAACCC GCAGGGAGGA CGGACGCCTG A
|
Protein sequence | MDSLTPMING FGVVLEPVNL LYCLIGVVVG MLVGVLPGLG PAATIAILLP LTFGLEPVTA IIMLAGIFYG TQYGGTITSV LLRLPGEASS VVTVFDGHML ARQGRAGTAL GIAAVGSFVG GTVSIVALSL VAPLVASFAL DFGPPEYTAL ALLGILLVST VGNGSRIKAV IAAGVGLLLA TVGLDTFTGA ERFTFDSMAL SDGIDFVPIA MGLFGIGEIL HSLEERHRAP KKPLKVTNTW PSRKDLRQSS GAIGRGSLIG FALGILPGGG ATLSSLAAYA MEKRRSRDPE RFGRGAVEGV AAPETANNAA ATSSFIPLLT LGIPANATMA IIFGALLIQG VPPGPELVTQ EPELFWGVIN SMYIGNILLL IMSIPLVGLF VRILRVRPTI LAPITVLITL VGVYTVRNNV FDIVLVVVFG LLGYLMKKFG FDPGPLVLAF VLGSLLESSL RRSLLLFDGD PTGFLTRPIS GTLLLLLAVV IVLPLTRALW RWYRGRVDGS SSGSGSASGG ASGSGDGGGS GAGEGRSEEP AGRTDA
|
| |