Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_2791 |
Symbol | |
ID | 9246642 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 3333363 |
End bp | 3334673 |
Gene Length | 1311 bp |
Protein Length | 436 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | Extracellular ligand-binding receptor |
Protein accession | YP_003680710 |
Protein GI | 297561736 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.55279 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.125183 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGACCAC TGAGCATCCT CTCCTCGACG CTGGCCGTCA CGGTTCTCCT CACCTCCTGC TCCAGCGGCA CGGACGGCTC CGACCGGGCC GTCACCCCCG GGGTGACGGC CGACGCCGTC GTCATCGGCA CGCACCAACC GCTCACCGGA GCGGCCTCCC CGGGTTTCCG CCACGTCTCC ACCGGCGCCC GCGCCGTGTT CGACTACATC AACGACAACG GCGGCATCCA CGGCCGCCGG ATCGAGTACC AGGTCCAGGA CGACGCGTTC GACCCCGCGC AGACCCAGGA GGCCACCCGC AGCCTCATCG ACGACCAGGA GATCTTCGCC ATGCTCGGCG GCCTGGGCAC CCCCACCCAC GAGGCCGTGA TCGAGGAGCT CAACGAGGCG GGCGTCCCCG ACCTGTTCGT CTCCTCCGGC GCCCTGGCCT GGGACCAGCC CGAGGTCTAC CCCCACAGCT ACGGCTTCCA GGTCGACTAC ACCCGGGAGG CCAAGATCCA GGGCCAGTAC ATCGCCGAGA ACTTCCCCGG CGACAGGGTC GGCCTGCTCT ACCAGAACGA CGACGTGGGC CCCTCCTCTC ACGCGGGGAT CGAGCAGTAC CTCACCGAGG AGATCGTGGC CTGGGAGTCC TACGACCCCG GCGTCCCCGA GCTCGCCGGA CAGGTCGAGG AGCTCAAGCG GTCGGGCGCC GAGGTCGCCG TCTGCCACTG CATCCCCGCC TTCCTGGCCC TGGCCGTCCT GGAGGCCACC GCGATCGGCT ACACCCCGCA GTGGGTGGCG CCCAGCTTCG GCGGCGACGT GGCGGTGGCC ACCGGCCTCA TCGAGGAGTA CGCGCAGGGC ACGGCGGCCG AGAACGTCCC GCCCGAGGCC TTCCTGGACG GTCTGATCAT CACCGCGTTC CTGCCGATGG CCGCCCAGCG CGAGGACCCG TGGACCGAGT TCTTCCTGGA GATCCACGAG AGGTACAACG AGGGCACGCC CTTCACCGAC ACCACCGTCT ACGGCATGGT GCAGGCGGTC CTGTTCGCCC AGGTGCTCAT GGAGGCCGGC CCCGACCTGA CCCGGGAGAG CCTGCTCGGC ACCCTCAACT CCCACGAGTG GACGGGGCCC GGCCTGGTGC CGTTCAACGC CACAGAGGAC GACCACAGCG GCTACGCCGG GGTGATGGTG GTGCAGCACC ACGCCGGCGA GGAGCCCGAG ATCCTCCAGG AGCCCATGGT CACCGACAGC GACGGCGGGG AGGTCCTGCC CTTCGAGCTG GACCGGCCCT CGCCCGACGA GGTGTCCCTC TTCGGGGGGG CCGGCGGCTA G
|
Protein sequence | MRPLSILSST LAVTVLLTSC SSGTDGSDRA VTPGVTADAV VIGTHQPLTG AASPGFRHVS TGARAVFDYI NDNGGIHGRR IEYQVQDDAF DPAQTQEATR SLIDDQEIFA MLGGLGTPTH EAVIEELNEA GVPDLFVSSG ALAWDQPEVY PHSYGFQVDY TREAKIQGQY IAENFPGDRV GLLYQNDDVG PSSHAGIEQY LTEEIVAWES YDPGVPELAG QVEELKRSGA EVAVCHCIPA FLALAVLEAT AIGYTPQWVA PSFGGDVAVA TGLIEEYAQG TAAENVPPEA FLDGLIITAF LPMAAQREDP WTEFFLEIHE RYNEGTPFTD TTVYGMVQAV LFAQVLMEAG PDLTRESLLG TLNSHEWTGP GLVPFNATED DHSGYAGVMV VQHHAGEEPE ILQEPMVTDS DGGEVLPFEL DRPSPDEVSL FGGAGG
|
| |