Gene Information |
Locus tag | Ndas_3167 |
Symbol | |
ID | 9247024 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 3791309 |
End bp | 3793111 |
Gene Length | 1803 bp |
Protein Length | 600 aa |
Translation table | 11 |
GC content | 78% |
IMG OID | |
Product | protein of unknown function DUF43 |
Protein accession | YP_003681081 |
Protein GI | 297562107 |
COG category | |
COG ID | |
TIGRFAM ID | |
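The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the stop codon is counted in the gene length but not in the protein length. The function names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3791309
end_bp = 3793111
gene_length_bp = 1803
protein_length_aa = 600


def cds_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def expected_protein_length(cds_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the final stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


computed_bp = cds_length(start_bp, end_bp)
computed_aa = expected_protein_length(computed_bp)

print(computed_bp == gene_length_bp)      # True
print(computed_aa == protein_length_aa)   # True
```

Both checks agree with the listed values: 3793111 - 3791309 + 1 = 1803 bp, and 1803 / 3 - 1 = 600 aa.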
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCACGCGA CTCCTCCGGG CCCGGACGGC CCCCCTTCCG CTGCCTCCGG GCGGGTACCG GTCGCGGACG GCGACTCCGC CCGCGCCGAC CGCCCCCTTG ACGGCCCGCC GCCCCTTCCC GAGCCCGCCG CGGAGCTGCT GCGCGGCCAC GGCGTGGACG CGCCCCGGCT GCGCCGTGTG CTGGCGCTGC TCTCGGACGG GCGCGAGTGG GAGGCGAACG CGCTGGTGCG CGCCAGCGGC GTCGCGTACG CGCTGGTGTC CTCCCTGGTG GGGGCCCTGA CGGAGGCCGG TGAGCTGGCG CCCGGGGACG GGCGGGGCCG GGTGCGCCTG GTGCGGCCCG AGCGCTACGC GGGCGCGGCC GGGGAGGAGC CCGCCGACCC GGTGGCCCAC CTGCTCCCCC GCTACTCCCG GGCGCTTGCG GAGCTGGCGC GGGCCGTGGA GGAGGGGCCC GCCTCGGATC TGGACCTGGA CCACGTGGCC GCGACGGCCG AGACCGCGCT GCGCCGGGCG CTGCTGCTGT CCACCCGGTT CGACCTGCGC GACCGCACGC TCCTGTGCGT GGGCGACCAC GACCTGACCT CGGTGGCGCT GACCCTGGTG TGCCCTGGCG CCCGGGCGCA GGTCGTGGAC ATCGACGAAC GCGTCCTGGC CCACATCGAC TCCCTGGCGG CCGAGCTCGG GCTGGCGGTG CGTACGCACG CCGCGGACCT GCGTCTGGGG CTGCCGCCCG CGGTGCGCGG CGGCGCGGAC GTGGTGTTCA CCGATCCCCC CTACACGCCC GACGGGGTGG AGCTGTTCGT GCGGCGCGGT GTGGAGGGGA TGGCCGATCC CCGCCGGGGC CGGGTGCTGG TGGCCTACGG GGCCAGTGAG ACCACCCCGC GGCTGGCCGC GGCCACGCAG GCGAGGCTGG TGCGGATGGA CCTGCTGTTG GAGGCGGTGT GGCCGGACTT CAACCGCTAC CACGGGGCCG AGTCGATCGG CGCGGCCTCC GACCTGTACG TGCTGCGGCC TCTGGCGCGT ACGCTGCCCG CTCCCTCGGG TGAGGTGGCG CGCGTGTACA GCCAGGGGGT GAACGCCAAG GAGGCCCGGG GCGGTCTGGA CGCCGACCGG GCCCGCGCCG TGCTGGACCG GGTGGCCGGG GAGACCGCGA GTGCCGCGGG TCCCGGCGGG ACCGGGGAGA CCGTTGGGGC CGGGGAGGCC GGTGGTGATC GCGCGCCGAC GCTGGTGGGC GCGTGGCCGG GCGAGGTCGC CGGGTCGGGG CGGGTGCGCC TGTCCACGTG GTTGGAGGCG CCCGGGCAGG GCGGCGGACG CGCGGTGGTC AATCTCACGG GCGGGTGGGA CCGCTGGGCG GCGCGGGCGG CCCTGGCCGC CGCCGGGGAC ACCGTGTACG TGCTGGTGCC GTCCTCGGCG GCGTGCGTGC GCGACGAGGC GGGCCAGCGG GGGCTGCGCG CCCTGGTGGA ACCGCGCTTC GGGGTGCGGT TCCTGCGCGG GTTCGGCGCG GACGGCCTGA CGGCGGTGCG GCTGACCCGG CGCCCGGACG CGGACTCGGC GGTGGACCGG CTGCTGGTGT ACGTGCAGGA GAGGGCGCAC GGCACGCTCA CCGCGACCCT GCGGGCGGGG CTGGTGGAGG TGTCGGCGTG GCGGGAGAGC CCGGTCAACA AGCGCACGGC GCGCCACGCG GTGGCCGCGG CGCCGGAGTG GGTGTCCGGG CACACCCTGC TCGATCTGCC CGAGCACCGC TTCGGTGAGC TGCGCGGGGT GGCCGCCGAC CTCCTGGAGC GGGTGGGGTC CTCCCCGGCC TGA
|
Protein sequence | MHATPPGPDG PPSAASGRVP VADGDSARAD RPLDGPPPLP EPAAELLRGH GVDAPRLRRV LALLSDGREW EANALVRASG VAYALVSSLV GALTEAGELA PGDGRGRVRL VRPERYAGAA GEEPADPVAH LLPRYSRALA ELARAVEEGP ASDLDLDHVA ATAETALRRA LLLSTRFDLR DRTLLCVGDH DLTSVALTLV CPGARAQVVD IDERVLAHID SLAAELGLAV RTHAADLRLG LPPAVRGGAD VVFTDPPYTP DGVELFVRRG VEGMADPRRG RVLVAYGASE TTPRLAAATQ ARLVRMDLLL EAVWPDFNRY HGAESIGAAS DLYVLRPLAR TLPAPSGEVA RVYSQGVNAK EARGGLDADR ARAVLDRVAG ETASAAGPGG TGETVGAGEA GGDRAPTLVG AWPGEVAGSG RVRLSTWLEA PGQGGGRAVV NLTGGWDRWA ARAALAAAGD TVYVLVPSSA ACVRDEAGQR GLRALVEPRF GVRFLRGFGA DGLTAVRLTR RPDADSAVDR LLVYVQERAH GTLTATLRAG LVEVSAWRES PVNKRTARHA VAAAPEWVSG HTLLDLPEHR FGELRGVAAD LLERVGSSPA
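The gene and protein sequences can be spot-checked against each other without any external library. The sketch below is an illustration only: it builds the codon table from the standard amino-acid string that NCBI translation table 11 uses for non-start codons, then translates the first 30 nt and the last 18 nt of the gene sequence as listed above. Because the gene length (1803 bp) is a multiple of 3, an 18 nt fragment taken from the 3' end is in frame. The names `translate`, `gene_head`, and `gene_tail` are hypothetical helpers introduced here.

```python
from itertools import product

# Codon order TTT, TTC, TTA, TTG, TCT, ... (bases in T, C, A, G order).
BASES = "TCAG"
# Amino-acid assignments shared by NCBI translation tables 1 and 11.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; whitespace between blocks is ignored."""
    dna = "".join(dna.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# First 30 nt and last 18 nt of the gene sequence, copied from the record.
gene_head = "ATGCACGCGA CTCCTCCGGG CCCGGACGGC"
gene_tail = "GGGTC CTCCCCGGCC TGA"

print(translate(gene_head))  # MHATPPGPDG
print(translate(gene_tail))  # GSSPA*
```

The translated head matches the first ten residues of the protein sequence, and the tail matches its last five residues followed by the TGA stop codon.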