Gene Information |
Locus tag | Ndas_3238 |
Symbol | |
ID | 9247095 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 3870844 |
End bp | 3872442 |
Gene Length | 1599 bp |
Protein Length | 532 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | ABC transporter related protein |
Protein accession | YP_003681150 |
Protein GI | 297562176 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
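The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (an assumption that is consistent with the listed Gene Length) and that the final codon of this unspliced CDS is a stop codon. The function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


# Values copied from the Gene Information table.
length_bp = gene_length(3870844, 3872442)
print(length_bp)                  # 1599
print(protein_length(length_bp))  # 532
```

Both results agree with the Gene Length (1599 bp) and Protein Length (532 aa) rows of the record.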
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.083946 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCAGCA CGCCATCGGG CGAGGGGGCC CTGGCGCCCC CCGCCGTTGA GCTGCGCGGG ATCACCAAGC GTTTTCCCGG CGTCGTGGCC AACCACGACA TCGACATCAC CGTGGCTCCC GGTACCGTGC ACGCCATCGT CGGCGAGAAC GGTGCGGGCA AGTCCACGCT GATGAAGACC CTGTACGGCA TGCACCGGCC GGACGAGGGA CACATCTACG TCCAGGGCCG AGAGGTGCGC TTCGGCTCGC CCTCCGACGC CATCCGCAAC GGCATCGGCA TGGTGCACCA GCACTTCATG CTCGCCGACA ACCTCACCGT GCTGGAGAAC GTGGTCCTGG GCGCCGAGCG CCGCCACGGC ATCGGCAACC GCGCCCGCGC GCGCATCCGC GAGCTGTCCG CCCAGTACGG CCTGGGGGTC GACCCCGACC GCCTCATGGA GGAACTGGGC GTGGGCGACC GCCAGCGCGT GGAGATCCTC AAGGTCCTCT ACCGCGGCGC GCGGACCATC ATCCTCGACG AGCCCACCGC CGTCCTGGTC CCGCAGGAGG TCGACGAGCT CTTCGACAAC CTGCGCGAGC TCAAGCGCGA GGGCCTGACC GTCATCTTCA TCTCCCACAA GCTGGACGAG GTGCTCTCCG TCGCCGACGA GATCACCGTG ATCCGCCGCG GCACCACCGT GGCCACCGCG GACCCGGGCA CCACCACCGC CCGGGACCTG GCCACGCTCA TGGTCGGCGG CGAGCTGCCC GTGCCCGAGC TGCGCGAGTC CACCGTCACC GACCACGTCG TGCTGTCCCT GGACGGGGTC ACCGTGCACT CCGCCGACGG CCGCGCCGTC GTGGACGGGG TGAGCGTCGA CATCCGCCGG GGCGAGATCG TCGGCATCGC CGGTGTCGAG GGCAACGGCC AGTCCGAGCT CATCGAGGCC ATCATGGGCA TGCGCCCGCT GGCCGCCGGG AGCATCCGCC TGGAGGAGCA GGACATCACC GGCTGGCCCA CTCTCAGGAT CCGCGAGGCG GGTGTGGGCT ACATCCCCGA GGACCGCCAC CGGCACGGCG TGCTGCTGGA GTCCCCCCTG TGGGAGAACC GCATCCTCGG CCACCAGACC AAGGAGCCCA GCGTCCGCGG CCCCTGGATC AACCGGACCG GCGCGCGTGC CGACTCCGAG CGCATCGTCG CCGAGTACGA CGTGCGCACC CCGGGGATCG ACGTCATCGC CGACGCCCTG TCCGGCGGCA ACCAGCAGAA GTTCATCATC GGTCGGGAGA TGAGCGGCTC CCCGCGCTTC CTGGTCGCCG CCCACCCCAC CCGGGGCGTG GACGTGGGCG CCCAGGCCGC CATCTGGGAG CAGCTGCGCG ACGCCCGTGC CGCGGGCCTG GCCGTGCTTC TGGTCTCCGC CGACCTGGAC GAGCTGATCG GCATGTCCGA CACCCTCCAC GTCATCCTGC GCGGCCGACT GGTCGCGCAG GCCGACCCGA CCACCGTCAC ACCCGAACAG CTGGGCTCGG CCATGACCGG CGCCGGACTG CACCGGGCCG ACCAGAGCAC GGAAGGCGAC AGCGGTGCCG ACCGAAACGG AAGCGAGGGC GGCGCATGA
|
Protein sequence | MSSTPSGEGA LAPPAVELRG ITKRFPGVVA NHDIDITVAP GTVHAIVGEN GAGKSTLMKT LYGMHRPDEG HIYVQGREVR FGSPSDAIRN GIGMVHQHFM LADNLTVLEN VVLGAERRHG IGNRARARIR ELSAQYGLGV DPDRLMEELG VGDRQRVEIL KVLYRGARTI ILDEPTAVLV PQEVDELFDN LRELKREGLT VIFISHKLDE VLSVADEITV IRRGTTVATA DPGTTTARDL ATLMVGGELP VPELRESTVT DHVVLSLDGV TVHSADGRAV VDGVSVDIRR GEIVGIAGVE GNGQSELIEA IMGMRPLAAG SIRLEEQDIT GWPTLRIREA GVGYIPEDRH RHGVLLESPL WENRILGHQT KEPSVRGPWI NRTGARADSE RIVAEYDVRT PGIDVIADAL SGGNQQKFII GREMSGSPRF LVAAHPTRGV DVGAQAAIWE QLRDARAAGL AVLLVSADLD ELIGMSDTLH VILRGRLVAQ ADPTTVTPEQ LGSAMTGAGL HRADQSTEGD SGADRNGSEG GA
|
| |
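The sequences above are displayed in blocks of 10 characters, so the spaces need to be removed before any analysis. The following is a minimal sketch, not the database's own tooling: it strips the block spacing, computes a GC fraction, and translates DNA using the amino-acid assignments of NCBI translation table 11. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The example assumes the gene sequence is shown in coding orientation, which is consistent with it starting with ATG and ending with TGA even though the gene lies on the minus strand. Alternative start codons permitted by table 11 are not handled specially here, which is sufficient for this record because it starts with ATG.

```python
# Amino acids for NCBI translation table 11, with codons ordered by the
# bases T, C, A, G at each of the three positions ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and any line breaks, and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks of the gene sequence and first block of the protein
# sequence, copied from the record above.
gene_start = "ATGAGCAGCA CGCCATCGGG CGAGGGGGCC"
protein_start = "MSSTPSGEGA"
print(translate(gene_start) == clean(protein_start))  # True
```

Passing the full Gene sequence string to `gc_fraction` and `translate` in the same way would allow the GC content and Protein sequence fields to be checked against the record.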