Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_4055 |
Symbol | |
ID | 9247927 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 4850381 |
End bp | 4852024 |
Gene Length | 1644 bp |
Protein Length | 547 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | |
Product | ABC transporter related protein |
Protein accession | YP_003681957 |
Protein GI | 297562983 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0422007 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGGCC CGCCCCTGCT CCGGATGAGC GGCATCACCA AGTCCTTCCT GGGCGTGCGC GTCCTGCACG GGATCGACCT GGAACTCCAC CCCGGCGAGC TGCACGCCCT GGTCGGCGAG AACGGGGCGG GCAAGTCCAC CCTGATGAAG GTGCTGGCCG GGGTGCACCG CGCGGACGGG GGCACGGTCG AGCTGGAGGG CGGCACCGTC TCCTTCGAGC ACCCCGTCCA GGCCCAGCGC GCGGGCGTGA CCACGGTCTT CCAGGAGTTC AACCTCCTGC CCGACCGCAC CGTCGCCGAG AACGTCTTCC TCGGCCGCGA GATCCGCCGC CGCGGCCTGG TGGACGCCCG GGCCATGGAG CGGGCCACCG CCGAACTGCT CGCCGAACTC GGCCTGGAGG GCATCGACCC CCGGGCCCGG GTGCGGTCGC TGTCGGTGGC CGAACAGCAG ATCGTGGAGA TCGTCAAGGC GCTCTCGCAC GACGCGCGCA TCATCTCCAT GGACGAGCCG ACCGCCGCGC TGGCCGACCA CGAGGTGGAG GTGCTCTACC GGATCATCGG CCGCCTGCGC GAACGCGGCG TGGCGGTGCT GTACGTGTCG CACCGCATGC GGGAGATCTT CGACCTGGCC GACACCATCA CGGTGCTCAA GGACGGCCAT CTCGTGGACA CCGTCCCCGC CGGTGAGATC GGTCCGGCCG AGCTGGTCCG CAAGATGGTC GGGCGTCCGG TCTCGGCGGT CTTCCCCGAG CCCCTGGAGC CGCACGGCGA GCACGTGGGA CGGGTGCGGC TGTCGGTCAC CGGAGGCGGC AACACCCAGC TGGACGGGAT CGGCTTCGAG GTGCGCGGCG GCGAGATCCT GGGCCTGGGC GGGCTCCAGG GCAGCGGGCG CACCGAGGTC GCCCACGCCC TCTTCGGCGT CGAGCGCTTC ACCCGGGGCG AGGTCCGCGT GGACGGGCGG CGGGTGGACC CGCGCTCGCC GCGCACGGCG GTGCGGGCGG GCCTGGTGCT GGTCACCGAG GACCGCAAGG CGCAGGGGCT GGCGCTGAAC CAGTCGGTGG CGGCCAACGG CCGCCTGGTC CTGGACGCGG TCTGGCCCCT GGGCTCGGCG CGCGGGGCCC GGCGGCTGCC CGGCATCCTC TCCTCCCTGG AGCTGGTGGC GCGCGGCGGC CAGGACCAGG AGGTCCGGTA CCTGTCCGGC GGCAACCAGC AGAAGGTCGT GCTGGCCAAG TGGCTGGCCG CCGAACCCGG CGTGATGGTG CTCGACGAGC CCACGCGCGG CATCGACGTG GGCGCCAAGC AGGCCGTCTA CCGGCTCATG CGCGAGCTGG CCGCGGCCGG TGTGGCGATC GTGCTCATCT CCTCCGAGCT GCCCGAGCTG ATCGGCATGT CCGACCGGCT GGTCGTCCTG CGGGACGGCC GGGTGGCGGG CGAGCTGCCC GGCGGGGCCG CCGAGGAGGC GGTCATGGCG GTGGCCACCG GATCGCCGCA CCCGGGCGGG TCCGCCGCAC CGGTTCCCGG GCAGGACCCG GCCGCCGCTC CACCGCGCCC GGTCGCCCCC GCCCCACCGA CCGCGGGGGG CGACCCGGCC GCCGGAACCG ACAACGGCGG CTCCCCCGCA CGGCACGAGG AGGCCGCCCC GTGA
|
Protein sequence | MSGPPLLRMS GITKSFLGVR VLHGIDLELH PGELHALVGE NGAGKSTLMK VLAGVHRADG GTVELEGGTV SFEHPVQAQR AGVTTVFQEF NLLPDRTVAE NVFLGREIRR RGLVDARAME RATAELLAEL GLEGIDPRAR VRSLSVAEQQ IVEIVKALSH DARIISMDEP TAALADHEVE VLYRIIGRLR ERGVAVLYVS HRMREIFDLA DTITVLKDGH LVDTVPAGEI GPAELVRKMV GRPVSAVFPE PLEPHGEHVG RVRLSVTGGG NTQLDGIGFE VRGGEILGLG GLQGSGRTEV AHALFGVERF TRGEVRVDGR RVDPRSPRTA VRAGLVLVTE DRKAQGLALN QSVAANGRLV LDAVWPLGSA RGARRLPGIL SSLELVARGG QDQEVRYLSG GNQQKVVLAK WLAAEPGVMV LDEPTRGIDV GAKQAVYRLM RELAAAGVAI VLISSELPEL IGMSDRLVVL RDGRVAGELP GGAAEEAVMA VATGSPHPGG SAAPVPGQDP AAAPPRPVAP APPTAGGDPA AGTDNGGSPA RHEEAAP
|
| |