Gene Information |
Locus tag | Ndas_1035 |
Symbol | |
ID | 9244881 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 1274466 |
End bp | 1276325 |
Gene Length | 1860 bp |
Protein Length | 619 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | |
Product | ABC transporter related protein |
Protein accession | YP_003678984 |
Protein GI | 297560010 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
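The coordinate and length fields above are consistent with each other, and the relationship can be checked with a little arithmetic. The sketch below is only an illustration: it assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS length includes the stop codon. The function names are hypothetical and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Protein length for a CDS whose length includes one stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


# Values taken from the record for Ndas_1035.
length = gene_length_bp(1274466, 1276325)
print(length)                     # 1860
print(protein_length_aa(length))  # 619
```

Under these assumptions, 1276325 - 1274466 + 1 gives the listed 1860 bp, and 1860 / 3 - 1 gives the listed 619 aa.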
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00381243 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0281832 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGCACCG CACAGGACAC CTCCCCGCGC ACACAGGCCG CGCCGGAGCC GGGGACCGGG CGCGCGCCCT CCCCGCTCGG CGTCCTCCTG GAGCCGATCC GGGGGCGCGT CCGCGCTGCC GTGGTCCTCC AGGGCCTGGC CAGCGCGCTC GGCCTCGTCC CCCTGATCTG CCTGGCCGAG GCCGCCGCCG CGCTGCTGGC CGACGGGCCG ACCGACACCG CCCTGGTGTG GACGCTGGTC GCGGTCGCGC TGGCCGGAGC GGTCGCCGCC CTGGCCGCGG GCACCGCCTC CACCCTGGTC GGGCACCTGG CCGACAACGA CATGCAGCTG TCCGTGCGGC GCGCCCTGGC CCGCCACCTG GGGCGGGTTC CGCTGGGCTG GTTCTCCGGC CGCGGCTCGG GCCGGGTCAA GAAGGCCCTG CACGACGACA TCGAGGACGT CCACTCCCTG GTCGCCCACA CCCTCCCCGA CCTGGCCGCC GTCGTGGCGG TCCCCGTGGT CGCCCTGGCC TACCTCGCCT CCGTGGACTG GCGCCTGACC CTGGTCGCGG TGCTGCCGGT GGCCGCGGGA GTCCTGCTGT TCGCGCGGGC CATGGCCGGG TCCATGAAGA AGATGGCCGA GTACGCCGAG GCGATGGGCG CGGTCAACAC CGCGGCCGTG GAGTTCGTGG ACGGCATCCA GGTGGTCAAG CACTTCGGCG GCCACCGCAG GGCCCACGAG CGCTTCACCC GGGCCGCCGA CGCCTTCGCG GACTTCTTCG TCTCCTGGTC GCGCGCCACC ACCCCGGCCA CGGTCGCCTC CTTCCTGGTG CTGTCGGCGC CCACGGTCAC CGTCACCGTG GTGGCGGCGG GGGCGGGCTT CGCCGCCCTG GGCTGGAGCG AACCCGTCTC GGTGGTGGCC TTCGCCCTGC TGGCCCCTGC GCTGTGCGCG CCGATGAACG TGATCGGCTC GCGGGTGCAG CAGATCCAGA CGGCGCGGGC CGCCGCCGGA CGGGTCCGCG ACCTGCTCGC CACCCCGCCC CTGCCCGAGG GCTCCGGCGG GGGACCGCGG GGCGCCCGCG TGGTCTTCGA CGGGGTGCGC TTCGCCTACC CGGCCCAGGA CGGCGCCGAG GGGAGCGGAG CGGGCGAGGA GGTCCTGCGC GGCGTGGACC TGGTCCTGGA ACCGGGCACC GTGACCGCCC TGGTCGGCCC GTCCGGCGCG GGCAAGACCA CCCTGGCCAC CCTGCCCGCG CGCTTCGCCG ACGTCACGGC GGGCGCGGTC ACCGTGGGCG GGGCGGACGT GCGCGACATC CCCGCCGAGG AGCTGTACCG CACGGTCGGC TTCGTGTTCC AGGACGTGCG GCTGCTGCGG GAGACCGTGG CCGACAACAT CGCCATGGGC CGCCCCGGGG CGACCCGTGA GGAGGTGGAG GAGGCCGCCC GCGCCGCCCG CGTCGCCGAA CGGATCGAGG CGCTGCCGCG CGGCTACGAC TCCGTGGTGG GCGAGGACGC CGACCTGTCC GGCGGTGAGG CCCAGCGGGT CTCCATCGCC CGCGCCCTGC TCGCCGACAC CCCCGTGCTG GTGCTGGACG AGGCCACCGC CGCCGTGGAC CCGGTGTCGG AGGCGGCCAT CCAGGACGCC CTCGGCGAAC TGGCCCGGGG CCGCACGGTC CTGGTGATCG CGCACCGGCT GAGCACCGTG GCCGGGGCCG ACCTCATCGC GGTCATGGAC GAGGGGAGCG TGGTGGAGCG CGGCACCCAC GGCGAGCTGC TGGCGCGCGG CGGCCGGTAC GCCGACCTGT GGCGCGCCCA GCACCCGGAG 
GCCGGTGACG GGGACGCCCA TGACGGGGCC GCACGAGACG AACCGGGAGA AGACCAGTGA
|
Protein sequence | MSTAQDTSPR TQAAPEPGTG RAPSPLGVLL EPIRGRVRAA VVLQGLASAL GLVPLICLAE AAAALLADGP TDTALVWTLV AVALAGAVAA LAAGTASTLV GHLADNDMQL SVRRALARHL GRVPLGWFSG RGSGRVKKAL HDDIEDVHSL VAHTLPDLAA VVAVPVVALA YLASVDWRLT LVAVLPVAAG VLLFARAMAG SMKKMAEYAE AMGAVNTAAV EFVDGIQVVK HFGGHRRAHE RFTRAADAFA DFFVSWSRAT TPATVASFLV LSAPTVTVTV VAAGAGFAAL GWSEPVSVVA FALLAPALCA PMNVIGSRVQ QIQTARAAAG RVRDLLATPP LPEGSGGGPR GARVVFDGVR FAYPAQDGAE GSGAGEEVLR GVDLVLEPGT VTALVGPSGA GKTTLATLPA RFADVTAGAV TVGGADVRDI PAEELYRTVG FVFQDVRLLR ETVADNIAMG RPGATREEVE EAARAARVAE RIEALPRGYD SVVGEDADLS GGEAQRVSIA RALLADTPVL VLDEATAAVD PVSEAAIQDA LGELARGRTV LVIAHRLSTV AGADLIAVMD EGSVVERGTH GELLARGGRY ADLWRAQHPE AGDGDAHDGA ARDEPGEDQ
|
| |
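The gene and protein sequences above are related through translation table 11 (the bacterial, archaeal and plant plastid code), in which an alternative start codon such as GTG is read as methionine at the first position. The following is a minimal sketch of that translation and of a GC-content calculation, assuming the sequence is pasted as shown here, in space-separated blocks of ten bases. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical, and the start-codon set is the one NCBI lists for table 11.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiation codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove whitespace between blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        # An alternative start codon at position 0 is read as methionine.
        if i == 0 and codon in START_CODONS:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First three blocks of the gene sequence in this record.
print(translate("GTGAGCACCG CACAGGACAC CTCCCCGCGC"))  # MSTAQDTSPR
```

Passing the full gene sequence to `translate` and `gc_percent` is one way to compare the result against the Protein sequence and GC content fields of the record.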