Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_2802 |
Symbol | |
ID | 9246653 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 3346185 |
End bp | 3347783 |
Gene Length | 1599 bp |
Protein Length | 532 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | ABC transporter related protein |
Protein accession | YP_003680720 |
Protein GI | 297561746 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.15491 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0601311 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGATCG CCACCGACCT CGAACTGCGC GTCGGTCCTC GACTCCTGTT GGAGCCGACC ACCTTCCGCG TCGCCGCCGG GGACCGGATC GGACTGGTCG GCCGCAACGG TGCGGGCAAG ACCACCATGA CCAAGGTGCT GGCGGGCGAG GGCCTGCCCA CCGGCGGCAC GGTCACCTCC TCCGGCAGCA TCGGCTACCT GCCCCAGGAC CCGCGCACCG GCGACCTGGA CGTGATCGCC AGGGACCGGA TCCTGTCGGC GCGCGGCATC GACGAGGCCC TGCGCGGGAT GCGCGAGGCC GAGAAGAAGA TGGCCAGCAC CGACACCAAG ACCCGTAACA AGGGCGTGCG CGCCTACTCG CGCTGGGAGG AGCGGCTGCA CGTGCTGGGC GGCTACTCCG CCGAGTCCGA GGCCGCCGCC ATCTCCTCCA GCCTGGGTCT GCCGGACAAG GTGCTCGGCC AGCCGCTGCA CACGCTCTCG GGCGGTCAGC GCCGCCGGAT CGAGCTGGCG CGCATCCTGT TCAGCGGGGC CGACACCCTG CTCCTGGACG AGCCCACCAA CCACCTGGAC GCCGACTCCA TCGCCTGGCT GCGCGACTTC CTCAAGTCCC ACCAGGGCGG GCTCATCGTG ATCAGCCACG ACGTGGAACT GGTCGAGCAC GTGGTGAACA AGGTGTTCTA CCTCGACGCC AACCGCAGCG TCATCGACGT CTACAACATG GGCTGGAAGC TGTACCTGGA GCAGCGCGAG GCCGACGAGC GCCGCCGCAG GCGCGAGCTG GCCAACGCCG AGAAGCAGGC CGACACCCTG CGCAAGCAGG CCGACCGCTT CCGGGCCAAG GCGTCCAAGG CGCGCGCCGC GCAGCAGATG CTCAACCGGG CCGACCGCCT CCTGGACGGC GTGCAGGGCG TGCGCAAGTC CGACAAGGTG GCCAAGCTGC GCTTCCCCGA CCCGGCGCCC AGCGGCCGCA CGCCGCTCAT GGCCGAGGGC CTGTCCAAGT CCTACGGGTC CCTGGAGATC TTCGCCGGGG TGGACCTGGC CATCGACCGG GGCAGCCGCG TGGTGATCCT GGGCCTCAAC GGCGCGGGCA AGACCACCCT GCTGCGCCTG CTCGGCGGTG TCGAGACACC CGACACGGGC CGGGTCGTGC CCGGTCACGG GCTCAAGCTC GGCTACTACG CCCAGGAGCA CGAGACCCTG GACGTGGACC GGTCCGTGCT GGAGAACATG ATGAGCGCGG CCCCGGACCT GCCGGAGGTC GAGGCGCGGC GCACGCTGGG CTCGTTCCTG TTCACCGGCG ACGACGTGGA CAAGCCCGCG GGCGTGCTCT CCGGCGGTGA GAAGACCCGG CTGGCGCTGG CCACGCTGGT GGTCTCCAGC GCCAACGTGC TGCTGCTGGA CGAGCCCACC AACAACCTCG ACCCGGCCAG CCGCGAGGAG ATCCTGGCGG CGCTGCGCAA CTACAAGGGC GCGATCGTGC TCGTCACCCA CGACGAGGGC GCGGTCGAGG CACTCCAGCC CGAGCGGGTC ATCCTCCTGC CCGACGGCGT CGAGGACGTG TGGAACGCCG AGTTCGAGGA CCTCATCGCG CTGGCCTGA
|
Protein sequence | MLIATDLELR VGPRLLLEPT TFRVAAGDRI GLVGRNGAGK TTMTKVLAGE GLPTGGTVTS SGSIGYLPQD PRTGDLDVIA RDRILSARGI DEALRGMREA EKKMASTDTK TRNKGVRAYS RWEERLHVLG GYSAESEAAA ISSSLGLPDK VLGQPLHTLS GGQRRRIELA RILFSGADTL LLDEPTNHLD ADSIAWLRDF LKSHQGGLIV ISHDVELVEH VVNKVFYLDA NRSVIDVYNM GWKLYLEQRE ADERRRRREL ANAEKQADTL RKQADRFRAK ASKARAAQQM LNRADRLLDG VQGVRKSDKV AKLRFPDPAP SGRTPLMAEG LSKSYGSLEI FAGVDLAIDR GSRVVILGLN GAGKTTLLRL LGGVETPDTG RVVPGHGLKL GYYAQEHETL DVDRSVLENM MSAAPDLPEV EARRTLGSFL FTGDDVDKPA GVLSGGEKTR LALATLVVSS ANVLLLDEPT NNLDPASREE ILAALRNYKG AIVLVTHDEG AVEALQPERV ILLPDGVEDV WNAEFEDLIA LA
|
| |