Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_2557 |
Symbol | |
ID | 9246408 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 3048307 |
End bp | 3049515 |
Gene Length | 1209 bp |
Protein Length | 402 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | |
Product | ABC transporter related protein |
Protein accession | YP_003680482 |
Protein GI | 297561508 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.000490216 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.117456 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCCCGA ACCACCGCCC TGACGAGACC ACACCCGACC CCGAATCCGG CCCCGCGGCG GCGCCCGAAC GCACGGCCCC GACCCCGGCC GCACCCGCCT CCGGGCCCGA ACGCACGTCC GCACCAGCGT CCGCGTCCTC GGCCGCGTCC ACGCCCGTAC CCGCGCTGGA GGCGGACCTG TGCCTGAGCC GGGGTGCCTT CACCCTGGAG GCCTCCCTGA CGGTGCGCCC GGGGGAGATC CTCGCCCTGC TGGGGCCCAA CGGCGCGGGC AAGTCCTCGG CGCTGCGCGC CCTGGCCGGA CTGGTGCCCC TCACCGGCGG GCGCGTCCTC GTGGACGGGC GCGACCAGAC CAGGACACCC GTGGAGCACC GCCCCATCGG CATGGTCTTC CAGGACTACC TGCTCTTCCC GCACATGAGC GCCCTGGACA ACGTGGCCTT CGGCCCCCGC CACCAGGGCC TGTCGCGGGC GGGGGCCCGC GAGCGCGCCG CCGAACTACT CGCCCACATG GACCTGTCCG CGTACGCGCG CGTGCGGCCG CGCCGCCTCT CCGGCGGCCA GGCCCAGCGC GTGGCGCTGG CCCGCGCCCT GGCCGTGCGC CCGCGCCTGC TGCTGCTCGA CGAGCCCATG GCCGCCCTGG ACGCCAGCAC CCGCATCGAC GTGCGCGCCC GGCTGGGCCA CCTGCTGGAG GAGTTCGACG GGGCCACGGT GCTGGTCACC CACGACCCGC TCGACGCCAT GGTGCTCGCC GACCGGGTGG CGGTGATCGA GGGGGGCCGG GTCGTCCAGC AGGGCGAGCC CGCCGAGGTG GCGCGGCGCC CCCGCACCGC CTACGTCGCG CGGCTGGTGG GGCTCAACCT CTTCCGGGGC ACGGCCGAGG GCACCACCGT CACCCTGGAC GGCGACGGTC CCGGCGGGCC CGTCCGGGTG GAGGCGCACG AGGCGCACCG GGGACCGGCC CTGGTGGCCT TCCCGCCGCG CGCCGTGGCC CTGTACCCGC ACCGTCCGCA CGGCAGTCCG CGCAACGTGT GGCGGCTGAC GGTGGAGGGG ATCGAGCGGT TCGGCGACCA GGTGAGGGTG CACCTGGCGG GGAACCCCTC CCTGGCCGCC GACATCAGCC CGGCGGCCCT GGCCGAGCTG GGACTGGCCC GGGGGGACGC GGTGTGGGCG GGGGTCAAGG CCGCCGAGGT CGAGTGCTAC CCGGGTTGA
|
Protein sequence | MAPNHRPDET TPDPESGPAA APERTAPTPA APASGPERTS APASASSAAS TPVPALEADL CLSRGAFTLE ASLTVRPGEI LALLGPNGAG KSSALRALAG LVPLTGGRVL VDGRDQTRTP VEHRPIGMVF QDYLLFPHMS ALDNVAFGPR HQGLSRAGAR ERAAELLAHM DLSAYARVRP RRLSGGQAQR VALARALAVR PRLLLLDEPM AALDASTRID VRARLGHLLE EFDGATVLVT HDPLDAMVLA DRVAVIEGGR VVQQGEPAEV ARRPRTAYVA RLVGLNLFRG TAEGTTVTLD GDGPGGPVRV EAHEAHRGPA LVAFPPRAVA LYPHRPHGSP RNVWRLTVEG IERFGDQVRV HLAGNPSLAA DISPAALAEL GLARGDAVWA GVKAAEVECY PG
|
| |