Gene Information |
Locus tag | Ndas_4197 |
Symbol | |
ID | 9248071 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 5012386 |
End bp | 5014326 |
Gene Length | 1941 bp |
Protein Length | 646 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | |
Product | ABC transporter related protein |
Protein accession | YP_003682096 |
Protein GI | 297563122 |
COG category | |
COG ID | |
TIGRFAM ID | |
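The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends with a stop codon that is not counted in the protein length; both assumptions are consistent with the values in this record but are not stated by it. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Ndas_4197.
length = gene_length(5012386, 5014326)
print(length, protein_length(length))
```

Under those assumptions the result agrees with the listed gene length of 1941 bp and protein length of 646 aa.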
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.329748 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCGGG CCGCCGACAA CGCCGAGACC CTCGGCCCGC CGCAGGCGTC CGGAGTGTTC GAAACGCCCC GAGCGCCCGA GGCGGCCCGC GCACCCGGAG CGCCCGCCGA TCCCGCCGCG TCCGCCGCTC CCGCGGAGCA CCGCGCGGCG GGACTGCCGA TCGCGCCGGT CGCTGCCGTG TGGCGGCGGC TGCGCCGTGC CGGGCGCGAG CACGGACGGC TGCTCGCCGT CGTCGTCCTG CTGTACGGGA CGGCCGCGCT CACGGCGCTG GCCTCGCCCT GGATCCTGGG TCTGATCATC GACACCGTGC GCGCCGAGGG GGCGCCGTCC GCGGCCTCGG AGCAGGCGGC CTCGCGCGTC GACGTGCTCG CCGGGCTCAT CGTCGCGGCG CTGGTGCTGC ACGCCGGGTT CACCCTGGCC TCCGTGGCGG CCTCGATCCG GTTCGGCGAG AGCGTGCTGG CCGAGCTGCG CGAGGAGTTC GTGCGCGCGG TGCTCCGGCT CCCGATGGGA GTGGTGGAGC GGGCGGGCAC GGGCGACCTC GTGGCGCGCA CCGGCCGCGA CATCGGCCAC CTGAGCCACA CCGTGCGCGT TTCGGTGCCG GTGATGGCGG TGAGCACGGT GACCCTGGTG GTCGTCACCA CGGCGCTCAT CGTCCTGCAC CCTCTCCTGC TGCTGGCGTG GCTGCCGTCG GCGCCCGTGC TGTGGCTGTC CACGCGCTGG TACGCGCGCA GGGCCCCGGA CGGGTACGTG CGCGAGCTGG GCACCTACTC GGAGCTGACC CAGAGCGTGA CCGACACCGT GGAGGGCGCG CACACCATCG AGTCGCTGGG ACGCCAGGCC CGGCGGATCG CGCTCAACGA ACAAAGGGTG GGGCGGGCCT ACGCGGCGGA GCGGTACACG CTGTGGCTGC GCTGCGTCTG GTACCCGCCG CTGGAGTTCG GGTACATGTT CCCGATCGCG CTGACCTTCC TGGTGGCTGG CCTGCTGTAC GCCGACGGAG CGCTGAGCCT GGGCGGGATC GCCACCGCGG TCTTCCTCAG CAGGCAGATG GCGCGGCCCC TGGACCAGCT GCTGGACCAG GTGGACAGCC TGATGATGGG CTTCACGAGC ATGCGCAGGC TGCTGGGCGT GGAGCTGGCC GGGGAACCGG AGGGGAGGGC GCGCTCCGCG GCGGACGCGG CCGGGACCGC CCGGCCCGGC GAGGTGCGGG TGGAGGACGT GCGCTTCGCC TACACCGACG CGGAGGTCCT GCACGGCGTC GACCTGGTGC TGGCGCCCGG TGAACGCCTG GCCGTGGTGG GTCCGAGCGG CGCGGGCAAG TCGACCCTGG GCAAGCTGAT CGCGGGCGTG CACCCGCCCA CCTCGGGCGC CGTCCGCGTG AGCGGTGCGC CGGTGTCGGG CCTGCCTCCC GAGGAACGGC GCGCGCGGGC GATCCTGCTG AGCCAGGAGA GCCACATGTT CCGGGGCACG ATCGCCGAGA ACCTGGCGTT GGCGCTGGAC CGGCCCGAGG GCGCGGCCGA GGTGGACGAG GAGCGCCTGT GGGAGGCGCT GGCCGCCGTG GACGCCGAGC CCTGGGTGCG CGCGCTGCCG GAGGGGCTGG GCACGCGGGT GGGGTCGGGC CACGCCCCGC TGGACCCGGC GCACGTGCAG CAGCTGGCCC TGGCCCGCGT GGTGCTGGCC GACCCCGACG TGCTGGTGCT GGACGAGGCC ACGTCCCTGA TGGACCCGCG TTCGGCCCGC CACCTGGAAC GCTCGCTGGC GGGGGTGCTG TCGGGGCGCA CGGTGGTGGC CATCGCGCAC 
CGGCTGCACA CCGCGCACGA CGCCGACCGG ATCGCGGTGG TGGAGGACGG CCGGATCAGC GAGCTGGGCA GCCACGACGA GTTGCTGGCC CGCGGCGGCT CCTACGCCGA CCTGTGGCGG GCCTGGCACG GCGAGGACTG A
|
Protein sequence | MTRAADNAET LGPPQASGVF ETPRAPEAAR APGAPADPAA SAAPAEHRAA GLPIAPVAAV WRRLRRAGRE HGRLLAVVVL LYGTAALTAL ASPWILGLII DTVRAEGAPS AASEQAASRV DVLAGLIVAA LVLHAGFTLA SVAASIRFGE SVLAELREEF VRAVLRLPMG VVERAGTGDL VARTGRDIGH LSHTVRVSVP VMAVSTVTLV VVTTALIVLH PLLLLAWLPS APVLWLSTRW YARRAPDGYV RELGTYSELT QSVTDTVEGA HTIESLGRQA RRIALNEQRV GRAYAAERYT LWLRCVWYPP LEFGYMFPIA LTFLVAGLLY ADGALSLGGI ATAVFLSRQM ARPLDQLLDQ VDSLMMGFTS MRRLLGVELA GEPEGRARSA ADAAGTARPG EVRVEDVRFA YTDAEVLHGV DLVLAPGERL AVVGPSGAGK STLGKLIAGV HPPTSGAVRV SGAPVSGLPP EERRARAILL SQESHMFRGT IAENLALALD RPEGAAEVDE ERLWEALAAV DAEPWVRALP EGLGTRVGSG HAPLDPAHVQ QLALARVVLA DPDVLVLDEA TSLMDPRSAR HLERSLAGVL SGRTVVAIAH RLHTAHDADR IAVVEDGRIS ELGSHDELLA RGGSYADLWR AWHGED
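The sequences above are printed in space-separated blocks of ten characters, so they need light cleanup before any analysis. The following is a minimal sketch of that cleanup plus two basic checks (GC fraction and whether the sequence looks like a complete CDS). The helper names are hypothetical, and the stop codon set is the standard one used by translation table 11; only the first three blocks of the gene sequence are used as sample input.

```python
# Stop codons for the bacterial genetic code (translation table 11).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(text: str) -> str:
    """Remove all whitespace from a blocked sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# First three blocks of the gene sequence, as an illustration only.
first_blocks = clean_sequence("ATGACCCGGG CCGCCGACAA CGCCGAGACC")
print(len(first_blocks), round(gc_fraction(first_blocks), 3))
```

Running the same functions on the full gene sequence would allow a comparison against the GC content and gene length fields in the record; the start codon is deliberately not checked because table 11 permits alternative start codons.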