Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_4177 |
Symbol | |
ID | 9248051 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 4987321 |
End bp | 4988751 |
Gene Length | 1431 bp |
Protein Length | 476 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_003682078 |
Protein GI | 297563104 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.192321 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGCGTA TCGCCATGTT CCGGTCCCTG GCCAACCGCA ACTACCGGCT GTACGCCCTG GGCCAGCTGC TGTCCAACCC CGGCACGTGG ATGCAGCGCA TCGCGCAGGA CTGGCTGGTC CTCCAGCTCA GCGGGGGCAG CGGTATCGCG CTCGGCATGA CCACCGCCCT CCAGTTCCTG CCGCTGCTCC TCTTCGGGCT GTGGGGCGGG GCGCTGGTCG ACCGGCTGGA CAAGCGCAGG CTGCTGGTGT GCACCCAGGC CGCCATGGGC GTGCTCGCGG TCGGCCTCGG AATCCTGGCC ACCGCGGGGG CCGCGCAGGT CTGGCACGTG TACCTGTTCG CCTTCGGGCT CGGCCTGATC ACCGTGCTCG ACAACCCCGG GCGGCAGGCC TTCGTCCCGG AGATGGTCGA GCGCGAGCAC CTGTCCAACG CCATCGCGCT CAACAGCGCC AGCTTCCAGC TCGGCCGGGT GACCGGTCCG GCCGTCGCCG GTCTGCTCAT CGCGGTCATC GGCAGCGGAC CGGTCTTCCT CATCAACGGC GCGTCGTTCG GGTTCACCAT CCTCGCCCTG ATGATGATCC GCACCTCCGA GCTCAACCCC GCCGAGCGGG TGGCCCGGGG CAAGGGGCAG ATCCGGGAGG GGCTGCGCTA CATCGGCGGC AGGCGCGACC TGGTCCTGCT GCTGGTGCTG GCCGCCGCCA CGCAGTTCTT CGGCGCCAAC AGCCAGAACC AGATCGCGCT GATGGTCAAC AACGTGTTCG AGGCCGGAGC CGACGCGTTC GGCGTGGCCG CCGCCTTCCT GGCCGTCGGC GCCCTCGCCG GCGCCCTGCT GGCCGCCCGC CGCGACCGGC CCCGGCTGCG GCTGGTGCTG ATCGGGTCGC TGGCCTTCGG CGCGCTCCAG GTCGTCGCCG GGCTCATGCC CGGCTACGTG CCGTTCGTCC TGGTGCTGGT GCCCATGGGC GTGGCCTTCA TGACGTACGT GACCACGCTC AACGCCACCT TCCAGCTCAG CGTGGACCCG CGGATGCGCG GACGCGTCAT GAGCATGTTC CTGCTGGTGT TCATGGGCGT GGCGCCCATC GGCGCGCCCG TGGTGGGCCT GCTCGCCGAC ACCTTCGGCC CGGAGACGAG CCTGGTCATC GGCGGTGCGG TGACCCTCGT GGTCGTCGCG GTCATCTCCG CCCTGCTCTT CCGCGCCAAG GGCGTGCGCC CGCGCGACGT GCTCCCGGGG CGGCGCCCCT CCTCGGACGC GGAGACCGCC GAGGAGGCGA CCGCCGAGGA GGAGACCGCG GAGACCGGGA GCGAGCGCGA GGAGGCCCCG GAGGCGGAGC GCGAGCGGGT GGACGCCGAG CGCCTCCCGG ACGGCTCCAC CCGGGTCCGC CTCGTCCCGG ACGGGGACGG GACCCGACCG GACACGGTCA CCCGTACCTG A
|
Protein sequence | MRRIAMFRSL ANRNYRLYAL GQLLSNPGTW MQRIAQDWLV LQLSGGSGIA LGMTTALQFL PLLLFGLWGG ALVDRLDKRR LLVCTQAAMG VLAVGLGILA TAGAAQVWHV YLFAFGLGLI TVLDNPGRQA FVPEMVEREH LSNAIALNSA SFQLGRVTGP AVAGLLIAVI GSGPVFLING ASFGFTILAL MMIRTSELNP AERVARGKGQ IREGLRYIGG RRDLVLLLVL AAATQFFGAN SQNQIALMVN NVFEAGADAF GVAAAFLAVG ALAGALLAAR RDRPRLRLVL IGSLAFGALQ VVAGLMPGYV PFVLVLVPMG VAFMTYVTTL NATFQLSVDP RMRGRVMSMF LLVFMGVAPI GAPVVGLLAD TFGPETSLVI GGAVTLVVVA VISALLFRAK GVRPRDVLPG RRPSSDAETA EEATAEEETA ETGSEREEAP EAERERVDAE RLPDGSTRVR LVPDGDGTRP DTVTRT
|
| |