Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_3371 |
Symbol | |
ID | 9247236 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 4027486 |
End bp | 4029027 |
Gene Length | 1542 bp |
Protein Length | 513 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_003681282 |
Protein GI | 297562308 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGATAG AAGCCACGGA TCTGCAGGGC GCGCGGAGTG CTCCCGCCCC GTTGACCAGG GGACAGGCGG TGGCCACACT CGTCGCCGTC GCCCTGTCCA GCGTGATGCT GCCGCTCGCC GTCACCGCTC CGGCCGTGGC GCTCACCCAG CTCGCGGCCG ACCTGAACGC CAGCGTCGGC GAGGCCCAGT GGGTCCAGAA CGCCTACAAC GTCACCTTCG CGGCGTTCAT GCTCGCCGCC GGCGGACTCG CCGACCGCTT CGGCAGGCGC CGGGTCCTCG TCATCGGCCT CGTCGTCTTC ACCGCGATGG CCACGGTGAT CGGCCTCTCC TCCAACATCC TCGTCATCGA CGTCGCCCGC GCGGTCCAGG GCATCGGCGC CGCGGGCATC ATGACCAGCG GATCGGCGAT CCTCGCCGAC TCCTTCCGGG GAGCGGCCCG GGCCCGGGCC TTCGGCCTCC TGGGGACCTC CTTCGGCTTC GGACTGGCCA TGGGGCCCTT CGTCGCCGGG CTGATGGTCA ACTTCCTGGA CTGGCGCATG GTGTTCCTGA TGAACCTCGC GTTCGCCGCC GTCGTCCTGC TCCTGGTCCG CTCGATCCGT GAGTCGAGCG ACCCCGGTTC CACCTCGGTG GACTGGGGCG GCGTCATGAC CTTCAGCACC AGCCTGTTCC TGCTCTCCCT CGCCTTCGTC CAGGGCGCCG AGGCGGGTTG GCTCAGCCTG AGCGCGATCG GGTCGGCGGT CGGGTTCCTC GTCTTCCTCG CCGCCTTCGT CGTGGTGGAG TCACGGGTCC GGCGCCCGAT GTTCGACCTC TCGCTGTTCA AGCGGCCCAC GTTCGTCGTG GTCGTCTGCC AGCCCTTCAC CATCACCTTC GGCTTCGTGG TCCTGCTCGT CTACCTGCCG CCCTTCTTCC AGGGCGTCGG CGGGTTCGGC GCCGCCGAGG CCGGGGCCCT GCTCCTGCCC CTGACCCTGC CGGTGCTCGC CCTGCCGATG CTGGCCGGGC AGCTCGCCGC CAGGCTCCCC CTGCGGGTCA TGCTCGCCAC CAGCTCCCTG CTCATCGCGG GCGGCTCCCT GTGGTTGATG ACCCTCCAGC CGGGCCAGCA CTGGACGGCG CTCGCCGCTC CCCTGGCCCT GTTCGGCACG GGGGTGGGAA GCGCCTTCGG CGTCATGGAC AACGCGGCGC TCAGCTCCGT GGAGGTCGAG CGGGCCGGGA TGGCCTCGGG CATCTTCAAC ACCATGCGCA TCACCGGGGA GAGCGTGGCG ATCGCCGGAG CCGGGTCCGT GCTGGCGACC CTGTCCCTGA ACACGCTCGA CCTCCCCTTC GCCGACCCCG AGCAGGAGCG GACCCTGGCG GGTGAGGCCA CCCAGGGCCG ACTGGAAACG GCGCTGGGCC AGTTCGCCGA GGCGGACCGC TCGACCGCGC TGGACGCCGT CTCGGCCAGC CTCACCTCCG CGATGCACAC CACGTTCCTG GGGCTGGCGC TCCTCGCCCT GGCCGGCGCG GTGGTCACGT TCCTCGTCGT CAGGGAACGC GAGCTCCGCT AG
|
Protein sequence | MAIEATDLQG ARSAPAPLTR GQAVATLVAV ALSSVMLPLA VTAPAVALTQ LAADLNASVG EAQWVQNAYN VTFAAFMLAA GGLADRFGRR RVLVIGLVVF TAMATVIGLS SNILVIDVAR AVQGIGAAGI MTSGSAILAD SFRGAARARA FGLLGTSFGF GLAMGPFVAG LMVNFLDWRM VFLMNLAFAA VVLLLVRSIR ESSDPGSTSV DWGGVMTFST SLFLLSLAFV QGAEAGWLSL SAIGSAVGFL VFLAAFVVVE SRVRRPMFDL SLFKRPTFVV VVCQPFTITF GFVVLLVYLP PFFQGVGGFG AAEAGALLLP LTLPVLALPM LAGQLAARLP LRVMLATSSL LIAGGSLWLM TLQPGQHWTA LAAPLALFGT GVGSAFGVMD NAALSSVEVE RAGMASGIFN TMRITGESVA IAGAGSVLAT LSLNTLDLPF ADPEQERTLA GEATQGRLET ALGQFAEADR STALDAVSAS LTSAMHTTFL GLALLALAGA VVTFLVVRER ELR
|
| |