Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_4430 |
Symbol | |
ID | 9248305 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 5274483 |
End bp | 5275679 |
Gene Length | 1197 bp |
Protein Length | 398 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_003682325 |
Protein GI | 297563351 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.559401 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTACGG AATCGGGAAA AAGGGTGTCG CGCCGCGGCA CGGCCGGCGG CAACCTCGCG TTGGCCACGG CGGCGTTCAC GCTCACCTTC TGGGCGTGGA ACCTCATCGG CCCCCTGTCG CGCACCTACA CCGAGAACCT GGGGCTGACA CCCACGCAGA CCTCGATCCT GGTGGCCTTC CCGGTGCTGG TGGGGGCGTT GGGGCGCATC CCCGTGGGCG CGCTGACCGA CCGCTACGGC GGGCGGCTGA TGTTCACGGC CCTGTGCTTC GCGAGCATCG TGCCGACGTT CCTGGTGGGG GTCTCGGGCG ACTCCTACGG GATGCTGCTG CTGTGGGGAT TCGTCCTCGG CGTCGCCGGG ACGTCGTTCG CGGTCGGCAT CCCGTTCGTG AACGCGTGGT ACCCGGCGAA CCGGCGCGGG TTCGCGACGG GCGTGTTCGG CGCGGGGATG GGCGGCACCG CCCTGTCGTC CTTCCTCACC CCCCGGCTGG TGGGCGCGGT GGGGCTGTTC ACCACCCACC TGCTGCTGTG CGCGGCGCTG GCGGTCATGG GCGTGGTGAT GTGGCTGATG TGCCGGGACT CGCCGGACTG GCGGCCGCGC ACGGAGCCCG CGCTGCCGCG CATGGGGGAG GCCGTGCGGC TGCGGGCGAC CTGGCAGGCG TCGCTGCTGT ACGCGGTCGC CTTCGGCGGT TTCGTGGCCT TCTCGACCTA CCTGCCGACG CTGCTGACGC TGGCCTACGA CTACGCGCAG ACCGCCGCGG GTCTGCGCAC CGCCGGGTTC GCGGTGGCGG CCGTGGCGGC CCGGCCGGTC GGCGGCGTCC TGTCGGACCG GATCGGCCCG GTGCGGGTGT GCCTGATCTC GTTCCTCGGC ACCTGCGTGT TCGCGGTCGT GCTGGCCCTG CACCCGCCCG CGGAGTTCCC GGCCGGTGCC GCGTTCGTGC TGATCGCGCT GTCGCTGGGT TTGGGCACCG GTGCCATGTT CGCGCTGGTG GCCAAGCTGG TGGAGCCGTC GAGGGTGGGC ACGGTCACGG GCCTGGTCGG TGCGGCCGGC GGGCTGGGCG GCTACTTCCC GCCGCTGCTC ATGGGGGCGG TGTACCAGGC GACCGGGGCC TACACGCTGG GCTTCGTACT GTTGGCGGCG GTCGCCCTGG CGGTGGCCCT GTACACCTGG CGGGCCTTCG CCCACGTGCG GGGATGA
|
Protein sequence | MRTESGKRVS RRGTAGGNLA LATAAFTLTF WAWNLIGPLS RTYTENLGLT PTQTSILVAF PVLVGALGRI PVGALTDRYG GRLMFTALCF ASIVPTFLVG VSGDSYGMLL LWGFVLGVAG TSFAVGIPFV NAWYPANRRG FATGVFGAGM GGTALSSFLT PRLVGAVGLF TTHLLLCAAL AVMGVVMWLM CRDSPDWRPR TEPALPRMGE AVRLRATWQA SLLYAVAFGG FVAFSTYLPT LLTLAYDYAQ TAAGLRTAGF AVAAVAARPV GGVLSDRIGP VRVCLISFLG TCVFAVVLAL HPPAEFPAGA AFVLIALSLG LGTGAMFALV AKLVEPSRVG TVTGLVGAAG GLGGYFPPLL MGAVYQATGA YTLGFVLLAA VALAVALYTW RAFAHVRG
|
| |