Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_2701 |
Symbol | |
ID | 9246552 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 3221341 |
End bp | 3222852 |
Gene Length | 1512 bp |
Protein Length | 503 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_003680622 |
Protein GI | 297561648 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.417248 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.482514 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACTAGAG AATCGAAACC GCGGAGCCAG ACGACCCCTG CGCGGGCCGA GGGTTACCGC CAGGCCCGCA GCGCCACCAC GCTGGCCCGA TCAGCCGCAG GGGAGGGGGC GCCGCCTGCG ACGCTGGGCA CGGTCGGACT CCTGACCGCG TTGTCCGCCC TGCTGCTGTC GGTGATGAGC TTCGCCGCCG CGGGAATCGC CGTACCGAGT ATCGGCGCCT CGCTGCACGC TTCGGCGTCC GAGCAGTCGT TGGTGGTGTC GGTCTACTCC CTGGGCTTCG CCGCGCCGAT GGTCGTCGGC GGGCGTCTGG GCGACCTGTA CGGCAGGCGG CGGCTCTTCC TGTTCGGCAT GGCCGGATTC ACCGCGTTCT CGCTGATGGC GACGCTCGCG CCGACCATTG CCGTGCTGAT CGTCGCCCGC GCGCTCACCG GCGTGTCGGC GGCGGCGATG GTTCCCCAGG TGCTCGCGAC GATCACGGCC TCCACGCATG GACGCGAGCG TGCCCGGGCG GTGGCGTTGT TCGGGGCGAC CGCGGGCGGC GCGACGGCGG TGGGTCAGGT CCTCGGCGGC GTTTTGCTGT CGGTCCCCCT GCTCGGCTCC CCCTGGCGCA CGGTCTTCGC GATGAGTGTC CTCATGGGCG CCGTCGCGTT CCTCGCCGCT CTGCGCTGGA TGCCCAGCAC CGACGCACCG GGCGATCGTT CGCTGGACCT GGTCGGGACC GCGTTGCTGG GGGTATCGCT GCTCGCGCTG ATGATCCCGT TGTCCCAGGG CGGTGCGCTC GGTTGGCCCG GGTGGTGCTG GGCACTGCTG GCGGCCAGCC CGGTGGCGTT CGCGGCGTTC TGGACGCGGC AGCTCCGACT GCACCGCCGC GACCTGGTCC CGCTCGTTCC TCCGCCGCTC CTGCGTCTGA GGTCGTACCG GCTCGGCCTC ATCATGGCCC TCCTGCTCCA GTCGGCCTTC GGCGCGTTCA CGTTCCTCTA CGCGCTCTCC ACGCAGACGG GTCTGGGCTG GTCCCCGATG GGTGCGGCCC TCGTGCTGCT GCCGTTCGCA CTGTGCTTCT TCGCCGTGTC GATCTGGTCG GGAAAGCTGG CGCCCCGTTT CGGATTCCGC CGTCTGCTGA CGATCGGCGG GTTCGTCCAG GCGGCGATGC TGGTGGCGAC CGCGGCATCG GTGCTCATGC GGGGCCCGGG TATGAGCGGG TGGACGCTGG GAGCTCTGCT GGTCGGAGTC GGGGTCGGTC AGGCGCTCAT GTTCGGTCCG CTGGTCGGGG CGATGATCGC CGACGTCCCG CCCTCCTCGG CAGGAGCGGC CTCCGGGGTC ATCCAGACCG CGCAGCAGGC CGCCATGGGG CTCGGAGTCG CGGTCGCCGG AGGGGTTCTG GGTACTGCGA TGGCCGGTTC CACCGCCCCG CCCGGGCAGG ACTACATGAC GGCACTCGCG ATCTGCATGG TCGTCCAGGC CGCGTTCGCG ATCGCCTTCG CCCTCCTCGC CTTCGCCCTG CCCAGGCGCT GA
|
Protein sequence | MTRESKPRSQ TTPARAEGYR QARSATTLAR SAAGEGAPPA TLGTVGLLTA LSALLLSVMS FAAAGIAVPS IGASLHASAS EQSLVVSVYS LGFAAPMVVG GRLGDLYGRR RLFLFGMAGF TAFSLMATLA PTIAVLIVAR ALTGVSAAAM VPQVLATITA STHGRERARA VALFGATAGG ATAVGQVLGG VLLSVPLLGS PWRTVFAMSV LMGAVAFLAA LRWMPSTDAP GDRSLDLVGT ALLGVSLLAL MIPLSQGGAL GWPGWCWALL AASPVAFAAF WTRQLRLHRR DLVPLVPPPL LRLRSYRLGL IMALLLQSAF GAFTFLYALS TQTGLGWSPM GAALVLLPFA LCFFAVSIWS GKLAPRFGFR RLLTIGGFVQ AAMLVATAAS VLMRGPGMSG WTLGALLVGV GVGQALMFGP LVGAMIADVP PSSAGAASGV IQTAQQAAMG LGVAVAGGVL GTAMAGSTAP PGQDYMTALA ICMVVQAAFA IAFALLAFAL PRR
|
| |