Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_2808 |
Symbol | |
ID | 9246659 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 3353697 |
End bp | 3354893 |
Gene Length | 1197 bp |
Protein Length | 398 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_003680726 |
Protein GI | 297561752 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.794185 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.146653 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAACGTCC AGACCAGAAC CGAGTCCCCT CCGCCGGAGG CCCGCAGGGC CAGAGTCGCG GTCTCCACCC TCTTCTTCGT CAACGGCTTC ACCTACACCA ACGCCGTCCC GTGGCTGCCG GTGCTCAAGG CCCAGCTGGG GCTGAGCAAC ACGGAACTGG GCCTGGCGAT CGCGGCGATG CCGACCGGCG CGATCCTGAC CGGCATGCTG GCGGGCCCGC TGATCCACTG GTTCGGCAGC GGCCGGACGG CGGTGGGGAC CAGCCTCATC TCTCTGGGAG CGCTCCCATT CATCGCGCTG GCGCAGAACT GGTGGATGCT GGCGGCGGCG CTGTTCGTGC TGGGCAGCGC GGACGCCTGG ACCGACTCGG CGCAGAACTC ACACGGCCTG CGCGTCCAGC GGCGCTACAG ACGCAGCATC ATCAACACCT TCCACGCGCT GTGGAGCATG GCCGCGGTGG CCGGAGGCCT CCTGGGCGCG GCCATGGCCG GTACCGGCGT GCCCATCCTG TGGCACCTGG GCGGCGTGGC CGCGGTGCTG GTGTGCGTGA ACCTGGCGGT GAGCCGGATG CTGCTGCCCG GCCCGGAGAG CAGCGAGCGC GAGGACGGCA CGGACGCCGG GAGCGGGCGA CGCCTGCGCG TGCCCGGACG GGCCGTGCTG CTCCTGCTGG TGCTGAGCGT GCTGCTGATG TTCGCCGGAG GCATCGAGGA CTCCGCCGCC ACCTGGGGCG CGGTGTACAT GACCTCGGAA CTGGAGGCGT CCCTGTTCCT GGCGGCGATG CCGTTCGTGG CCTGCCAGGC GATGATGACG CTGGGTCGCC TGGCCGGCGA CCGGGTGACC GACCGGTTCG GCGCTGCCGC CGTGGGACGC GCGTGCGGTC TGCTGGCGGG CGGCGGAATC GCGTTCGCCC TGCTGGTACC GAACCCGGTG GCCACGGTCA TCGGCTTCGG GGTGATGGGA CTGGGCGTGT CCACGCTCTT CCCGCTGACC CTGGCGGCGG CGGGGAACGT GCCGGGCGTG CGCACCGGGG ACGGGATCAC CGTCGTCGGG TGGCTGGGCC GCGCGGGCTT CCTGGCCTTC CCGCCGCTGG TGGGCTTCCT CGCCGACTCC TCCAGCCTGG GGAACGCGCT GTGGGTGATC GCGGGTGCCG GTGTGGGCGC CTTCCTGCTC GCCTTCGCCC TGCGTCCGCG CGTGTGA
|
Protein sequence | MNVQTRTESP PPEARRARVA VSTLFFVNGF TYTNAVPWLP VLKAQLGLSN TELGLAIAAM PTGAILTGML AGPLIHWFGS GRTAVGTSLI SLGALPFIAL AQNWWMLAAA LFVLGSADAW TDSAQNSHGL RVQRRYRRSI INTFHALWSM AAVAGGLLGA AMAGTGVPIL WHLGGVAAVL VCVNLAVSRM LLPGPESSER EDGTDAGSGR RLRVPGRAVL LLLVLSVLLM FAGGIEDSAA TWGAVYMTSE LEASLFLAAM PFVACQAMMT LGRLAGDRVT DRFGAAAVGR ACGLLAGGGI AFALLVPNPV ATVIGFGVMG LGVSTLFPLT LAAAGNVPGV RTGDGITVVG WLGRAGFLAF PPLVGFLADS SSLGNALWVI AGAGVGAFLL AFALRPRV
|
| |