Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_1093 |
Symbol | |
ID | 9244939 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 1342475 |
End bp | 1343824 |
Gene Length | 1350 bp |
Protein Length | 449 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_003679041 |
Protein GI | 297560067 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.672028 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCCGTCGT ACCGCAGACT GCTCCACGAG TACCCCACCG GAGCCCGGCG CCGGCTCCTG CTCGCCGTCG TCGTCCTGGC GTTGTTCATC TCGGCCTTCG AAGGACAGCT CGCACCGGTC CTGCCCCTGC TGCTGGCCGA CCTCGGGCTG TCCCTGGAGG TCTACGGGCT GATCACCGCC GTGTCGCTGC TGTTCGGCGC CGTGTCCGGT TACCTGGGCG GCGAGTTGGT CGACCGGATC GGCCGCGTCC GCGTACTGGT GCCGTTCATG TTCCTGTCCG CGGCGGCGTG CCTGTTCATG GCGCTGTCGC AGACGGTCGT CCACTTCACC GCCGCGCGTA TCCTGCTCGC CTTCGTCGAG GGTGTGGCGA TGGCCGGCAC CCAGCCCCTG ATCCGGGACT TCACGCCGCG GATGGGACGG GCCCAGGCCT TCGCGTTCTG GAGCTGGGGC CCCGTCGGAG CCAACTTCTT CGCGGCGGCC GTCGCCGCGC TGACGCTCGA CCTGTTCGAC AACTCCTGGC GCGCGCAGAT CTTCGTGATG TCCGGCCTGG CCTTCGTGGG CGCGACGGTC GTCGCCCTCA CCCTGCGCGA CCTGGCTCCG AACCTGCGGC GCACCATCCG CCTCACCGAG CATGCGACCC GCGGGAGCGG GGCGGCCCCC GCCGGGCAGC GGCTGCGGCT GCTCCTGCGC CGCCGTGTCG TCTGGGCGCA CGTGGCCGCG ATGTCGCTGC TCTACGTCCT GCTCGCCACC ATGAACGCCT ACGGCCAGAC CATGCTGGTG GACCACTTCG CCGTCGCGGT GCGCACGGCG TCCGCGATCG TGATGTCGTT CTGGGTCAGC AACCTGGCGG CCAGCCTGGT CTTCGCCCGG CTCTCCGACA GGGCGCAGAA CCGCAAGCCG TTCCTGGTCC TCGGGGCGCT GGCGGCGACG CTGCTGCTCG GGGCGCTCGT CGTCACGATG GGCGCGGGGG CGTCCGCCTC ACTGCCGCTG GTCATCGTGC TGCTGACCGG CGTCGGCCTC TCGCTCGGCG CCGTCCTCGG CCCGTGGATG GCCAGCTTCT CGGAGTACAC CGAGGAGGTC CACCCGGACG CCCAGGGGGT GGCCTTCGGC CTCAACCACT TCGTGAGCCG GTTGTTCATC CTCGCCGCGG TGCTGTTCGC TCCGCAGGTG GTCGCGGTGG GCGACTGGCG GGTGTGGATG ACGGTCACCC TGGTGACGAC AGCGGCCTTC GCGGTCGTCA CCACACAGGT CCAGGGCCGC CTGCGGCGCG CGACCGGTTC CGCGTCGGCG GAGGAGGCCG GTGCGGCCGT CGCCGCGACC GGGGCGCCCG AGGACTCCCG GAGTTCCTGA
|
Protein sequence | MPSYRRLLHE YPTGARRRLL LAVVVLALFI SAFEGQLAPV LPLLLADLGL SLEVYGLITA VSLLFGAVSG YLGGELVDRI GRVRVLVPFM FLSAAACLFM ALSQTVVHFT AARILLAFVE GVAMAGTQPL IRDFTPRMGR AQAFAFWSWG PVGANFFAAA VAALTLDLFD NSWRAQIFVM SGLAFVGATV VALTLRDLAP NLRRTIRLTE HATRGSGAAP AGQRLRLLLR RRVVWAHVAA MSLLYVLLAT MNAYGQTMLV DHFAVAVRTA SAIVMSFWVS NLAASLVFAR LSDRAQNRKP FLVLGALAAT LLLGALVVTM GAGASASLPL VIVLLTGVGL SLGAVLGPWM ASFSEYTEEV HPDAQGVAFG LNHFVSRLFI LAAVLFAPQV VAVGDWRVWM TVTLVTTAAF AVVTTQVQGR LRRATGSASA EEAGAAVAAT GAPEDSRSS
|
| |