Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_4127 |
Symbol | |
ID | 9248001 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 4928571 |
End bp | 4930256 |
Gene Length | 1686 bp |
Protein Length | 561 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | |
Product | SSS sodium solute transporter superfamily |
Protein accession | YP_003682028 |
Protein GI | 297563054 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACCTCG CCCAGGAAGC TGTCAAGACG GTCAGCGACA GCAGCCGCAT CATCACCATC GTCCTCTTCG TCCTCATGAT CGCGGCCACG CTCGGCATCA CCGTCTGGGC CGGCCGCAGC ACGCGCTCCG CCACCGACTT CCATTCGGGC GGGCGCGGCT TCTCCCCGCT CCAGAACGGC CTGGCGATCG GCAGCGACTA CATGTCCGCC GCGTCCTTCC TGGGCATCGC GGGCATGATC GCGCTGTTCG GCTACGACGG CTTCCTCTAC TCCATCGGGT TCCTCGTCGC CTGGCTCGTC GCCCTCCTGC TGGTCGCGGA GCTGCTGCGC AACTCCGGCC GCTACACCAT GGGTGACGTG CTGTCCTACC GGATGCGGCA GCGCCCGGTG CGCACCGCCG CGTCGGTCTC CACGATCGTG GTGTCGATCT TCTACCTGCT GGCGCAGATG GTCGGCGCGG GCGCGCTCAT CGCCCTGCTG CTGGGCATCC AGCCCGGGCA GACCTTCATG GGCATGGACG CCGCCACCGC CAAGATCGTC GGCATCGTGG TCATCGGTCT GCTGATGACC ATCTACGTGA CCTTCGGCGG CATGAAGGGC ACCACCTGGG TGCAGATCAT CAAGGCCCTC ATCCTCATGC TGGGCGCCGC CCTGCTCACC GTGCTGACCC TGTCCCTGTA CGGGTTCAAC CTCGGCGCGC TGATGAGCGA CGCCGCCGCG GCCAGCGGCC AGGAGGGCTT CCTGGAGCCC GGCCTGCGCT ACGGGGTGGA GGTCGCGGGC GACCCGCTCC AGACCCTGTG GAACAAGCTG GACCTGATCA GCCTGGGCCT GGCCCTGGTG CTGGGCACGG CCGGCCTGCC GCACATCCTC ATCCGCTTCT ACACCGTCCC CGACTCCCAG AGCGCCCGCA AGTCGGTCAA CTGGGGCATC GGCCTCATCG GCACCTTCTA CCTGATGACC CTGGTGCTCG GCTTCGGCGC GGCGGCCATC GTCGGCCACG AGGCCATCAC GGCGCAGGAC GCCGCGGGCA ATACCGCGGC CCCGCAGCTG GCCCAGGTGG TCGGCGAGCG CATCGGCGGC GAGATGGCGG GCGCGGTCCT GCTGGCGATC ATCGCCTCGG CGGCCTTCGC GGCCATCCTG TCCACGGTGG CGGGCCTGGT CATCGCCTCC TCGTCCTCCA TCGCGCACGA CTTCTACAAC TCGGTCCTGC GCAAGGGCAG GGCCACGGGC GCCGAGGAGG TCCGGGTCGC CCGCCTGTCC GCCCTGGGCG TCGGCGTGGT CTCGATCCTG CTGGCGGTGT TCGCACAGAA CCTCAACGTG GCCTTCCTGG TCTCGCTGGC GTTCGCGATG GCGGCCTCGG CGAACCTGCC GACCCTGCTG CTCAGCCTGT TCTGGAAGAG GTTCAACACC CGCGGCGCCC TCGCGGGCAT CTACGGCGGC CTGATCAGCG CCGTGGGCCT GGTGTTCTTC TCGCCCGTGG TCTCCGGCAG CGAGACCGCG CTCATCCCGA CCGCGGACTT CGCCTGGTTC CCGCTGCCCA ACCCGGCCCT GGTGTCGGTG CCGATCAGCC TGCTGTGCGC GATCGTGGGA ACCCTGTCGT CCAAGGAGCG CGACTTCGAC AAGTTCGCCC AGCTCCAGGT CCGCGCCCTG ACCGGCGCGG GCGCGGAGAA GGCGGCCAAC CACTAG
|
Protein sequence | MNLAQEAVKT VSDSSRIITI VLFVLMIAAT LGITVWAGRS TRSATDFHSG GRGFSPLQNG LAIGSDYMSA ASFLGIAGMI ALFGYDGFLY SIGFLVAWLV ALLLVAELLR NSGRYTMGDV LSYRMRQRPV RTAASVSTIV VSIFYLLAQM VGAGALIALL LGIQPGQTFM GMDAATAKIV GIVVIGLLMT IYVTFGGMKG TTWVQIIKAL ILMLGAALLT VLTLSLYGFN LGALMSDAAA ASGQEGFLEP GLRYGVEVAG DPLQTLWNKL DLISLGLALV LGTAGLPHIL IRFYTVPDSQ SARKSVNWGI GLIGTFYLMT LVLGFGAAAI VGHEAITAQD AAGNTAAPQL AQVVGERIGG EMAGAVLLAI IASAAFAAIL STVAGLVIAS SSSIAHDFYN SVLRKGRATG AEEVRVARLS ALGVGVVSIL LAVFAQNLNV AFLVSLAFAM AASANLPTLL LSLFWKRFNT RGALAGIYGG LISAVGLVFF SPVVSGSETA LIPTADFAWF PLPNPALVSV PISLLCAIVG TLSSKERDFD KFAQLQVRAL TGAGAEKAAN H
|
| |