Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_2744 |
Symbol | |
ID | 9246595 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 3284777 |
End bp | 3285763 |
Gene Length | 987 bp |
Protein Length | 328 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | Bile acid:sodium symporter |
Protein accession | YP_003680663 |
Protein GI | 297561689 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCAGTTC TCACGCGCGT CGCCGACTTC GTCGGCAGGT GGTTCGCCCT GCTGGTCCTC GCCGGAGGGA TCGCGGGCCT GGCCGCCCCC GGGCAGGCCG CCCTCCTCGC CCCCTACATC TCCCTGCTGC TCGGGGTGAT CATGTTCGGC ATGGGGCTCA CCATGCGCCC CGTGGACTTC GCGATCGTGG CCAGACATCC CAAGGCCGTC GTCCTCGGCG TCCTGGCCCA GTACACGGTC ATGCCGCTGC TGGGCTGGGG GATCGCCCAC CTCCTCAACC TGCCGCCGCT GCTCGTGGTG GGCATGATCC TGGTCGGCTC CTCCCCCGGC GGCACCGCCT CCAACGTCAT CGTCTACCTC GCCCGCGGCG ACGTGGCCCT GTCGGTGGCG ATGACCTCGA TCTCCACCCT GATCGCCCCG GTGCTGACCC CGCTCCTGGT CCTGGCCCTG GCCGGTTCCA CCCTGCCCGT CGCCGCCGGC GACCTGTTCG TCTCCATCCT CCAGGTCGTC CTGGTCCCGG TCCTGGCCGG ACTGCTGCTG CGCATGGCCG CGCGACGGTT CGTGGAGAGG GTCCTGCCCG TCCTGCCGCT GGTGTCCGTC CTCGGCATCG TCGTCGTGGT CGCCGCGGTG GTGGGCGCCA ACGCCGACGC CGTGCTCTCC TCCGGCCTCC TGGTCGCCCT GGCCGTGGTG CTGCACAACT CCCTGGGCCT GACGCTCGGC TACCTGCTCG GCGTGGTCAC CAAGCTGCCC GAGACCGCCC GCCGCGCGGT CAGCGTCGAG GTGGGGATGC AGAACTCCGG TCTCGCCGCG GCCCTGGCGA CCGCCCACTT CGCCCCGCTC GCCGCCCTGC CCGGCGCCCT GTTCTCGGTC TGGCACAACA TCTCCGGCGC GCTCGTGGCC ACCTACTGGG CCCGCCGCCC GCCCGCGGAC GTCCCGGCCG AGTCCGAGCC GACGGGGAGC GGCACGGAGG GCTCGACCGG GGCCTGA
|
Protein sequence | MSVLTRVADF VGRWFALLVL AGGIAGLAAP GQAALLAPYI SLLLGVIMFG MGLTMRPVDF AIVARHPKAV VLGVLAQYTV MPLLGWGIAH LLNLPPLLVV GMILVGSSPG GTASNVIVYL ARGDVALSVA MTSISTLIAP VLTPLLVLAL AGSTLPVAAG DLFVSILQVV LVPVLAGLLL RMAARRFVER VLPVLPLVSV LGIVVVVAAV VGANADAVLS SGLLVALAVV LHNSLGLTLG YLLGVVTKLP ETARRAVSVE VGMQNSGLAA ALATAHFAPL AALPGALFSV WHNISGALVA TYWARRPPAD VPAESEPTGS GTEGSTGA
|
| |