Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_1346 |
Symbol | |
ID | 9245196 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 1653646 |
End bp | 1654650 |
Gene Length | 1005 bp |
Protein Length | 334 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | |
Product | Bile acid:sodium symporter |
Protein accession | YP_003679284 |
Protein GI | 297560310 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGCTGG CCGAACGCCT CCAGAGCCTG TTCGTGGCCC TGGCCGCCGT GGCGGGGCTG GCCGCGGGCC TGCTGCTGCC CGTCGGTCCG GCGGCCGAGC ACGTGGTGCT GCCCGCCCTG CTGGCGATGC TCACCGCCGT GTTCGCGCAG ATGGACGCCG CCCACGTGGG CGAGGTCGGG CGCGCCAGGA CCCTGGTCGC CGTCAGCCTG GCGCTCAACT TCGTGTTCAC CCCGGCCCTG GCCTGGGCCC TGGGAGCGGG GCTGCTCGGG GGCGAACCGG ACCTGCGCAT CGGTCTGCTG TTGCTGCTGG TGACCCCGTG CACGGACTGG TACCTGGTCT TCACCGCCGT GGCGCGCGGG CACACCGGCA TCGCCGCCGC CCTGCTGCCG GTCAACCTCG TCCTCCAGCT CGCGCTGCTG CCGGTGTACG TGCTGCTGCT GGGCGGCCGG GCCGCGATGG TCGACGCCGC CACCCTGGCC GAGTCGGTGC TGCTCGTCCT CGTCGTCCCG CTGGCCCTGG CCTCGGTGCT GCGGTGGGCC TCATACCGGT TCAAGGGGGC CGCCTGGCGC GAGCGGTACG TCACCGGTCC GGCCTCGCGC CTGGTCCTGC CGCTGCTGTA CGCGGCCGTG CTGGCGATGT TCGCCTGGCA GGCCCGCACC GTCCTGGAGC ACGGCGCCGA CCTGCTCGCC CTGCTGCCGC CGCTGGCGGT CTTCTTCGTG GCGCTGCCCC TGATCGCCAC CGGCCTGTCC CGGATGCTGC GCCTCCCGGC GGACCAGGGG GTCACCCTGG TCATGACCAC CACGGCCCGC AACTCGCCCA TCGCGCTGGC CGTCGCGGTC GCGGCCTTCC CCGACCGGCC CCTGATCGCG GTGGCGCTGG TCGTGGGACC GCTGGTGGAA CTGCCCGTGC TGGCCCTGCT CGCGCAGCTG GTGAGGGTAC GGCCTCCCGC CGCGAGCGGG TCCGCTCCGG CCCGGACCCG CCAGGGGCAC GGGCGCGAGC GCTGA
|
Protein sequence | MSLAERLQSL FVALAAVAGL AAGLLLPVGP AAEHVVLPAL LAMLTAVFAQ MDAAHVGEVG RARTLVAVSL ALNFVFTPAL AWALGAGLLG GEPDLRIGLL LLLVTPCTDW YLVFTAVARG HTGIAAALLP VNLVLQLALL PVYVLLLGGR AAMVDAATLA ESVLLVLVVP LALASVLRWA SYRFKGAAWR ERYVTGPASR LVLPLLYAAV LAMFAWQART VLEHGADLLA LLPPLAVFFV ALPLIATGLS RMLRLPADQG VTLVMTTTAR NSPIALAVAV AAFPDRPLIA VALVVGPLVE LPVLALLAQL VRVRPPAASG SAPARTRQGH GRER
|
| |