Gene Information |
Locus tag | Ndas_2667 |
Symbol | |
ID | 9246518 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 3175834 |
End bp | 3177654 |
Gene Length | 1821 bp |
Protein Length | 606 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | |
Product | choline/carnitine/betaine transporter |
Protein accession | YP_003680590 |
Protein GI | 297561616 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
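The coordinate and length fields above can be checked against each other. The sketch below is a minimal consistency check, assuming the start and end positions are 1-based and inclusive and that the CDS ends in one untranslated stop codon. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 3175834
end_bp = 3177654

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: one codon per residue, plus a final stop codon
# that does not contribute an amino acid.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints "1821 606"
```

Under these assumptions the computed values agree with the listed Gene Length (1821 bp) and Protein Length (606 aa).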
Plasmid Coverage Information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage Information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0383388 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCTGATCC GTCTACACGA CGCCCTGCGG CTGCGTACCT CGCCGCCGGT ATTCTTCGGC GCGGCCGCGG TGGTCATCGT GTTCGTGCTT GTCACCATCA TCTTCACCGA GCCGCTGGAT GCAGCCGTCA CCGTCGCCTC GGACTGGCTG TACGCCAACC TGGGCTGGTT CTACATTCTC GGCCTCACCT TGTTTCTGGT TTTTCTGGTT TATGTTGCCG CCAGCCGATT CGGGCGGGTG AAGCTGGGCC CCGACGACGA AGAGCCCGAG CACTCCGGTC CGGCTTGGTT CGCCATGCTC TTCGCCGCTG GCATCGGTAG CATCCTGATG TTCTGGGGCG TGGCCGAACC CATCAGTCAT TTCGGTGATC CGCCGCGGGG TCCATCCCTG GGCGTGGAGC CGGAGACAGC AGCTGCTGCC GCGGACGCGA TGAACTTCAC GCTCTATCAT TTCACCCTGC ACACCTGGGC GATCTTCACC CTCCCAGCGC TGTGCTTCGC CTACTTCATC CACAAGCGGA ACCTGCCTCC GCGTGTCAGC TCAATCTTCC AGCCGATTCT CGGTGAGGGG ATCCACGGGC CGATCGGTAA GTTCATCGAC ATCGTCGCCA TCGTCGGCAC GGTCTTTGGC GTCGCGGTCT CTCTCGGACT GGGTGCTCTG CAGATCAACA GCGGAGTCAA CCGTGTGCTC GGCATCCCGG AGAACGCCGT GTGGCAACTG GTCATCATCG GCGTCGTCGG CGGGGCCGCG ATGATCTCCG TCGCGTTGGG CCTGGACCGC GGCATCAAAC GCCTGTCCAA TATAAACATT TGGATGGCCG TGGGTCTGCT GGTGTTCATC CTGCTGACGG GTTCCACTCT CTTCGTGCTC CAGGGCACCA TCGAAGCGCT GGGCCGTTAC ATAGTGAATC TGCCGGAACT GGCCTTGTGG AACGACACCT TCGCCGACAC CGGTTGGCAG TCCAACTGGA CAGTGTTCTA TTGGGCCTGG ACGATCAGCT GGTCACCGTT CGTGGGCATC TTCATCGCCC GGATCTCCAA AGGCCGCACC ATCCGTCAGT TCATCACCGG AGTACTCCTC ATTCCGTCCG GTTTCTCGGT GCTGTGGTTC GGCATCTTCG GGTTCTCCGC TTTCGACATC GAGCTCAACG GTGAGGGCGG CCTGGTCGAA AGTGTCGTGG AGCAGCAGGA CATCCCCGGG GCGCTGTTCA CGTTCTTGGA GCACTACCCC GCCACCCCCT TCACCTCGGT GCTCGCCATC ATCCTGGTGG TGGTCTTCTT CATCACCTCG GTGGATTCCG CGGCGCTGGT GACCGACACG ATGGCCAACG GCCATGAGGA CTTCAACCCG TTGGGGCAGC GCATCTTCTG GGCCGTCGCC ATCGCCGTGG TAACCGCCAC TCTGCTGGTG TTCTCCGGAA CCGGCGGTTT GGGAGCGCTC GAGAAGATCG TCGTCCTCGT CGGCCTGCCG TTCTTCGTGA TGGGATACTT CCAGATGTAC GCGGTGTATC GAGCTCTTCG GGAAGACGCC GGAGAGCTGC CGGCGATGCG GACACGCAGG TGGAAGAAGG TCCTGCCCCC GGAAGAGTAC GAACGCCGCC AGGACGAGGA TGAACACGAC GTCTCAGAAG TCGTCGTTCA GCCCGAAGCC ATCGACGAAC GGCCCGTGAT GCGTGACCCC TACGTCGACC GGGGCCGGGC GGCCGGAGCC CCCGGCACAC TCTCGGCACG GACCGGTTCG ACAAGGCGCC CCGGCCAGCA GCTGAACCGA CCGGATCGGC CACCGCAGGC CGGAGGCCAG 
GACCGGAAGG ACTCTGATTG A
|
Protein sequence | MLIRLHDALR LRTSPPVFFG AAAVVIVFVL VTIIFTEPLD AAVTVASDWL YANLGWFYIL GLTLFLVFLV YVAASRFGRV KLGPDDEEPE HSGPAWFAML FAAGIGSILM FWGVAEPISH FGDPPRGPSL GVEPETAAAA ADAMNFTLYH FTLHTWAIFT LPALCFAYFI HKRNLPPRVS SIFQPILGEG IHGPIGKFID IVAIVGTVFG VAVSLGLGAL QINSGVNRVL GIPENAVWQL VIIGVVGGAA MISVALGLDR GIKRLSNINI WMAVGLLVFI LLTGSTLFVL QGTIEALGRY IVNLPELALW NDTFADTGWQ SNWTVFYWAW TISWSPFVGI FIARISKGRT IRQFITGVLL IPSGFSVLWF GIFGFSAFDI ELNGEGGLVE SVVEQQDIPG ALFTFLEHYP ATPFTSVLAI ILVVVFFITS VDSAALVTDT MANGHEDFNP LGQRIFWAVA IAVVTATLLV FSGTGGLGAL EKIVVLVGLP FFVMGYFQMY AVYRALREDA GELPAMRTRR WKKVLPPEEY ERRQDEDEHD VSEVVVQPEA IDERPVMRDP YVDRGRAAGA PGTLSARTGS TRRPGQQLNR PDRPPQAGGQ DRKDSD
|
| |
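The gene sequence begins with GTG while the protein sequence begins with M. That is consistent with translation table 11, where GTG can act as an initiation codon and is then read as methionine. The sketch below illustrates this on the first 30 bases only. The codon table is deliberately partial (just the codons that occur in this prefix), and the function name `translate_prefix` is hypothetical.

```python
# Partial codon table: only the codons that occur in the 30-base prefix.
CODONS = {
    "GTG": "V", "CTG": "L", "ATC": "I", "CGT": "R", "CTA": "L",
    "CAC": "H", "GAC": "D", "GCC": "A", "CGG": "R",
}


def translate_prefix(dna: str) -> str:
    """Translate whole codons of `dna`, reading an initial GTG as M."""
    usable = len(dna) - len(dna) % 3
    codons = [dna[i:i + 3] for i in range(0, usable, 3)]
    residues = [CODONS[c] for c in codons]
    # Under translation table 11, an initiating GTG is read as methionine.
    if codons and codons[0] == "GTG":
        residues[0] = "M"
    return "".join(residues)


# First three 10-base groups of the Gene sequence field, spaces removed.
gene_prefix = "GTGCTGATCC GTCTACACGA CGCCCTGCGG".replace(" ", "")
protein_prefix = translate_prefix(gene_prefix)
print(protein_prefix)  # prints "MLIRLHDALR"
```

The result matches the first ten residues of the Protein sequence field. A complete translation would need the full 64-codon table for genetic code 11.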