Gene Information |
Locus tag | Ndas_3576 |
Symbol | |
ID | 9247445 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 4288522 |
End bp | 4290225 |
Gene Length | 1704 bp |
Protein Length | 567 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | |
Product | choline/carnitine/betaine transporter |
Protein accession | YP_003681483 |
Protein GI | 297562509 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
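The length fields in the record above can be cross-checked against the coordinates. The following is a minimal sketch, not part of the source database; it assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS span includes the stop codon. The function names are hypothetical.

```python
# Values copied from the Gene Information record.
START_BP = 4288522
END_BP = 4290225


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the CDS ends with one stop codon (an assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1704
print(protein_length(length_bp))  # 567
```

Under these assumptions the computed values agree with the listed Gene Length (1704 bp) and Protein Length (567 aa).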
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.440158 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCGTGG GCCGGTGCCC GCACCGGTCC GTCATCCCCC GACCAGGGAA AGCGACCCTC CGGGGCTGGG AAGGAGCCAG CATCTCATCC CTACGCGAGT ACATCCGAGA ACACACCAAC CCCACGGTCT TCGGGGTTTC CGGGGTCGTG ATCCTCGCCT TCGTCATCGT CGGCATCGTG GCCACCGAGC CGATGCTCGA CGCGGCCACC GCCACGCGCG ACTGGATAGG TGAGAAGCTG GGCTGGGTCT ACGTCCTGTC CACCACCTTC TTCCTGGTGA TGGCGGTCTT CCTCATGCTG AGCCGGTTCG GGAAGATCCG GCTGGGCCCG GCCGACTCCA GGCCCGAGTT CGGGACGCTG GCCTGGTTCG CCATGCTGTT CACCACCGGC ATGGGCATCG GCCTGGTGTT CTGGGGGGTG TCGGAACCCA TCCACCACCT CACCTCGCCG CGCTCGGCCG AGTTCGTCAC CCCCGAGGGT GAACCGCCCC CGCCCGAGGC GGCGAGCGAG GCGCTGGCCC TGAGCTACTT CCACTGGAGC TTCCACCCCT GGGCCATCTA CATCGCGCTC GGCCTGTCGC TGGGCTACTT CGCCTTCCGC AAGGGCCTGC CCCTGCGCCC GGCCTCCGCG CTCTACCCCC TCCTGGGCGA CCGGGCTTTC GGCTGGCCCG GGAACGTCGT GGACATCCTC GCCATCTTCG GCACGATCTT CGGCCTGGCC ACCTCGCTGG GCCTGGGCAC CCTCCAGATC AACGGCGGGC TCAACCACGT CTTCGGCATC CCCTCCAACG CCACCGTGCA GTCGGTCATC ATCATCCTCA TCACCGCGGT CGCGCTGGCC AGCGTGCTCT CCGGCATCGA CAAGGGCATC CGCCGCCTGT CGATGATCAA CCTGTGGCTG GCCTTCCTGC TGCTGGTGGT CGTCTTCGCC CTGGGCCCCA AGCTGTGGAT CGCCAGCATC ATGACCACCG GCACGGGCGA GTACCTGAGC AACATCGTCG AGTGGAGCCT GGCCTTCCCG AGCCCGCTCA TCGACGAGAC GGCGGCGGCC TGGACCACCG CGTGGCCCAT CTTCTACTGG GGCTGGTGGA TCTCCTGGGC CCCGTTCGTG GGCATCTTCC TGGCCCGTAT CTCCTACGGC CGCACCATCC GCGAGTTCGT CATCGGGGCG CTGTTCGCCC CCGTCGCCGT GTCCATCCTG TGGTTCGGGG TGTTCGGCGG CTCCGGCCTG TACTACGAAC TGTTCGGGAA CGCCGGACTG AGCGCGCTGA GCGAGGAGGA CCGGTCCTTC CGCCTCGTGG AGCTGCTGCC CGGAGGGCCG CTCATCGGCG GCATCATCTC CGTCCTGCTG ATCATCGTGG TGGCGGTCTT CTTCATCACC TCCTCCGACT CCGGCTCGCT GGTGGTGGAC ACGCTCGCCA GCGGCGGGAG CCTCAAGCCG GTCAAGGCCC AGCGCGCCTT CTGGGCGATC AGCGAGGGCG CGGTCACCCT GATCCTGCTG GTGCTGGGCG GGGAGAACGC CCTGTCGGCG CTCCAGGCCG CGTCGGTGGT CACCGGACTG CCCTTCGCGA TCATCCTGCT GCTCATGGTG TGGGGCCTGA TCAAGGGGCT GTCGGAGGAG CCCAGGCCCG GAGGCCCCCG GGAGCAGCGC GCCGAGGACC GCCCCCGGTC CGGCCGGTCC CCGGAGAAGC AGGCGAGCGA CTAG
|
Protein sequence | MSVGRCPHRS VIPRPGKATL RGWEGASISS LREYIREHTN PTVFGVSGVV ILAFVIVGIV ATEPMLDAAT ATRDWIGEKL GWVYVLSTTF FLVMAVFLML SRFGKIRLGP ADSRPEFGTL AWFAMLFTTG MGIGLVFWGV SEPIHHLTSP RSAEFVTPEG EPPPPEAASE ALALSYFHWS FHPWAIYIAL GLSLGYFAFR KGLPLRPASA LYPLLGDRAF GWPGNVVDIL AIFGTIFGLA TSLGLGTLQI NGGLNHVFGI PSNATVQSVI IILITAVALA SVLSGIDKGI RRLSMINLWL AFLLLVVVFA LGPKLWIASI MTTGTGEYLS NIVEWSLAFP SPLIDETAAA WTTAWPIFYW GWWISWAPFV GIFLARISYG RTIREFVIGA LFAPVAVSIL WFGVFGGSGL YYELFGNAGL SALSEEDRSF RLVELLPGGP LIGGIISVLL IIVVAVFFIT SSDSGSLVVD TLASGGSLKP VKAQRAFWAI SEGAVTLILL VLGGENALSA LQAASVVTGL PFAIILLLMV WGLIKGLSEE PRPGGPREQR AEDRPRSGRS PEKQASD
|
| |
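The sequence fields are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that grouping, compute GC content, and translate the gene sequence for comparison with the listed protein. It is an illustration under stated assumptions, not the database's own method: the helper names are hypothetical, and the codon table uses the standard amino acid assignments, which translation table 11 shares. Alternative start codons permitted by table 11 are not handled, which does not matter here because this gene starts with ATG.

```python
# Standard codon-to-amino-acid assignments (shared by translation table 11),
# built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace grouping used in the record and upper-case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from the record.
prefix = "ATGTCCGTGG GCCGGTGCCC GCACCGGTCC"
print(translate(prefix))  # MSVGRCPHRS
```

Passing the full Gene sequence field to `translate` and `gc_content` allows a comparison with the Protein sequence and GC content fields of the record.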