Gene Information |
Locus tag | Ndas_3532 |
Symbol | |
ID | 9247401 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 4241239 |
End bp | 4242909 |
Gene Length | 1671 bp |
Protein Length | 556 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | |
Product | choline/carnitine/betaine transporter |
Protein accession | YP_003681439 |
Protein GI | 297562465 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
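The coordinate and length fields above can be checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the Gene Information table above.
start_bp = 4241239
end_bp = 4242909
protein_len_aa = 556

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_len_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
codons = gene_len_bp // 3
expected_protein_len = codons - 1

print(gene_len_bp)                              # 1671
print(gene_len_bp % 3)                          # 0 (whole number of codons)
print(expected_protein_len == protein_len_aa)   # True
```

Under these assumptions the reported gene length (1671 bp) and protein length (556 aa) agree with the coordinates.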
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.825969 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTCCAAA AGCTGGCGCA AAGACTGGGG CTCGAGACGA ACCCGGCGAT CTTCTTCGTG TCCGCCGCGT TGACCATCGT TTTCGTGGTG TCCGCGATCT TCTTCACCGA CACGGTGGAC GCCGTCTTCG GAACCACCTC CGGCTGGATC CTCACCAACC TGGGATGGTT CTACATCCTC GGTGTGACCA CCTTCCTCAT CTTCCTGGTC TGGATCGCGT TCAGCAGGTT CGGGCGGGTA CGACTGGGGC CGCCGGAGAG CACGCCCGAC TACAGCAACT CCGCCTGGTT CGCCATGCTG TTCGCCGCCG GCATCGGCAG CATCCTGATG TTCTGGGGAG TGGCCGAACC CATCAGCCAC TACGCCGAGC CGCCGCGCTC CGACGTCGGC CCGCAGACCA TCGAGGCGGC CGAGGAGGCG ATGGGCTTCA CGCTCTACCA CTTCGGCCTG CACACCTGGA CGATCTTCTG TCTGCCCGGG CTCGCCTTCG CCTACTTCGT GTACCGCAAG GGCCTGCCCT TCCGGGTCAG CTCCGTCTTC CAGCCCTTCC TCGGGGACCG GATCAACGGC CCCATCGGGC GGACCATCGA CATCGTCGCG GTCCTGGGCA CCCTGTTCGG CGTCGCCGTC ACCATCGGCC TGGGCACCCT CCAGATGAAC AGCGGCCTCA ACACCCTGTT CGGCCTGGAG GAGAGCCGCG TCAGCCAACT GATCCTCATC GCGATCGTCA CCACCGTCGC CGTGATCTCC GTGGCCACCG GCCTCGACGT CGGCATCAAG TGGCTGTCCA CGATCAACAT CTACATGGCC GTGGGGCTGC TGGTCTTCGT CTTCCTCGCC GGCTCGACGC TCTACCTGGC CAAGGGGGTC ATCGAGACCA CCGGCGTCTA CCTGGAGATG CTGGTCCCGC TGTCGTTCTG GAACGACACC TTCGCCAACA CCGGCTGGCA GGGCTCCTGG ACCGTCTTCT ACTGGGCGTG GACCATCACC TGGTCGCCCT TCGTCGGCAT CTTCATCGCG CGCATCTCCA AGGGCCGGAC CATCCGGGAG TTCATCCTCG GCGTGCTGGC CGCGCCCACC GCGTTCAGCG TCGTGTGGTT CAGCGTGTTC GGCCTGTCGG CCTTCGACAT CGAGCGCAAC CAGGGCGGCG GCCTGGTGGA CGAGGTGGTC ACGCAGGAGG ACATCCCCGG TTCCCTGTTC GCCTTCCTGG AGCACTTCCC GCTGACCACG GTCGTGTCGG TGGTGGCCAT CCTCATCGTC ATCATCTTCT TCACCACGTC CTCGGACTCG GCCTCCCTGG TGGTGGACAT GCTCTGCTCC GGCAAGAGCG ACAACCCCAC CCGGCAGCGG GTGTTCTGGG GGATCACCGA GGGCGTCGTC GCCGCGACCG TGCTCACCGC CTCCGGCGTG GGCGGCCTGG ACGCCCTGCA ACAGACGATC ATCGTGTTCG GGCTGCCCTT CTTCGTGATC GGCTTCTTCA TGATGGTGGG GCTGGTGCGC TCGCTGCACG CCGAGTTCGC GGAGGGGTCG GGCCTTCAGC GCGAACGCAA GGCTCCGCTG GCCACCAGGG CGCGCAGGAT CCAGCGGCGG TACCGGCGCC TGGGCGAGGC CGGGTCCCGC GCCGGCGAGC GCAACGGGGG CAACGGCACC GGTGACGGCG GTCCGCGGTA G
|
Protein sequence | MFQKLAQRLG LETNPAIFFV SAALTIVFVV SAIFFTDTVD AVFGTTSGWI LTNLGWFYIL GVTTFLIFLV WIAFSRFGRV RLGPPESTPD YSNSAWFAML FAAGIGSILM FWGVAEPISH YAEPPRSDVG PQTIEAAEEA MGFTLYHFGL HTWTIFCLPG LAFAYFVYRK GLPFRVSSVF QPFLGDRING PIGRTIDIVA VLGTLFGVAV TIGLGTLQMN SGLNTLFGLE ESRVSQLILI AIVTTVAVIS VATGLDVGIK WLSTINIYMA VGLLVFVFLA GSTLYLAKGV IETTGVYLEM LVPLSFWNDT FANTGWQGSW TVFYWAWTIT WSPFVGIFIA RISKGRTIRE FILGVLAAPT AFSVVWFSVF GLSAFDIERN QGGGLVDEVV TQEDIPGSLF AFLEHFPLTT VVSVVAILIV IIFFTTSSDS ASLVVDMLCS GKSDNPTRQR VFWGITEGVV AATVLTASGV GGLDALQQTI IVFGLPFFVI GFFMMVGLVR SLHAEFAEGS GLQRERKAPL ATRARRIQRR YRRLGEAGSR AGERNGGNGT GDGGPR
|
| |
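The gene and protein sequences above can be compared with a short translation routine. This is a minimal sketch, not the pipeline that produced the record: it builds the codon table for translation table 11 (whose amino-acid assignments match the standard code), strips the spacing used in the listing, and translates until the first stop codon. The helper names `clean`, `translate`, and `gc_percent` are hypothetical. Because the gene sequence starts with ATG even though the gene lies on the minus strand, the sketch assumes the listing is already in coding orientation.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for translation table 11, listed in TCAG codon order
# (first base varies slowest, third base fastest).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces between 10-character groups and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 and last 12 bases of the gene sequence listed above.
gene_head = "ATGTTCCAAA AGCTGGCGCA AAGACTGGGG"
gene_tail = "GGTCCGCGGTAG"

print(translate(gene_head))  # MFQKLAQRLG
print(translate(gene_tail))  # GPR
```

The two fragments reproduce the first ten and last three residues of the listed protein. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein sequence and the reported GC content.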