Gene Information |
Locus tag | Ndas_1338 |
Symbol | |
ID | 9245188 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 1645625 |
End bp | 1647253 |
Gene Length | 1629 bp |
Protein Length | 542 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | |
Product | choline/carnitine/betaine transporter |
Protein accession | YP_003679276 |
Protein GI | 297560302 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACCCGAG CTATCCGCTG GAACATCGAC AAAACAGTCT TCTGGCCCGC GCTGATCATG GTGATCGGCT TCAGCGCGCC GTTCGTGATC GCGCCGCAGA CGGGTGAACG CCTGCTCGGC GAGGTCCTGT CCCGGCTCCA GTCCGACCTG GGCTGGGTGT ACATGTGGTT CGTCGCCGCA CTCGCGGTAC TGCTCGTCTG GCTGCTCTTC AGCAGGTACG GCCGCATCAG GATGGGCGGC CCCGACGACC GCCCCGAGTT CTCCACCCCC ACCTGGCTCG CGATGATCTT CACCGCCGCG ATCGGCGGCG GCCTCATGTA CTGGGGGATC ATCGAGTGGG CGCACTACCA CGTGGATCCC CCGTTCCAAC TCGAACCGCA CAGCGCAGAG GCCGCCGAGT GGTCGGCCAC CTACCCCCTC TTCCACTGGG GTCCCACCGC CTGGGCGGTC TTCTGCGTAC CGACCCTGGC CCTGGCCTAC GCCTACCACG TGCGTAGAAT CCGCCGCCTG CGCCTGAGCG AGGCCTGCCG CGGGGTGCTC GGCGACCGCG TCGACCGCTG GCCGGGGCGC CTGATCGACG TCTTCTTCAT CCTCGGGATG ATCGGGGCGG CGGGGACCTC GCTGGCCCTG GCCGTGCCCA CCGTCGCCGA GGGCGCCTCC CGCATGCTGG GCTTCGAGCC GGGCCCCACC CTCAACACCA TCGTCATCGG CCTGTGGACC GTGCTGTTCG GGGGCAGCGT AGCCCTGGGC CTGCACCGCG GCCTCAAGCG CCTGGCCAAC CTCAACCTCT ACCTGGCCGC CGCCCTGGGC GTCCTGGTCC TGGTGCTGGG CCCGGCGGTC TTCGTCATCG ACACCTTCAC CAACAGCGTC GGCATGCTCG CCCAGAACAT CGTGCGGATG AGCACCTACA CCGATCCGGT CGGCGGGTCC GGCTTCGAGG AGATCTGGAC GGTCTTCTAC TGGGGCTGGT GGATCTCCTA CGGACCCTTC GTCGGCATGT TCTGCGCCAA GATCTCCAAG GGGCGCACCG TCCGGCAGAT CATCGTCGGC ATGTGCGGCT TCGGCAGCCT GGGCTGCTGG CTCTCGTTCG CGCTGCTCGG CAACTCCAGC ATGGCCTTCG AGCTGAGCGG ACAGGCGCCC ATCGTCGACA CCCTGGAGGC CGAGGGCGCG GTCCCGGCCA TCTTCGCCAC GCTGGAGGCG TTCCCGCTCA GCTGGATCAT CACCCCGCTG TTCCTGCTGC TGTTGCTGGT CTTCCTGGCC ACCACGCTGG ACTCCGCGTC CTACATCATG GGCTCTGCCA CCTCCCGTGA CCTGCCCAAC GAGGTCGAGC CCTCCCGGGC CAACCGCGTG CTCTGGGCGA TCGTCCTGGC CGCGGTGTCG GTCTCGGTGA TGTCGGCCGG AGGCACCGAC GCGTTGCAGA CCCTGTCGGT CGTGACCGCC TTCCCGCTGA TCTTCATCCT GAGCCTGGTC GCCGCCTCCC TGGTGCGCTG GCTCCGCGAG GACGAGCGCT ACCGTTCCGG CACTGCGAGC CCCACTCCCG GCCCCGGGAC CGCCGAGCCC GGTCCGGACC GAGGGGAGGA AGCCGGACTG GAAACGCCTC CCTCCCCACC GGCGGCGGCT GCGAGGTGA
|
Protein sequence | MTRAIRWNID KTVFWPALIM VIGFSAPFVI APQTGERLLG EVLSRLQSDL GWVYMWFVAA LAVLLVWLLF SRYGRIRMGG PDDRPEFSTP TWLAMIFTAA IGGGLMYWGI IEWAHYHVDP PFQLEPHSAE AAEWSATYPL FHWGPTAWAV FCVPTLALAY AYHVRRIRRL RLSEACRGVL GDRVDRWPGR LIDVFFILGM IGAAGTSLAL AVPTVAEGAS RMLGFEPGPT LNTIVIGLWT VLFGGSVALG LHRGLKRLAN LNLYLAAALG VLVLVLGPAV FVIDTFTNSV GMLAQNIVRM STYTDPVGGS GFEEIWTVFY WGWWISYGPF VGMFCAKISK GRTVRQIIVG MCGFGSLGCW LSFALLGNSS MAFELSGQAP IVDTLEAEGA VPAIFATLEA FPLSWIITPL FLLLLLVFLA TTLDSASYIM GSATSRDLPN EVEPSRANRV LWAIVLAAVS VSVMSAGGTD ALQTLSVVTA FPLIFILSLV AASLVRWLRE DERYRSGTAS PTPGPGTAEP GPDRGEEAGL ETPPSPPAAA AR
|
| |
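The length fields in this record can be cross-checked against the coordinates. The sketch below is a minimal illustration, not part of the source database. It assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical. Note also that the gene sequence starts with GTG while the protein starts with M: under translation table 11, GTG is an accepted alternative start codon and is read as methionine at the initiation position.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon encodes no residue


def gc_fraction(sequence: str) -> float:
    """Fraction of G and C among A/C/G/T characters; spaces are ignored."""
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotides found")
    return sum(b in "GC" for b in bases) / len(bases)


length = gene_length_bp(1645625, 1647253)
print(length)                      # 1629, matches "Gene Length"
print(protein_length_aa(length))   # 542, matches "Protein Length"
print(gc_fraction("GTGACCCGAG"))   # GC fraction of the first 10 bases only
```

Passing the full Gene sequence string to `gc_fraction` would give the value to compare against the reported GC content; the grouping spaces in the record do not need to be removed first.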