Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_1904 |
Symbol | |
ID | 9245754 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 2322483 |
End bp | 2324519 |
Gene Length | 2037 bp |
Protein Length | 678 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | |
Product | choline/carnitine/betaine transporter |
Protein accession | YP_003679838 |
Protein GI | 297560864 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.674112 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCGAAA AACCACATAA AAACAAAGAG CAAACGAGCC GTCGCAGGAA GGAACACAAA GACGAGAAAG GCAAGGACCG GGCCTCGGCG CCGCGGAGCC GGACGGGCCA CACCGCGCTC GGGAGCCTGG ACTACCAGCA CCCCACGTAT CCCCACGACA CACACCCGGT CCTCGTCCCG GGCATCTCGA TCGACGACCA GCGCCGCAGC TACCGGGTCG ACTGGCTCGT CTTCGCGGTC GCCGGATCGC TGACGGTCGC CTTCGTCGTC TGGGGCATCT GGTCGCCGGG AAGCGTCGCC GCGGTCGCCG AGACGGCGTT CTACTGGTCG ACCGACAACC TCGGCTGGAT GTTCAACGTC GTGGCCATCG TCGTGCTCGT CTCCACCGTG GGCATAGCGT TCTCGCCCTA CGGGAGGATC CCGCTGGGCA AGGACGGCGA GCGGCCCGAG TTCAGCACGT TCTCCTGGAC GGCCATGCTG TTCGCCGCCG GGCTGGGTGT GGCGGTCCTG TTCTGGGGGC CCTCGGAGCC GCTCGGCTAC TTCATCTCAC CGCCTCCGCT GACGAACGAG CCGGAGTCGG TCGAGGCCAT GCACACGGCG CTCGCCCAGA TGTACTACCA CTGGGGCTTC CACGCGTGGG CCGTCTACGC GCTGGTCGGC GGGGCCGTCG CCTACGCCGC CTACCGCCGC GGCCGTCCCC TGCTCATGTC CTCGATCTTC CGCGCCCTGT TCGGCCGACG GCTCACCGAG GGTTTCGCCG GAAAGCTCGT CGACATCTTC GCGATCATCG CCACGCTCTT CGGCACCGCC GCCGCCCTCG GTATCGCGGC GATGCAGATC GGCTCCGGCG TCAGCATCGT GTCGGGCGCC GGGGACCTCA CGAACAACAC CCTGGTCGTC ATCATCGCGG TCCTGACCGT CGGCTTCGTC GTCTCGGCGG TCTCGGGCGT CGCACGGGGC ATCCGGCTCC TGTCGAACGT GAACATCGTG CTCACGATCG GCATCGTCGC GGTCTTCCTC TTCCTCGGCC CCACACTGTT CCTCCTGAAC CTGCTGCCCT CGGCGGTCAT GGAGTACTTC GGCTCCCTGT TCGACATGAT GGGCCGATCG CTCTCGTGGG GTCCCGAGAC GCAGGAGTTC CAGTCGCTGT GGACCGTCTA CTACTGGGCG TGGTGGATCT CCTGGTCGCC CTTCGTGGGC ATCTTCCTCG CGCGCATCTC CCGCGGCCGC ACCATCCGCC AGTTCACCCT CGGCACGATC ATCATCCCGT CCTCGCTGCT CTTCGTCGCC TACGGGGTGA TGGGCGGAAC CTCCATCTGG ATGTACCGGG AGGGCGCCCC CGGTCTCACC GAGGGCATGC CCGCGCCCGA GGTGCTCTTC GCCCTCATCG ACAACCTGCC GTACGTGGAG TGGCTGCCCT TCGTCGTGAT CGTGGTGCTC GCGATCTTCT TCATCACCGC CGCCGACTCC GCGTCGGTGG TGATGGGCAT GCTCACCACG CAGGGGGACC AGAACCCGCG CCTGTGGGTG GTCGTCTTCT GGGGCCTGGT CATGTCGGGG ATCGCGATCG TGATGCTGCT TCTGGGCGAC GCGACGGCGT TGACGGGCCT GCAGCAGCTG GTGATCGTCA CGGCGGTGCC CTTCGCGCTC GTACTGGTCC TCGCCGTCGT CGCGTGGTTC AGGGAGCTGC GCACCGATCC CCTCACCCTG CGCATGCACT ACATCGACAC GGCGATGGAC AACGCCGTGA CGGAGGGGGT GGACCGCTAC GGCGACGACT TCTCCCTGAA GGTCGTGGAA TCGCGGCCCG GGGACGGAGC CGGGGCGGGC ATCGACTCCA CCGACGAGAG CTACACGGAG TGGTACCAGC GCACCAATGA GGAGGGCGAA CCGGTCGGCT TCGACTTCGG GACCGGGGAG TGGGCCGACG GCTACGATCC GGACACCGGT GAGACCTCGG AGACCGCCCC CGAAGGGGCC GTGGTGGCAC GGGAAGACCG CGTCGAGGTG TCGGAGACCG GGACCGAGGA ACGCTGA
|
Protein sequence | MPEKPHKNKE QTSRRRKEHK DEKGKDRASA PRSRTGHTAL GSLDYQHPTY PHDTHPVLVP GISIDDQRRS YRVDWLVFAV AGSLTVAFVV WGIWSPGSVA AVAETAFYWS TDNLGWMFNV VAIVVLVSTV GIAFSPYGRI PLGKDGERPE FSTFSWTAML FAAGLGVAVL FWGPSEPLGY FISPPPLTNE PESVEAMHTA LAQMYYHWGF HAWAVYALVG GAVAYAAYRR GRPLLMSSIF RALFGRRLTE GFAGKLVDIF AIIATLFGTA AALGIAAMQI GSGVSIVSGA GDLTNNTLVV IIAVLTVGFV VSAVSGVARG IRLLSNVNIV LTIGIVAVFL FLGPTLFLLN LLPSAVMEYF GSLFDMMGRS LSWGPETQEF QSLWTVYYWA WWISWSPFVG IFLARISRGR TIRQFTLGTI IIPSSLLFVA YGVMGGTSIW MYREGAPGLT EGMPAPEVLF ALIDNLPYVE WLPFVVIVVL AIFFITAADS ASVVMGMLTT QGDQNPRLWV VVFWGLVMSG IAIVMLLLGD ATALTGLQQL VIVTAVPFAL VLVLAVVAWF RELRTDPLTL RMHYIDTAMD NAVTEGVDRY GDDFSLKVVE SRPGDGAGAG IDSTDESYTE WYQRTNEEGE PVGFDFGTGE WADGYDPDTG ETSETAPEGA VVAREDRVEV SETGTEER
|
| |