Gene Ndas_2667 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_2667 
Symbol 
ID9246518 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp3175834 
End bp3177654 
Gene Length1821 bp 
Protein Length606 aa 
Translation table11 
GC content63% 
IMG OID 
Productcholine/carnitine/betaine transporter 
Protein accessionYP_003680590 
Protein GI297561616 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0383388 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCTGATCC GTCTACACGA CGCCCTGCGG CTGCGTACCT CGCCGCCGGT ATTCTTCGGC 
GCGGCCGCGG TGGTCATCGT GTTCGTGCTT GTCACCATCA TCTTCACCGA GCCGCTGGAT
GCAGCCGTCA CCGTCGCCTC GGACTGGCTG TACGCCAACC TGGGCTGGTT CTACATTCTC
GGCCTCACCT TGTTTCTGGT TTTTCTGGTT TATGTTGCCG CCAGCCGATT CGGGCGGGTG
AAGCTGGGCC CCGACGACGA AGAGCCCGAG CACTCCGGTC CGGCTTGGTT CGCCATGCTC
TTCGCCGCTG GCATCGGTAG CATCCTGATG TTCTGGGGCG TGGCCGAACC CATCAGTCAT
TTCGGTGATC CGCCGCGGGG TCCATCCCTG GGCGTGGAGC CGGAGACAGC AGCTGCTGCC
GCGGACGCGA TGAACTTCAC GCTCTATCAT TTCACCCTGC ACACCTGGGC GATCTTCACC
CTCCCAGCGC TGTGCTTCGC CTACTTCATC CACAAGCGGA ACCTGCCTCC GCGTGTCAGC
TCAATCTTCC AGCCGATTCT CGGTGAGGGG ATCCACGGGC CGATCGGTAA GTTCATCGAC
ATCGTCGCCA TCGTCGGCAC GGTCTTTGGC GTCGCGGTCT CTCTCGGACT GGGTGCTCTG
CAGATCAACA GCGGAGTCAA CCGTGTGCTC GGCATCCCGG AGAACGCCGT GTGGCAACTG
GTCATCATCG GCGTCGTCGG CGGGGCCGCG ATGATCTCCG TCGCGTTGGG CCTGGACCGC
GGCATCAAAC GCCTGTCCAA TATAAACATT TGGATGGCCG TGGGTCTGCT GGTGTTCATC
CTGCTGACGG GTTCCACTCT CTTCGTGCTC CAGGGCACCA TCGAAGCGCT GGGCCGTTAC
ATAGTGAATC TGCCGGAACT GGCCTTGTGG AACGACACCT TCGCCGACAC CGGTTGGCAG
TCCAACTGGA CAGTGTTCTA TTGGGCCTGG ACGATCAGCT GGTCACCGTT CGTGGGCATC
TTCATCGCCC GGATCTCCAA AGGCCGCACC ATCCGTCAGT TCATCACCGG AGTACTCCTC
ATTCCGTCCG GTTTCTCGGT GCTGTGGTTC GGCATCTTCG GGTTCTCCGC TTTCGACATC
GAGCTCAACG GTGAGGGCGG CCTGGTCGAA AGTGTCGTGG AGCAGCAGGA CATCCCCGGG
GCGCTGTTCA CGTTCTTGGA GCACTACCCC GCCACCCCCT TCACCTCGGT GCTCGCCATC
ATCCTGGTGG TGGTCTTCTT CATCACCTCG GTGGATTCCG CGGCGCTGGT GACCGACACG
ATGGCCAACG GCCATGAGGA CTTCAACCCG TTGGGGCAGC GCATCTTCTG GGCCGTCGCC
ATCGCCGTGG TAACCGCCAC TCTGCTGGTG TTCTCCGGAA CCGGCGGTTT GGGAGCGCTC
GAGAAGATCG TCGTCCTCGT CGGCCTGCCG TTCTTCGTGA TGGGATACTT CCAGATGTAC
GCGGTGTATC GAGCTCTTCG GGAAGACGCC GGAGAGCTGC CGGCGATGCG GACACGCAGG
TGGAAGAAGG TCCTGCCCCC GGAAGAGTAC GAACGCCGCC AGGACGAGGA TGAACACGAC
GTCTCAGAAG TCGTCGTTCA GCCCGAAGCC ATCGACGAAC GGCCCGTGAT GCGTGACCCC
TACGTCGACC GGGGCCGGGC GGCCGGAGCC CCCGGCACAC TCTCGGCACG GACCGGTTCG
ACAAGGCGCC CCGGCCAGCA GCTGAACCGA CCGGATCGGC CACCGCAGGC CGGAGGCCAG
GACCGGAAGG ACTCTGATTG A
 
Protein sequence
MLIRLHDALR LRTSPPVFFG AAAVVIVFVL VTIIFTEPLD AAVTVASDWL YANLGWFYIL 
GLTLFLVFLV YVAASRFGRV KLGPDDEEPE HSGPAWFAML FAAGIGSILM FWGVAEPISH
FGDPPRGPSL GVEPETAAAA ADAMNFTLYH FTLHTWAIFT LPALCFAYFI HKRNLPPRVS
SIFQPILGEG IHGPIGKFID IVAIVGTVFG VAVSLGLGAL QINSGVNRVL GIPENAVWQL
VIIGVVGGAA MISVALGLDR GIKRLSNINI WMAVGLLVFI LLTGSTLFVL QGTIEALGRY
IVNLPELALW NDTFADTGWQ SNWTVFYWAW TISWSPFVGI FIARISKGRT IRQFITGVLL
IPSGFSVLWF GIFGFSAFDI ELNGEGGLVE SVVEQQDIPG ALFTFLEHYP ATPFTSVLAI
ILVVVFFITS VDSAALVTDT MANGHEDFNP LGQRIFWAVA IAVVTATLLV FSGTGGLGAL
EKIVVLVGLP FFVMGYFQMY AVYRALREDA GELPAMRTRR WKKVLPPEEY ERRQDEDEHD
VSEVVVQPEA IDERPVMRDP YVDRGRAAGA PGTLSARTGS TRRPGQQLNR PDRPPQAGGQ
DRKDSD