Gene Ndas_3576 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3576 
Symbol 
ID9247445 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp4288522 
End bp4290225 
Gene Length1704 bp 
Protein Length567 aa 
Translation table11 
GC content69% 
IMG OID 
Productcholine/carnitine/betaine transporter 
Protein accessionYP_003681483 
Protein GI297562509 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.440158 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGTGG GCCGGTGCCC GCACCGGTCC GTCATCCCCC GACCAGGGAA AGCGACCCTC 
CGGGGCTGGG AAGGAGCCAG CATCTCATCC CTACGCGAGT ACATCCGAGA ACACACCAAC
CCCACGGTCT TCGGGGTTTC CGGGGTCGTG ATCCTCGCCT TCGTCATCGT CGGCATCGTG
GCCACCGAGC CGATGCTCGA CGCGGCCACC GCCACGCGCG ACTGGATAGG TGAGAAGCTG
GGCTGGGTCT ACGTCCTGTC CACCACCTTC TTCCTGGTGA TGGCGGTCTT CCTCATGCTG
AGCCGGTTCG GGAAGATCCG GCTGGGCCCG GCCGACTCCA GGCCCGAGTT CGGGACGCTG
GCCTGGTTCG CCATGCTGTT CACCACCGGC ATGGGCATCG GCCTGGTGTT CTGGGGGGTG
TCGGAACCCA TCCACCACCT CACCTCGCCG CGCTCGGCCG AGTTCGTCAC CCCCGAGGGT
GAACCGCCCC CGCCCGAGGC GGCGAGCGAG GCGCTGGCCC TGAGCTACTT CCACTGGAGC
TTCCACCCCT GGGCCATCTA CATCGCGCTC GGCCTGTCGC TGGGCTACTT CGCCTTCCGC
AAGGGCCTGC CCCTGCGCCC GGCCTCCGCG CTCTACCCCC TCCTGGGCGA CCGGGCTTTC
GGCTGGCCCG GGAACGTCGT GGACATCCTC GCCATCTTCG GCACGATCTT CGGCCTGGCC
ACCTCGCTGG GCCTGGGCAC CCTCCAGATC AACGGCGGGC TCAACCACGT CTTCGGCATC
CCCTCCAACG CCACCGTGCA GTCGGTCATC ATCATCCTCA TCACCGCGGT CGCGCTGGCC
AGCGTGCTCT CCGGCATCGA CAAGGGCATC CGCCGCCTGT CGATGATCAA CCTGTGGCTG
GCCTTCCTGC TGCTGGTGGT CGTCTTCGCC CTGGGCCCCA AGCTGTGGAT CGCCAGCATC
ATGACCACCG GCACGGGCGA GTACCTGAGC AACATCGTCG AGTGGAGCCT GGCCTTCCCG
AGCCCGCTCA TCGACGAGAC GGCGGCGGCC TGGACCACCG CGTGGCCCAT CTTCTACTGG
GGCTGGTGGA TCTCCTGGGC CCCGTTCGTG GGCATCTTCC TGGCCCGTAT CTCCTACGGC
CGCACCATCC GCGAGTTCGT CATCGGGGCG CTGTTCGCCC CCGTCGCCGT GTCCATCCTG
TGGTTCGGGG TGTTCGGCGG CTCCGGCCTG TACTACGAAC TGTTCGGGAA CGCCGGACTG
AGCGCGCTGA GCGAGGAGGA CCGGTCCTTC CGCCTCGTGG AGCTGCTGCC CGGAGGGCCG
CTCATCGGCG GCATCATCTC CGTCCTGCTG ATCATCGTGG TGGCGGTCTT CTTCATCACC
TCCTCCGACT CCGGCTCGCT GGTGGTGGAC ACGCTCGCCA GCGGCGGGAG CCTCAAGCCG
GTCAAGGCCC AGCGCGCCTT CTGGGCGATC AGCGAGGGCG CGGTCACCCT GATCCTGCTG
GTGCTGGGCG GGGAGAACGC CCTGTCGGCG CTCCAGGCCG CGTCGGTGGT CACCGGACTG
CCCTTCGCGA TCATCCTGCT GCTCATGGTG TGGGGCCTGA TCAAGGGGCT GTCGGAGGAG
CCCAGGCCCG GAGGCCCCCG GGAGCAGCGC GCCGAGGACC GCCCCCGGTC CGGCCGGTCC
CCGGAGAAGC AGGCGAGCGA CTAG
 
Protein sequence
MSVGRCPHRS VIPRPGKATL RGWEGASISS LREYIREHTN PTVFGVSGVV ILAFVIVGIV 
ATEPMLDAAT ATRDWIGEKL GWVYVLSTTF FLVMAVFLML SRFGKIRLGP ADSRPEFGTL
AWFAMLFTTG MGIGLVFWGV SEPIHHLTSP RSAEFVTPEG EPPPPEAASE ALALSYFHWS
FHPWAIYIAL GLSLGYFAFR KGLPLRPASA LYPLLGDRAF GWPGNVVDIL AIFGTIFGLA
TSLGLGTLQI NGGLNHVFGI PSNATVQSVI IILITAVALA SVLSGIDKGI RRLSMINLWL
AFLLLVVVFA LGPKLWIASI MTTGTGEYLS NIVEWSLAFP SPLIDETAAA WTTAWPIFYW
GWWISWAPFV GIFLARISYG RTIREFVIGA LFAPVAVSIL WFGVFGGSGL YYELFGNAGL
SALSEEDRSF RLVELLPGGP LIGGIISVLL IIVVAVFFIT SSDSGSLVVD TLASGGSLKP
VKAQRAFWAI SEGAVTLILL VLGGENALSA LQAASVVTGL PFAIILLLMV WGLIKGLSEE
PRPGGPREQR AEDRPRSGRS PEKQASD