Gene Ndas_3532 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3532 
Symbol 
ID9247401 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp4241239 
End bp4242909 
Gene Length1671 bp 
Protein Length556 aa 
Translation table11 
GC content67% 
IMG OID 
Productcholine/carnitine/betaine transporter 
Protein accessionYP_003681439 
Protein GI297562465 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.825969 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCCAAA AGCTGGCGCA AAGACTGGGG CTCGAGACGA ACCCGGCGAT CTTCTTCGTG 
TCCGCCGCGT TGACCATCGT TTTCGTGGTG TCCGCGATCT TCTTCACCGA CACGGTGGAC
GCCGTCTTCG GAACCACCTC CGGCTGGATC CTCACCAACC TGGGATGGTT CTACATCCTC
GGTGTGACCA CCTTCCTCAT CTTCCTGGTC TGGATCGCGT TCAGCAGGTT CGGGCGGGTA
CGACTGGGGC CGCCGGAGAG CACGCCCGAC TACAGCAACT CCGCCTGGTT CGCCATGCTG
TTCGCCGCCG GCATCGGCAG CATCCTGATG TTCTGGGGAG TGGCCGAACC CATCAGCCAC
TACGCCGAGC CGCCGCGCTC CGACGTCGGC CCGCAGACCA TCGAGGCGGC CGAGGAGGCG
ATGGGCTTCA CGCTCTACCA CTTCGGCCTG CACACCTGGA CGATCTTCTG TCTGCCCGGG
CTCGCCTTCG CCTACTTCGT GTACCGCAAG GGCCTGCCCT TCCGGGTCAG CTCCGTCTTC
CAGCCCTTCC TCGGGGACCG GATCAACGGC CCCATCGGGC GGACCATCGA CATCGTCGCG
GTCCTGGGCA CCCTGTTCGG CGTCGCCGTC ACCATCGGCC TGGGCACCCT CCAGATGAAC
AGCGGCCTCA ACACCCTGTT CGGCCTGGAG GAGAGCCGCG TCAGCCAACT GATCCTCATC
GCGATCGTCA CCACCGTCGC CGTGATCTCC GTGGCCACCG GCCTCGACGT CGGCATCAAG
TGGCTGTCCA CGATCAACAT CTACATGGCC GTGGGGCTGC TGGTCTTCGT CTTCCTCGCC
GGCTCGACGC TCTACCTGGC CAAGGGGGTC ATCGAGACCA CCGGCGTCTA CCTGGAGATG
CTGGTCCCGC TGTCGTTCTG GAACGACACC TTCGCCAACA CCGGCTGGCA GGGCTCCTGG
ACCGTCTTCT ACTGGGCGTG GACCATCACC TGGTCGCCCT TCGTCGGCAT CTTCATCGCG
CGCATCTCCA AGGGCCGGAC CATCCGGGAG TTCATCCTCG GCGTGCTGGC CGCGCCCACC
GCGTTCAGCG TCGTGTGGTT CAGCGTGTTC GGCCTGTCGG CCTTCGACAT CGAGCGCAAC
CAGGGCGGCG GCCTGGTGGA CGAGGTGGTC ACGCAGGAGG ACATCCCCGG TTCCCTGTTC
GCCTTCCTGG AGCACTTCCC GCTGACCACG GTCGTGTCGG TGGTGGCCAT CCTCATCGTC
ATCATCTTCT TCACCACGTC CTCGGACTCG GCCTCCCTGG TGGTGGACAT GCTCTGCTCC
GGCAAGAGCG ACAACCCCAC CCGGCAGCGG GTGTTCTGGG GGATCACCGA GGGCGTCGTC
GCCGCGACCG TGCTCACCGC CTCCGGCGTG GGCGGCCTGG ACGCCCTGCA ACAGACGATC
ATCGTGTTCG GGCTGCCCTT CTTCGTGATC GGCTTCTTCA TGATGGTGGG GCTGGTGCGC
TCGCTGCACG CCGAGTTCGC GGAGGGGTCG GGCCTTCAGC GCGAACGCAA GGCTCCGCTG
GCCACCAGGG CGCGCAGGAT CCAGCGGCGG TACCGGCGCC TGGGCGAGGC CGGGTCCCGC
GCCGGCGAGC GCAACGGGGG CAACGGCACC GGTGACGGCG GTCCGCGGTA G
 
Protein sequence
MFQKLAQRLG LETNPAIFFV SAALTIVFVV SAIFFTDTVD AVFGTTSGWI LTNLGWFYIL 
GVTTFLIFLV WIAFSRFGRV RLGPPESTPD YSNSAWFAML FAAGIGSILM FWGVAEPISH
YAEPPRSDVG PQTIEAAEEA MGFTLYHFGL HTWTIFCLPG LAFAYFVYRK GLPFRVSSVF
QPFLGDRING PIGRTIDIVA VLGTLFGVAV TIGLGTLQMN SGLNTLFGLE ESRVSQLILI
AIVTTVAVIS VATGLDVGIK WLSTINIYMA VGLLVFVFLA GSTLYLAKGV IETTGVYLEM
LVPLSFWNDT FANTGWQGSW TVFYWAWTIT WSPFVGIFIA RISKGRTIRE FILGVLAAPT
AFSVVWFSVF GLSAFDIERN QGGGLVDEVV TQEDIPGSLF AFLEHFPLTT VVSVVAILIV
IIFFTTSSDS ASLVVDMLCS GKSDNPTRQR VFWGITEGVV AATVLTASGV GGLDALQQTI
IVFGLPFFVI GFFMMVGLVR SLHAEFAEGS GLQRERKAPL ATRARRIQRR YRRLGEAGSR
AGERNGGNGT GDGGPR