Gene Ndas_1338 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_1338 
Symbol 
ID9245188 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp1645625 
End bp1647253 
Gene Length1629 bp 
Protein Length542 aa 
Translation table11 
GC content69% 
IMG OID 
Productcholine/carnitine/betaine transporter 
Protein accessionYP_003679276 
Protein GI297560302 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCCGAG CTATCCGCTG GAACATCGAC AAAACAGTCT TCTGGCCCGC GCTGATCATG 
GTGATCGGCT TCAGCGCGCC GTTCGTGATC GCGCCGCAGA CGGGTGAACG CCTGCTCGGC
GAGGTCCTGT CCCGGCTCCA GTCCGACCTG GGCTGGGTGT ACATGTGGTT CGTCGCCGCA
CTCGCGGTAC TGCTCGTCTG GCTGCTCTTC AGCAGGTACG GCCGCATCAG GATGGGCGGC
CCCGACGACC GCCCCGAGTT CTCCACCCCC ACCTGGCTCG CGATGATCTT CACCGCCGCG
ATCGGCGGCG GCCTCATGTA CTGGGGGATC ATCGAGTGGG CGCACTACCA CGTGGATCCC
CCGTTCCAAC TCGAACCGCA CAGCGCAGAG GCCGCCGAGT GGTCGGCCAC CTACCCCCTC
TTCCACTGGG GTCCCACCGC CTGGGCGGTC TTCTGCGTAC CGACCCTGGC CCTGGCCTAC
GCCTACCACG TGCGTAGAAT CCGCCGCCTG CGCCTGAGCG AGGCCTGCCG CGGGGTGCTC
GGCGACCGCG TCGACCGCTG GCCGGGGCGC CTGATCGACG TCTTCTTCAT CCTCGGGATG
ATCGGGGCGG CGGGGACCTC GCTGGCCCTG GCCGTGCCCA CCGTCGCCGA GGGCGCCTCC
CGCATGCTGG GCTTCGAGCC GGGCCCCACC CTCAACACCA TCGTCATCGG CCTGTGGACC
GTGCTGTTCG GGGGCAGCGT AGCCCTGGGC CTGCACCGCG GCCTCAAGCG CCTGGCCAAC
CTCAACCTCT ACCTGGCCGC CGCCCTGGGC GTCCTGGTCC TGGTGCTGGG CCCGGCGGTC
TTCGTCATCG ACACCTTCAC CAACAGCGTC GGCATGCTCG CCCAGAACAT CGTGCGGATG
AGCACCTACA CCGATCCGGT CGGCGGGTCC GGCTTCGAGG AGATCTGGAC GGTCTTCTAC
TGGGGCTGGT GGATCTCCTA CGGACCCTTC GTCGGCATGT TCTGCGCCAA GATCTCCAAG
GGGCGCACCG TCCGGCAGAT CATCGTCGGC ATGTGCGGCT TCGGCAGCCT GGGCTGCTGG
CTCTCGTTCG CGCTGCTCGG CAACTCCAGC ATGGCCTTCG AGCTGAGCGG ACAGGCGCCC
ATCGTCGACA CCCTGGAGGC CGAGGGCGCG GTCCCGGCCA TCTTCGCCAC GCTGGAGGCG
TTCCCGCTCA GCTGGATCAT CACCCCGCTG TTCCTGCTGC TGTTGCTGGT CTTCCTGGCC
ACCACGCTGG ACTCCGCGTC CTACATCATG GGCTCTGCCA CCTCCCGTGA CCTGCCCAAC
GAGGTCGAGC CCTCCCGGGC CAACCGCGTG CTCTGGGCGA TCGTCCTGGC CGCGGTGTCG
GTCTCGGTGA TGTCGGCCGG AGGCACCGAC GCGTTGCAGA CCCTGTCGGT CGTGACCGCC
TTCCCGCTGA TCTTCATCCT GAGCCTGGTC GCCGCCTCCC TGGTGCGCTG GCTCCGCGAG
GACGAGCGCT ACCGTTCCGG CACTGCGAGC CCCACTCCCG GCCCCGGGAC CGCCGAGCCC
GGTCCGGACC GAGGGGAGGA AGCCGGACTG GAAACGCCTC CCTCCCCACC GGCGGCGGCT
GCGAGGTGA
 
Protein sequence
MTRAIRWNID KTVFWPALIM VIGFSAPFVI APQTGERLLG EVLSRLQSDL GWVYMWFVAA 
LAVLLVWLLF SRYGRIRMGG PDDRPEFSTP TWLAMIFTAA IGGGLMYWGI IEWAHYHVDP
PFQLEPHSAE AAEWSATYPL FHWGPTAWAV FCVPTLALAY AYHVRRIRRL RLSEACRGVL
GDRVDRWPGR LIDVFFILGM IGAAGTSLAL AVPTVAEGAS RMLGFEPGPT LNTIVIGLWT
VLFGGSVALG LHRGLKRLAN LNLYLAAALG VLVLVLGPAV FVIDTFTNSV GMLAQNIVRM
STYTDPVGGS GFEEIWTVFY WGWWISYGPF VGMFCAKISK GRTVRQIIVG MCGFGSLGCW
LSFALLGNSS MAFELSGQAP IVDTLEAEGA VPAIFATLEA FPLSWIITPL FLLLLLVFLA
TTLDSASYIM GSATSRDLPN EVEPSRANRV LWAIVLAAVS VSVMSAGGTD ALQTLSVVTA
FPLIFILSLV AASLVRWLRE DERYRSGTAS PTPGPGTAEP GPDRGEEAGL ETPPSPPAAA
AR