Gene Ndas_4604 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_4604 
Symbol 
ID9248485 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp5460310 
End bp5461872 
Gene Length1563 bp 
Protein Length520 aa 
Translation table11 
GC content71% 
IMG OID 
Productsodium/proline symporter 
Protein accessionYP_003682496 
Protein GI297563522 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCTGGT CCGTCGCGAC CTTCGGCGTC TACCTCGCCG CCATGGTGGC CATCGGCCTG 
TGGGCCTACA AGCTCACTGT CTCGCAGTCG GACTTCGTGC TCGGCGGCAG ACAGCTCAAC
AGCTGGGTGG CGGGTCTGAG CGCCAACGCC AGCGACTTCA GCGGGTGGCT GCTGCTCGGA
CTGCCCGGCG CCATCTACGT CTCCGGTCTG GGCGAGGCCT GGATCGCGGT CGGCCTGGCC
TGCGGCTTCG CCGGAAGCTG GATCCTCCTC GCGCCCCGCC TGCGCGTGTA CACCGAGCGC
GTGACCGACG CCCGCTCGGG GGGCGACTCC GACTCCCTGA CCCTCTCCTC CTTTCTAGAG
AACCGCTTCA ACGACCCCAC ACGGCTGCTG CGCGGGGTGT CGGCGGTGCT CATCATCGTC
TTCTACTTCT TCTACGTCGC CTCCGGGCTC GTCGCCATGG CCGCCCTGTT CGACCAGGTC
TTCGGACTGA GCCCGGGCCC CGCCATCGCC ATCGGCGTGG GCATCGTGGT GCTCTACACC
GTGCTCGGCG GCTTCCTCGC GGTGTCCTAC ACCGACGTGG TGCAGGCGGC GATGATGTGG
ATCGCCCTGC TGGCCGTCCC CGTCATGGCG GTCACCGCGC TCGGCGGTTT CGCCGGGCTG
ACCGAGGGCG TGTCCGACAA GAGCGACGGG CTGCTGTCGG CCGTAGGCGG CACCGCCCTG
GACGCCGAGC TCGGCCAGTG GGTGAGCACC GACACCCTCG GCTGGGTGGT CATCGTCTCC
GGCCTGGCCT GGGGCTTCGG CTACTTCGGC CAGCCGCACA TCCTGTCCCG CTACATGGGC
ATCCGCTCGG TCCGCGACAT CCCCAAGGCC GCCGTCATCA GCGTGGTCTG GGCGGTCACC
GCCATGGCCC TGGCCGTGCT GGTCGGCTTC ATCGGCGTCG CCTACTTCGA CACCCCGCTG
GAGAACTCCG AGCAGGTCTT CCCGCTCCTG ATCGAGGCCC TGACCCACCC GCTGGTCGCC
GGTCTGCTGC TCGCCGCCAT CCTCGCCGCC GTCATGAGCA CCGCCGACTC CCAGCTGCTG
GTCGCCGCGT CCGCGCTCAC CGAGGACGGC TACCGGGCCT TCGTGGACCG CGACGCCGAC
CCCGGGAGGC TGCTGTGGAT CAGCCGCGTC ACCGTGGTCG CCGTCGCCCT CGGCGCCGCG
GCCATCGCCC TGTGGGGCGA CCAGTCGGTG ATGGACCTGG TCGGCTACGC CTGGGCCGGG
TTCGGCGCGG GCTTCGGCCC GATCCTGGTG CTCTCCGTGT TCTGGAAGCG CATGAGCTGG
TCCGGCGCGC TCGCGGGCAT GATCGCGGGC GGCACCACCG CGATCGTGTG GGACGTCCTC
GACGCCAACT TCTTCGGCAC CGGCCTGTAC GCCATGGTCC CGGCCGTGGT CCTCAGCGTC
GCCGCCATCC TCGTCTTCAA CGGCCTGGCC AGGGTCACCC CGCAGATGGA GAGCGACTTC
GACCGGGTCG AGGCGGAGAT CCGCGGGACC GGCTCCGCCC CGGGAGAGGC CGCACGGGTC
TGA
 
Protein sequence
MLWSVATFGV YLAAMVAIGL WAYKLTVSQS DFVLGGRQLN SWVAGLSANA SDFSGWLLLG 
LPGAIYVSGL GEAWIAVGLA CGFAGSWILL APRLRVYTER VTDARSGGDS DSLTLSSFLE
NRFNDPTRLL RGVSAVLIIV FYFFYVASGL VAMAALFDQV FGLSPGPAIA IGVGIVVLYT
VLGGFLAVSY TDVVQAAMMW IALLAVPVMA VTALGGFAGL TEGVSDKSDG LLSAVGGTAL
DAELGQWVST DTLGWVVIVS GLAWGFGYFG QPHILSRYMG IRSVRDIPKA AVISVVWAVT
AMALAVLVGF IGVAYFDTPL ENSEQVFPLL IEALTHPLVA GLLLAAILAA VMSTADSQLL
VAASALTEDG YRAFVDRDAD PGRLLWISRV TVVAVALGAA AIALWGDQSV MDLVGYAWAG
FGAGFGPILV LSVFWKRMSW SGALAGMIAG GTTAIVWDVL DANFFGTGLY AMVPAVVLSV
AAILVFNGLA RVTPQMESDF DRVEAEIRGT GSAPGEAARV