Gene Ndas_4131 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_4131 
Symbol 
ID9248005 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp4932911 
End bp4934566 
Gene Length1656 bp 
Protein Length551 aa 
Translation table11 
GC content74% 
IMG OID 
ProductNa+/solute symporter 
Protein accessionYP_003682032 
Protein GI297563058 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGTGATCG CGGTCCTGGC CATCGGGCTG GTCCTGCTCA CCAGTCTGGT CATCGGCGTC 
TACGGGGTGC GGGCGGCCCG CTCGACCTCC GACTTCCTCG TGGCCTCGCG GCGGATCTCC
CCCAACTGGA ACGCGATGGC GATCGCCGGG GAGTACCTGT CGGCCGCCTC GGTGCTGGGG
CTCGCGGGGC TGCTGCTCAA GAACGGCCTG GGCACCATGT GGTACGCGGT GGGGTTCGCC
GCGGGCTACG TGGCGGTGGT GGCGCTGGTG GCCGGGCCGA TGCGCCGGTC GGGGGCGTTC
ACCGTGCCCG ACTTCGCCGA GTACCGCCTG GGTGCGCCGC GTCTGCGCAA GCTGTGCGGC
TGCGTGGTGC TGGTGATCAT GCTGCTGTAC CTGGTGCCCC AGTTCAAGGG TGCCGGTGTC
GTGCTGGCCC TGGTCAGCGG CACGCCCTAC TGGGTGGGGG TGCTGCTCGC CGGGCTGGTG
GTGAGCGCCT CGATCGCGGC CGGGGGGATG CGGTCGGCGA CCTACGTGCA GGCCTTCCAC
TACGTGGTCA AGCTGGCCTT CATCGCCATC CCGGCGGTCT ACCTGGTGGT GCACGCCGGT
CCCGGGACCC GGGCGGAGGC GCTCAACCCC GAGTGGGGCA CGCACTTCCC CGAGACGACC
CCGGTGGAGT TCACCGTGGG GACCCGGTTC AGCCTGGACG AGCCGGTGAC GGTGACCGCC
TCCGGGGGCG AACGGGTGGA GTTCGCGGCC GGGGAGCACA CCGTGGAGGC GGGCGCCGAG
TACGTGTTCC CCGAGGGCGC GCCCATCCCG CACCCGGCGG GGCTGCCGGA GCTGGGCAGC
GAGCGGTGGA GTTCGCCGCT GCTGGACGTG GGCGGGTACC CCCTGTTCGA GACGTGGTCC
ACGCTGCTGG CGATCACGCT GGGCTGCATG GGGCTGCCGC ACGTCATCAT GCGCTTCCAC
ACCAGCCCCA CGGCGCGGAC GGCGCGCCGG GTGGCGGTGG GCGTGATCGC CCTGCTGGGG
CTGTTCTACC TGTTCCCGAC GGTGTACGGG CTGCTCGGGC GGGTGCTCAC CCCGCACCTG
GTGCTGCTCC AGGGCACCGA CACGGTCGCG GTGGTGCTGC CCGCCCAAGC GGTGCCGGGC
TCGGTGGGGA CGGTGCTGAC GGCGCTGGTG GCGGCGGGGG CGTTCGGCGC GTTCCTGTCC
ACCTCCTCCG GACTGCTGCT GGCCCTGGCG GGCGGGCTGT CGCACGACCT GTTCCAGGGC
AGCGTGCCGC GGCTGCGCCT GGCGGTGGCC GTGGGCGCGT GCGTGGCGGT GCTGCTGGCG
CTGCCCGCGC AGCTGATCGA CATCAACGTG CTGGTGGTGT GGGCGTTCAC GGTGGCGGCC
TCGACGTTCT GCCCGCTGCT GGTGCTGGGC ATCTGGTGGC GGCGGCTGAC GCTGGCCGGG
GCCGCGTCCG GGCTGGTCGT GGGGGCGGTG ACCGCGACCG GCGCGGTGGG GTGGTCGATC
CTGCTGCCGC CCCCGGCCGG GTGGCTGGCG GTGCTGCTGG CACAGCCCGC GGCGTGGACC
ATCCCGCTGG CCTTCGGGAC GATGGTGCTC GTCTCGCGGC TGACCAGGCC GCCGGACTGG
GCCGAGCACG CCGTGCTGCG CCTGCACTCC CCCTGA
 
Protein sequence
MVIAVLAIGL VLLTSLVIGV YGVRAARSTS DFLVASRRIS PNWNAMAIAG EYLSAASVLG 
LAGLLLKNGL GTMWYAVGFA AGYVAVVALV AGPMRRSGAF TVPDFAEYRL GAPRLRKLCG
CVVLVIMLLY LVPQFKGAGV VLALVSGTPY WVGVLLAGLV VSASIAAGGM RSATYVQAFH
YVVKLAFIAI PAVYLVVHAG PGTRAEALNP EWGTHFPETT PVEFTVGTRF SLDEPVTVTA
SGGERVEFAA GEHTVEAGAE YVFPEGAPIP HPAGLPELGS ERWSSPLLDV GGYPLFETWS
TLLAITLGCM GLPHVIMRFH TSPTARTARR VAVGVIALLG LFYLFPTVYG LLGRVLTPHL
VLLQGTDTVA VVLPAQAVPG SVGTVLTALV AAGAFGAFLS TSSGLLLALA GGLSHDLFQG
SVPRLRLAVA VGACVAVLLA LPAQLIDINV LVVWAFTVAA STFCPLLVLG IWWRRLTLAG
AASGLVVGAV TATGAVGWSI LLPPPAGWLA VLLAQPAAWT IPLAFGTMVL VSRLTRPPDW
AEHAVLRLHS P