Gene Ndas_2994 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_2994 
Symbol 
ID9246847 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp3577285 
End bp3579183 
Gene Length1899 bp 
Protein Length632 aa 
Translation table11 
GC content74% 
IMG OID 
ProductNa+/solute symporter 
Protein accessionYP_003680910 
Protein GI297561936 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.77047 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.396474 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGCCG CCGCAGCGGC CGCGGGCGGT GCCGTCACGG GGGACGGCAC CGCCGCGGTG 
GCCGTCGGAG TGGCGGCCGT CACCGAAACG CCCGCCGCTG CCGCACCGCC CGCGTCCACC
GTCCCGGTCG CGCCCGCCGA ACCGCCCGCT GCCGAAGCAC CGTCCGCGTC CACCGTTCCG
TCCGCGGCCG CCGACCCCCT GGACGCCCTG GGCGGCGCGC TGGGGAGCGA GTTCGGCGTC
GCGCACGCCT GGACGGTGGG CCTGTGCGCC CTGTTCGTCC TGGTCACCCT CGCCGTCACC
GTCCGCGCCA GGCGCACCAC CCGGGGCGCG GTCGACTTCT ACGCGGGCGG ACGCGGCTTC
TCCAGCACCC AGAACGGGCT CGCCCTGACC GGCGACTACC TGTCCGCGGC CTCCTTCCTG
GGCATCGCCG GGATGATCTC CCTCCAGGGC TACGACGGCT TCCTGTACTC CATCGGCTTC
CTGGCGGCCT GGCTGCTGGT GCTGCCGATG GCCCAGCTGG TGCGCAACAC CGGCCGCTTC
ACCATGGCCG ACCTGCCCGC CTTCCGCATG AACCGGATGC GGGTGCGCCT GGCCTGCACC
GTCTCCACCG TCACCGTGTG CGTGTTCTAC CTGGTCGCGC AGATGGTCGG CGCCGGAGCC
CTGGTCGCGG TCCTGCTGGG CCTGCACGAC GGCGGGACCT TCCTGGGGAT GGGCGCCGAG
CAGGCCCGCA CGGGCGTGAT CGTGCTCGTG GGCGTGCTCA TGATCGTCTA CGTCATGTAC
GGCGGCATGA AGGCGGCCAC CTGGCTCCAG ATCATCAAGG CGGTCGCGCT GCTGGCCGCC
ACCGGACTGC TCACCGCCCT GGTGCTGGCC CTGTTCGCCT TCGACCCGCG CGCCCTGCTC
GGCGGGGCCG CCGAGGCCAG CGGGCACGGT CAGGCCTTCC TGGAACCGGG GCTGCGCTTC
GGGGTGGAGG TCTCCGGCGA CCCCGCCCGG ACCCTGTTCA ACAAGCTCGA CCTCCTCAGC
CTCGGGCTGG CCCTGGTGCT GGGCACCGTC GCGCTGCCGC ACATCCTCAT CCGCTTCTAC
ACCGTGCCCG ACGGCCGGGC GGCGCGCTCG TCGGTCAACC GCACGATCGT CATGGTCGGG
GCCTTCTACC TCATGACGCC GGCCCTGGGC TTCGGCGCCG CGGCGCTGGT CGGCTCCGAG
CGCATCGCGG CCGCGGACCC CTCGGGCAAC ACCGCGGTGC CGATGCTCGC CGAGGAGGTG
GGGCGGCTGA CCGCCGGTCC CGCGGGCGCG GCCGTGCTGC TCGCGCTGGT CTCGGCGGTC
GCGCTGGCCA CCGTCCTCGC CGTCATCGCC AGCCTCACCC TGGCCTCGTC CTCCTCGATC
GCGCACGACC TGTTCGGCCA CATCCTCATG TGGGGCAGGC CCCGGGAGTC GCAGGAGGTG
GGTGTGGCGC GGCTCTCCGC CTGCGTGATC GGCGCCGTGG CCGTCGTGCT GGCCGTGCGG
GCCCAGGACA TGAACGCGGC CTTCCTCGTG GGGCTGGCCT TCGCCGTCGC GGCCGCGGCC
AACCTGCCGG TCATCGTGCT CACCATGTTC TGGCGCCGCT TCAACACCAG GGGTGTGGAG
TGGGGCGTCT ACGGCGGCCT GTCCGCCACC CTGCTGCTCA TGCTGCTGTC GCCGGTGATG
TCGGGCAGGA CCGACCCCGT CACGGGGGAG AACCTGTCGG TGCTGCCCGC CTGGATCGAC
GTCCAGCTCT TCCCGATGGA GAACCCGGCG CTGCTGGCGG TGCCGTTCGG CTTCGCGTGC
GCGGTCGTGG GCAGCCTGCT CTCGCCGGAG CGCGACACCG CGCGCTTCAC CGAGCTGCGG
GTGCGCTCCC TGACCGGGTG GGGCGTCGAG CGGGACTGA
 
Protein sequence
MSAAAAAAGG AVTGDGTAAV AVGVAAVTET PAAAAPPAST VPVAPAEPPA AEAPSASTVP 
SAAADPLDAL GGALGSEFGV AHAWTVGLCA LFVLVTLAVT VRARRTTRGA VDFYAGGRGF
SSTQNGLALT GDYLSAASFL GIAGMISLQG YDGFLYSIGF LAAWLLVLPM AQLVRNTGRF
TMADLPAFRM NRMRVRLACT VSTVTVCVFY LVAQMVGAGA LVAVLLGLHD GGTFLGMGAE
QARTGVIVLV GVLMIVYVMY GGMKAATWLQ IIKAVALLAA TGLLTALVLA LFAFDPRALL
GGAAEASGHG QAFLEPGLRF GVEVSGDPAR TLFNKLDLLS LGLALVLGTV ALPHILIRFY
TVPDGRAARS SVNRTIVMVG AFYLMTPALG FGAAALVGSE RIAAADPSGN TAVPMLAEEV
GRLTAGPAGA AVLLALVSAV ALATVLAVIA SLTLASSSSI AHDLFGHILM WGRPRESQEV
GVARLSACVI GAVAVVLAVR AQDMNAAFLV GLAFAVAAAA NLPVIVLTMF WRRFNTRGVE
WGVYGGLSAT LLLMLLSPVM SGRTDPVTGE NLSVLPAWID VQLFPMENPA LLAVPFGFAC
AVVGSLLSPE RDTARFTELR VRSLTGWGVE RD