Gene Ndas_0990 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_0990 
Symbol 
ID9244836 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp1210996 
End bp1212678 
Gene Length1683 bp 
Protein Length560 aa 
Translation table11 
GC content68% 
IMG OID 
ProductSSS sodium solute transporter superfamily 
Protein accessionYP_003678940 
Protein GI297559966 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.394544 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCGTGC TAGCACAGGA CTCCCTGCGG CTGGACGCCA ACGCCATCGA CTACACCCTG 
CTCGCGGTGT ACTTCGCGTT CGTGCTGGGC ATCGGCTTCA TCGCCCGGCG TTCGGTCTCC
AACAGCCTCG ACTTCTTCCT GTCGGGGCGC TCCCTGCCCG CCTGGGTGAC GGGTCTGGCC
TTCATCGCGG CCAACCTCGG CGCCATCGAG ATCATCGGCA TGTCGGCCAA CGGCGCCAAC
TACGGCATGC CGACCATGCA CTACTTCTGG ATCGGCGCCG TCCCGGCGAT GCTGTTCCTG
GGCCTGGTCA TGATGCCGTT CTACTACGGC TCCAAGGTCC GCAGCGTCCC CGAGTTCATG
CTCCGCCGCT TCGGCACCGC CGCCCACCTG CTCAACGGGA TCAGCTTCGC GGTCGCCCAG
ATCCTGATCG CGGGCGTCAA CCTCTTCCTG CTGGCGACCA TCGTGGACGC GCTGCTGGGC
TGGCCCCTGT GGCTGTCGCT CCTGGTCGCC GCCGCGATCG TCCTCAGCTA CACCGCGCTG
GGCGGGCTCT CCGCGGCGAT CTACAACGAG GTGCTCCAGT TCTTCGTCAT CGTCGCGGCG
CTGCTGCCGC TGACCCTGGC GGGGCTGAAC CGGGTCGGCG GCTGGTCGGG TCTGGTGGAG
GAGGTCACCG CCTCCCCGCA GGGCGCCGAA CAGCTCTCCG CCTGGCCGGG CAACGCGCTG
ACCGGTTTCG GCGACAGCTT CCTGAGCATC CTCGGCATCG TCTTCGGCCT CGGCTTCGTC
CTCGCGTTCG GCTACTGGAC GACGAACTTC GTCGAGGTGC AGCGCGCCAT GGCCTCCAAG
AGCATGTCGG CCGCGATGCG CACCCCCATC ATCGGCGCCT TCCCCAAGCT GTTCATCCCG
TTCATCGTCA TCATCCCCGG GATGATCGCG GGCGTGAGCG TCTCGGAGAT GGTCCAGCTC
AAGGCCGGTG AGAACCCCGG CGTCGACTAC AACGACGCGA TCCTGCTGCT GATGCGCGAC
GTGCTGCCCA ACGGTCTGCT GGGCGTGGCC CTGGCCGGTC TGCTCGCGTC GTTCATGGCC
GGTATGGCCG CCAACCTCAG CTCGTTCAAC ACCGTGTTCA CGTACGACAT CTGGCAGGCC
TACGTCGTCA AGAACCGGCC CGACTCCTAC TACCTGGGCA TGGGCCGGTG GGTCACGGTC
GGCGCCACCG TCGGCGCCGT GGGCACGGCG TTCATCGCCT CGGGCTACTC CAACCTGATG
GACTACCTGC AGCAGCTGTT CTCGTTCTTC AACGCGCCGC TGTTCGCCAC GTTCATCCTC
GGTATGTACT GGAAGCGGAT GACGCCCCAC GCCGGTTGGA GCGGCCTGGC GGCGGGAACC
CTGGCCGCCG TGGGCGTGTT CCTGCTCGCC GAGACCGGAG TACTGGCCCT GTCGGCGCAG
GGCGCGAGCT TCGTCGGCGC GGGAGCGGCC TTCGTGGTCG ACATCCTCGT CAGCGTCGTG
GTCACCATGT TCACCCGGCC CAAGCCCGAC TCCGAGCTGG TGGGCCTGGT GCACTCGCTG
ACCCCGCGCG AGTCGCGCAA GGCCTCCACC ACCGGTGAGG ACGCCGGCTG GTACCGCCGA
CCGGGGCTGC TGGCCGGGAT CGCCCTGGTG CTCGTCATCG TCCTGAACAT CATCTTCGCC
TGA
 
Protein sequence
MTVLAQDSLR LDANAIDYTL LAVYFAFVLG IGFIARRSVS NSLDFFLSGR SLPAWVTGLA 
FIAANLGAIE IIGMSANGAN YGMPTMHYFW IGAVPAMLFL GLVMMPFYYG SKVRSVPEFM
LRRFGTAAHL LNGISFAVAQ ILIAGVNLFL LATIVDALLG WPLWLSLLVA AAIVLSYTAL
GGLSAAIYNE VLQFFVIVAA LLPLTLAGLN RVGGWSGLVE EVTASPQGAE QLSAWPGNAL
TGFGDSFLSI LGIVFGLGFV LAFGYWTTNF VEVQRAMASK SMSAAMRTPI IGAFPKLFIP
FIVIIPGMIA GVSVSEMVQL KAGENPGVDY NDAILLLMRD VLPNGLLGVA LAGLLASFMA
GMAANLSSFN TVFTYDIWQA YVVKNRPDSY YLGMGRWVTV GATVGAVGTA FIASGYSNLM
DYLQQLFSFF NAPLFATFIL GMYWKRMTPH AGWSGLAAGT LAAVGVFLLA ETGVLALSAQ
GASFVGAGAA FVVDILVSVV VTMFTRPKPD SELVGLVHSL TPRESRKAST TGEDAGWYRR
PGLLAGIALV LVIVLNIIFA