Gene Ndas_3546 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3546 
Symbol 
ID9247415 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp4255209 
End bp4256738 
Gene Length1530 bp 
Protein Length509 aa 
Translation table11 
GC content70% 
IMG OID 
Productsodium:dicarboxylate symporter 
Protein accessionYP_003681453 
Protein GI297562479 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.286531 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGAGTC CCGGACGCGC CGCCGCCCCC GGGCAGAGTG CCGACAAGGC ACCGCCCAGG 
CGCAGGAAGC CCATCTACCG TCACCTGTAC TTCTGGGTGC TCGTCGGTAT CGCGCTCGGC
ATCGCCGTCG GTCTGCTCTT CCCGGCCTGG GCCGCGGACA TGCGCTGGCT GGCCGACCTG
TTCATCAAGC TCGTCAAGGT CGTCATCGCC CCGACCATCT TCTGTACCGT CGTCGTGGGC
ATCGCCGGTC TGGGCAACCT GGCCAAGGCG GGCGGGCTGG CCCTGCGCAC CCTGGTGTAC
TTCACCGCGC TGACCGTGCT CGCCCTGGCC ATCGGCCTGG TCACCGTCAA CGTCATGCGT
CCGGGCGTGG GCCTGAACGT GTCCTTCGAC GAGGCCGACG CCGCGGACAC CATCGCCGAG
GCCGAGGCGG GCGGAACCGG TTTCTCCGGC TTCATCCTCC ACATGATCCC CGACTCGTTC
TTCTCCGCGT TCGTGGAGGG CGAGCTGATC CAGGTCCTGG TCCTGGCCAT CCTGGTCGCC
TGCGCCCTGA CCATGCTCGG CAGGCGCGGC GAGCCCGCCG TCCGCGCCCT GGACACGATG
TCGCACATCA TGTTCGGCGT CATCAAGATC GTCATGTACG CGGCGCCCGT CGGCGCCTTC
GGCGGCATGG CCTTCACCAT CGGCGAGTAC GGCGGCCAGG TGCTCAGCAG CCTCGCCTAC
TTCATGCTCA GCTTCTACGT CACCTGTGTC CTGTTCATCG TCGTCGTCCT GGGGCTCGTC
AGCCGCCTGG CCGGGTTCAG CCTGTTCCGC CTGGTGCGCC TGATCCGCGA CGAGCTCCTC
ATCGTGCTGG GCACCTCCTC CAGCGAGTCG GTGCTGCCGC GCATGATGAC CAAGCTGGAG
GCCGCGGGCG CCAAGAAGTC CGTGGTGGGC CTGACCATCC CGACCGGGTA CTCCTTCAAC
CTGGACGGCA CCGCGATCTA CATGACCATG GGCGCCATCT TCATCGCCCA GGCCACCGGT
TCGGACATCT CCGTCTGGAC CCAGGTCGGA CTGCTGCTGT TCATGCTGCT CTCCAGCAAG
GGCGCGGCGG GCGTCAGCGG TGCCGGTCTG GTGACCCTGG CCGCCTCGCT GGCCGCGTTC
GGCGACGTGA TCCCGCTGGC GGGCATCGCG CTCATCGTCG GCATCGACCG CTTCATGTCC
GAGGGCCGCG CCCTGACCAA CCTCATCGGC AACGCGGTGG GCACCCTGGT CATCGCCCGC
TGGACCGGCG GCCTGGACCG CGAGCGCCTC ACCCACACGC TGAGGAACCC GGACAGCATC
GACATGGACG CCCTCATGAG CTACGACAAG CCCGCCGAGG GTTCCGGTGG GGCCGCGGAG
GCCTCCGGCA CGGCCGGGGG CGACGGCTCG GAGGCCGAGG AGGCCAGGAC GGCCGACTCC
GACACGGCCG CCGGGGAGTC GCGGACCGCC GACGCCTCCG GTGGGGGAGG CGGGGAGCGG
CACGCCCCCG TGGGCGCACC GCGCGGCTGA
 
Protein sequence
MPSPGRAAAP GQSADKAPPR RRKPIYRHLY FWVLVGIALG IAVGLLFPAW AADMRWLADL 
FIKLVKVVIA PTIFCTVVVG IAGLGNLAKA GGLALRTLVY FTALTVLALA IGLVTVNVMR
PGVGLNVSFD EADAADTIAE AEAGGTGFSG FILHMIPDSF FSAFVEGELI QVLVLAILVA
CALTMLGRRG EPAVRALDTM SHIMFGVIKI VMYAAPVGAF GGMAFTIGEY GGQVLSSLAY
FMLSFYVTCV LFIVVVLGLV SRLAGFSLFR LVRLIRDELL IVLGTSSSES VLPRMMTKLE
AAGAKKSVVG LTIPTGYSFN LDGTAIYMTM GAIFIAQATG SDISVWTQVG LLLFMLLSSK
GAAGVSGAGL VTLAASLAAF GDVIPLAGIA LIVGIDRFMS EGRALTNLIG NAVGTLVIAR
WTGGLDRERL THTLRNPDSI DMDALMSYDK PAEGSGGAAE ASGTAGGDGS EAEEARTADS
DTAAGESRTA DASGGGGGER HAPVGAPRG