Gene Ndas_4509 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_4509 
Symbol 
ID9248389 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp5345847 
End bp5347163 
Gene Length1317 bp 
Protein Length438 aa 
Translation table11 
GC content69% 
IMG OID 
ProductGeneral substrate transporter 
Protein accessionYP_003682403 
Protein GI297563429 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGTGG ACACCGCACA GGCTCGGCCG CCGAGGGCCA AGGTCCTCAT CGCCAGCCTG 
ACCGGAAGCA CGATCGAGTG GTTCGACTTC TTCCTCTACG GCACCGCCGC CGCCCTCGTC
TTCGACGAGC TGTTCTTCCC CTCGGACGAC CCGTTCGTCT CGCTGATGCT GTCGTACCTG
ACCTTCTCGC TGACGTTCTT CATCCGTCCG CTGGGCGGCG TGGTCTTCTC CCACATCGGT
GACCGGATCG GCCGCAAGAG GACCCTGATC ATCACGCTGA CCCTCATGGG CGGCGCGACC
ATGCTCATCG GCCTGCTGCC GACCTACGAC AGCGTCGGGA TCCTCGCCCC GATCCTGCTC
GTCGTGCTGC GGATCATCCA GGGCCTGGGC ATCGGCGGCG AGTGGGGCGG CGCCCTGCTG
CTCGCCTACG AGTACGCGCC CGAGAACCGG CGCGGCCTCT TCGGCAGCGT GCCGCAGATG
GGCATCACCT CGGGCATGCT GCTGGCCAGC CTGGTGCTGA CCCTGATGTC GCTGCTGCCC
GACGACCAGT TCGCGACGTG GGGCTGGCGC GTGCCGTTCG TCGGCAGCGT CCTGCTGGTG
CTGCTGGGAC TGTGGATCCG CTCGGGGATC GACGAGACCC CCTCGTTCAG GAAGGCCAAG
GAGGAGGGCG AGGTCGCCGA ACTGCCGGTG GTGGAGACCT TCCGGTTCCA CTGGCGGGCC
GTGCTGGTCG CCGTCGGCGC CAAGGTCGTG GAGACCGCCC CCTTCTACAT CTTCGGCACC
TTCGTGGTGA GCTACGCGAC GGGCACGCTG TCCTTCGACA ACACCTCCGC GCTCAACGCC
GTGACCGTCG GGGCGATCGT GGCCACCGTG TGCATCCCGA TCGCCGGACG CCTGTCGGAC
ACCTTCGGCA GGCAGCGGGT GTACCTGGTC GGCGCGGTGC TGCTGGCCCT GTTCATCGCG
CCCTACTTCC TCATGCTGGG TACCGGCAGC ACGCTGATGC TGGTCCTGGC GACCGTGATC
GGGCTGGGCG TCCTGTGGGC ACCGGTCACC GCCACCATCG GCACCCTGTG CTCGGAGATC
TTCTCCACCC GGGTGCGCTA CACGGGCGTC ACCCTGGGTT ACCAGATCGG CGCGGCGGCC
GCGGGCGGCA CCGCCCCGCT GATCGCCACC TGGCTGCTGT CGCGGTTCGA CAACTCGTGG
GTTCCGGTCG CGGGCTACCT GGTTCTCACC GCGGTGGTGT CCATCGTCGC CGTGGCGCTG
GCCGGGCGGG CCTCCAACGC CGAGGAGAGG CACCTGGCGG CCTCCGAGAA GGGCTGA
 
Protein sequence
MSVDTAQARP PRAKVLIASL TGSTIEWFDF FLYGTAAALV FDELFFPSDD PFVSLMLSYL 
TFSLTFFIRP LGGVVFSHIG DRIGRKRTLI ITLTLMGGAT MLIGLLPTYD SVGILAPILL
VVLRIIQGLG IGGEWGGALL LAYEYAPENR RGLFGSVPQM GITSGMLLAS LVLTLMSLLP
DDQFATWGWR VPFVGSVLLV LLGLWIRSGI DETPSFRKAK EEGEVAELPV VETFRFHWRA
VLVAVGAKVV ETAPFYIFGT FVVSYATGTL SFDNTSALNA VTVGAIVATV CIPIAGRLSD
TFGRQRVYLV GAVLLALFIA PYFLMLGTGS TLMLVLATVI GLGVLWAPVT ATIGTLCSEI
FSTRVRYTGV TLGYQIGAAA AGGTAPLIAT WLLSRFDNSW VPVAGYLVLT AVVSIVAVAL
AGRASNAEER HLAASEKG