Gene Ndas_1594 details

Gene Information

Locus tag: Ndas_1594
Symbol:
ID: 9245444
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 1951561
End bp: 1953366
Gene Length: 1806 bp
Protein Length: 601 aa
Translation table: 11
GC content: 76%
IMG OID:
Product: ABC transporter related protein
Protein accession: YP_003679529
Protein GI: 297560555
COG category:
COG ID:
TIGRFAM ID:
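
The start, end, gene length, and protein length fields above are mutually consistent, which can be checked with a short sketch. It assumes the coordinates are 1-based and inclusive and that the CDS ends with a stop codon that is not counted in the protein length; neither convention is stated on this page. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length_bp(1951561, 1953366)
print(length)                     # 1806
print(protein_length_aa(length))  # 601
```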


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.172161
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.191174
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCACGC CGGTGGAGGA CTCCGCGAGC GAACACGGGG AACTGCTCCC CACCGCCTCG 
GCCCGCCGCA CGTGGGCCGT GCTGGGCGAG GGGGTGCGCG GCACCGGCTG GGGCGCCCCG
CTTGCCCTGG CCACCGTGCT CGCGGGCAGC GCCGCCGGAC TGGTCGCCCC CTGGGTGCTG
GGCCGGATGG TCGACGACAT CTCGGCGCAG CGGGGGCTGG ACACCGTCCT GTCCTCGGTC
CTGCTCATCG CCGGGGCGGG GCTGCTCGGC GGCCTGCTCA CGGGTGTGGG CAGCTGGCTG
GTCAGCCGGG TCGGCGAGAC CGTCCTGGCC CGCCTGCGCG AAAGCGTCAT GGACCGCGTG
CTGCGCATGC CCGCCGCCCG CCTGGAGCGG GTGCGCATCG GCGACCTGCT CTCCCGGGTC
GGCGACGACG TCGCCGTGGT CACCGCCAGC ATCGCCCGCA GCGGCCCCGA CGCGGTCGTG
GCCCTGGCCA CCGTCCTGCT GACCGCGGTG GGCATCACCG CCCTGGACTG GCGCCTGGGC
CTGGCCGCGA TGCTGTGCGT GCCGGTCTAC GTGACGGCCG TGCGCTGGTA CCTGCCCCGT
TCGGGCCCCT ACTACGCCCG CGAGCGCGTG GCCATGGGCG AGCGCTCGCA CGCGATGATG
GGAGCCCTGC GCGGGCGTTC CGCCGTGCGC GCCTACCGCC TGGAGGACGA GCACGAGTCC
CGGGTGCGCG ACCGCTCGGC CGCCGCCATG GACATCACGG TGTCGGTGTT CCGGCTGTTC
ACCCGCTTCG GGTCGCGGCT CAACGCCGCC GAGTGCGTCG GACTGACCTC GGTGCTGGTG
GCCGGGTTCT GGCTGGTGCG CGCCGACCTG GCCAGCGTCG GCATGGTCAC CGCCGCCGCC
CTGTACTTCC ACCGCCTGTT CGGCCCGCTG ATCTTCCTGG TCATGAACTT CAACGAGGTG
CAGTCGGCCG GGGCGAGCCT GGCCCGTCTG GCCGGCGTGG TCGACCTCCC GGTGGAGGCC
GACCCGGGCG CGGAGGAGGG CCCGGCCGGT TCCTCGGTCC GCGTCGCGGG GGTCACCCAC
TCCTACGGCA GCGGCGGCCC GGCCGCGCTG GAGGAGGTGT CGCTGGAGGT CGCCCCGGGT
GAGCGCGTGG CCCTGGTCGG GGCCAGCGGC GCGGGCAAGA CCACGCTGGC CGCCCTGGTC
AGCGGCCTGC GCACGCCCAC GACCGGAACC GTGTACCTGG GCGGGATCCC CCTGGAGGAG
CTGGGGGAGC GGCGGGTGCG CGAGCACGTC TTCCTCGTCA GCCAGGAGAC CCACGTGTTC
GCCGGAACCC TGCTGGAGGA CCTGCGCCTG GCCCGCGCCG GGGCCGACGC GGCCGAGGCG
GAGGCCGCCC TGGAGGCCGT CGGCGCCCTG GAGTGGGTGC GCGCCCTGCC CGAGGGACTG
GACACGGTGG TCGGCGAGGG CGGGCACGCC CTGACCGCCG AGCAGGCCCA GCACCTGGCA
CTGGCCCGGC TGGTGCTGGC CGACCCGGAC GTGGCCGTCC TGGACGAGGC CACCGCCGAG
GCGGGCAGTT CCGGCGCCCG GCGACTGGAG CGCGCCGCCG CCGCGGCCAC CGCCGGGCGC
ACCACGCTGG TGGTGGCGCA CCGGCTGACC CAGGCGCAGA CCGCCGACCG CGTGGTGGTC
ATGGACCGGG GCCGCGTCGT CGAGCAGGGG ACGCACGCCG AGCTGGTCGC CGCCGGGGGG
CGCTACGCCG ACCTGTGGCG CAGCTGGCGG GGCGCGCCGG ACGACGGGGA GCCGCTCTCC
CGCTGA
 
Protein sequence
MSTPVEDSAS EHGELLPTAS ARRTWAVLGE GVRGTGWGAP LALATVLAGS AAGLVAPWVL 
GRMVDDISAQ RGLDTVLSSV LLIAGAGLLG GLLTGVGSWL VSRVGETVLA RLRESVMDRV
LRMPAARLER VRIGDLLSRV GDDVAVVTAS IARSGPDAVV ALATVLLTAV GITALDWRLG
LAAMLCVPVY VTAVRWYLPR SGPYYARERV AMGERSHAMM GALRGRSAVR AYRLEDEHES
RVRDRSAAAM DITVSVFRLF TRFGSRLNAA ECVGLTSVLV AGFWLVRADL ASVGMVTAAA
LYFHRLFGPL IFLVMNFNEV QSAGASLARL AGVVDLPVEA DPGAEEGPAG SSVRVAGVTH
SYGSGGPAAL EEVSLEVAPG ERVALVGASG AGKTTLAALV SGLRTPTTGT VYLGGIPLEE
LGERRVREHV FLVSQETHVF AGTLLEDLRL ARAGADAAEA EAALEAVGAL EWVRALPEGL
DTVVGEGGHA LTAEQAQHLA LARLVLADPD VAVLDEATAE AGSSGARRLE RAAAAATAGR
TTLVVAHRLT QAQTADRVVV MDRGRVVEQG THAELVAAGG RYADLWRSWR GAPDDGEPLS
R
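
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It builds the codon table from the NCBI base ordering (T, C, A, G); translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as starts. This sketch does not handle alternative start codons, which is harmless here because the gene begins with ATG. The names `translate` and `gc_fraction` are illustrative, not part of any database tool.

```python
from itertools import product

# Codons in NCBI order (first base varies slowest, bases ordered T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string (whitespace ignored) up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (whitespace ignored)."""
    seq = "".join(dna.split()).upper()
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases and last 15 bases of the gene sequence listed above.
print(translate("ATGAGCACGC CGGTGGAGGA CTCCGCGAGC"))  # MSTPVEDSAS
print(translate("CCGCTCTCC CGCTGA"))                  # PLSR
```

Pasting the full gene sequence into `translate` should reproduce the protein sequence listed above, and `gc_fraction` on the same input gives a value to compare against the GC content field.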