Gene Ndas_1859 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_1859 
Symbol 
ID9245709 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp2270005 
End bp2271612 
Gene Length1608 bp 
Protein Length535 aa 
Translation table11 
GC content74% 
IMG OID 
ProductABC transporter related protein 
Protein accessionYP_003679793 
Protein GI297560819 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0744608 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.494788 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCACCA AGTCCGCCGT CACACTCACC GACCTCACCT TCGCCTGGCC GGACGGCACG 
GTCGCGATCG ACCACGTGAG CGGCACCCTC ACGACCGGGC GCACCGGGCT CGTCGGCCGC
AACGGCGCGG GCAAGTCCAC CCTGCTGCGC CTGATCGCCG GTCACCTGCG CCCCACCTCG
GGGCGCGTCG ACGCGGTCGG CGACGTCGGC TACCTCCCGC AGACCCTGAC CCTGGGCACG
GAGGCGACGG TCGCCGAACT GCTCGGCATC GACGCCACCC TCGCCGCGCT CCGCGCGATC
GAGGCGGGCG ACGCCGACGA ACGCCACTTC GACGCGGTCG GCGACGACTG GGACATCGAG
GCGCGCGCCG ACGAGGCCCT GCACGAGATC GGGTTCACCG CCGCCGACCT CGACCGCCGC
GTCGCGCAGG TCTCCGGCGG TGAGGCGGTA CTCATCGCCG TCACCGGCAT GCGCCTGCGG
CGCACCCCGA TCACCCTGCT CGACGAGCCC ACCAACAACC TCGACCGGCC CACGCGGGCC
AGGCTCGCCG CGTTCGTCGA CACCTGGCCC GGCACCCTCG TCGTCGTCAG CCACGACCTC
GAACTGCTCG AACACATGGA CAGCACCGCC GAACTCCACG CCGGGAGCCT CGACGTGTTC
GGCGGCCCCT ACAGCGCCTG GAAGGAGCAC CTCGAACAGG AGCAGGCCTC CGCCGTCCAG
GCGGCCCGGT CCGCGCAGCA GGCCCTCAAG GTCGAGAAGC GCCAGCGCGT GGAGGCCGAG
ACCAAGCTCG CCCGCCGCGA GCGCACGGCC AGGAGGACGC AGAAGGACGG CGGCATCCCC
AAGATCCTCG CGGGCAACCG GGCCAGCAAG GCGCAGGCCT CGGCCGGGGC GATGCGCTCG
ACCCTCGACG ACAAGGTCCA GGCCGCGCAG GCCGCGGTCG ACGCCGCCGA CGCCCGCGTA
CGCGAGGACG AGCACATCCG CCTCACGCTG CCCGATCCGG ACGTGCCGCG CGGCCGCCGC
CTGGCGGAGT TCCACGCCGA GGGACGCACC GTCGTCGTCC AGGGTCCCGA ACGCGTCGCC
CTGGTCGGCC CCAACGGCGC CGGGAAGTCG ACCCTGCTCC AGCAGCTCGT CCACGGCGGC
GATCCGGTTC CGGGCCGCGC GCACGGCACG CTCCTGACCG ACCGCGTGGG GTACCTGCCC
CAGCGCCTGG ACGGCCTTGA CGACGCCGCG AGCGCGCTGG AGAACGTGCG GGCGGTCGCC
CCCGGCACGC CGCCGGGGGA GGTCCGCAAC CAGCTCGCCC GCCTGCTGCT GCGCGGGGAC
GGCGTCGACC GCCCCGTCGC CACGCTCTCC GGCGGCGAGC GGTTCCGCGT CTGCCTCGCC
ACGCTGCTCC TGGCGGAGCC GCCCGCGCAG CTGCTCGTCC TGGACGAGCC GACGAACAAC
CTCGACACCT CCAGCGTCGA GCAGCTCGCC GAGGCCCTCG ACGCCTACCG CGGCGCGCTC
CTGGTCGTCA GCCACGACCA CGGGTTCCTG CGCAGGATCG GGATCGACAC CGTCCTGGAG
ATCGGCCGGG AGGGCGGCCT GCGCCAGCGC GCCGAATTGG GGGACTGA
 
Protein sequence
MSTKSAVTLT DLTFAWPDGT VAIDHVSGTL TTGRTGLVGR NGAGKSTLLR LIAGHLRPTS 
GRVDAVGDVG YLPQTLTLGT EATVAELLGI DATLAALRAI EAGDADERHF DAVGDDWDIE
ARADEALHEI GFTAADLDRR VAQVSGGEAV LIAVTGMRLR RTPITLLDEP TNNLDRPTRA
RLAAFVDTWP GTLVVVSHDL ELLEHMDSTA ELHAGSLDVF GGPYSAWKEH LEQEQASAVQ
AARSAQQALK VEKRQRVEAE TKLARRERTA RRTQKDGGIP KILAGNRASK AQASAGAMRS
TLDDKVQAAQ AAVDAADARV REDEHIRLTL PDPDVPRGRR LAEFHAEGRT VVVQGPERVA
LVGPNGAGKS TLLQQLVHGG DPVPGRAHGT LLTDRVGYLP QRLDGLDDAA SALENVRAVA
PGTPPGEVRN QLARLLLRGD GVDRPVATLS GGERFRVCLA TLLLAEPPAQ LLVLDEPTNN
LDTSSVEQLA EALDAYRGAL LVVSHDHGFL RRIGIDTVLE IGREGGLRQR AELGD