Gene Ndas_1046 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_1046 
Symbol 
ID9244892 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp1290391 
End bp1291428 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content77% 
IMG OID 
Productoligopeptide/dipeptide ABC transporter, ATPase subunit 
Protein accessionYP_003678995 
Protein GI297560021 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.912273 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.377404 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGACG AGGCCCTGCT GTCGGTCCGC GACCTGCGGG TCACCCTGCC CGGACGGCGA 
GGGGGCGGGG TCCGCGCCGT GCGCGGGCTC TCCTTCGACG TGCGCCCCGG CGAGGTGCTC
GCGCTCGTGG GGGAGTCCGG GGCCGGGAAG TCGGTCACCG CCCGCGCCGT CCTGGGCATG
GCGCCCTACG GCGCCTCCGT CACCGGAAGC GTCCGCCTGG ACGGACAGGA GCTGGTCGGC
GCACCGCCCG CCGTCCTGCG GACCCTGCGC GGGCGGCGGA TGTCCCTGGT GCCCCAGGAC
GCGCTCGCCG TGCTCAGCCC CGTGCACACC GTGGGCGCCC AGCTCGTACG CGCCCTGCGC
TCCGTGCGTC GGATGAGCCG GGCCGCGGCG TGGGAGCGGG CGGTGGCCGC ACTGGACCGG
GTCGGCATCC CCGACGCCGC CCGACGCGCG CACGCCTACC CGCACGAGTT CTCCGGCGGC
ATGCGCCAGC GGGCGGTCAT CGCGATGGCC ACGGTCAACG AACCCGACCT GGTGTTCGCC
GACGAGCCCA CGACCGCCCT CGACCCCAGG ATGCAGGCCC GGACGCTGGA ACTGCTGTGC
GGGCTGCGGG AGCGGACCGG CACCTCGGTC GTCCTGGTCA CCCACGACCT GGGCGTCGTC
GGCGGCTACG CCGACCGGGT GGTGGTCGTC TACGCCGGAC GCCACGTCGA GTCGGGCCCG
GTGGGGCCGG TCCTGACCCG GCCGCGCGCC CCCTACACCG CCGGCCTGGT CGCCGCGCTG
CCCCGGCCCG GGGCCGGGGA CCGCCGCCTG CCCGCCATCG CCGGAACGCC CCCCTCACCG
GAGGCGCTGC CCGGCGGCTG CGCCTTCGCG CCCCGCTGTC CGCTGACGGA GGATCGGTGC
CACGCCGAGG AGCCCTCACC CGCGGTAGCC GGGGAGTCGG GCAGGCTGGT CTCGTGCCAC
CGCTGGCAGG ACCTGCCCGA CCCGGCTTCC TCCCTGTTCA CGGACACCGC GCACACACCA
CGGGAAAGGA CGACATGA
 
Protein sequence
MSDEALLSVR DLRVTLPGRR GGGVRAVRGL SFDVRPGEVL ALVGESGAGK SVTARAVLGM 
APYGASVTGS VRLDGQELVG APPAVLRTLR GRRMSLVPQD ALAVLSPVHT VGAQLVRALR
SVRRMSRAAA WERAVAALDR VGIPDAARRA HAYPHEFSGG MRQRAVIAMA TVNEPDLVFA
DEPTTALDPR MQARTLELLC GLRERTGTSV VLVTHDLGVV GGYADRVVVV YAGRHVESGP
VGPVLTRPRA PYTAGLVAAL PRPGAGDRRL PAIAGTPPSP EALPGGCAFA PRCPLTEDRC
HAEEPSPAVA GESGRLVSCH RWQDLPDPAS SLFTDTAHTP RERTT