Gene Ndas_5010 details

Gene Information

Locus tag: Ndas_5010
Symbol:
ID: 9248899
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014211
Strand:
Start bp: 152270
End bp: 153973
Gene Length: 1704 bp
Protein Length: 567 aa
Translation table: 11
GC content: 72%
IMG OID:
Product: ResB family protein
Protein accession: YP_003682897
Protein GI: 297563924
COG category:
COG ID:
TIGRFAM ID:
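
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, which is consistent with the values in this record, and that the CDS ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS: number of codons minus the stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(152270, 153973)
print(length, protein_length_aa(length))  # prints: 1704 567
```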


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.125994
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTACGA CCGCGAACGA CCAAGGGGCC GGGACCGCCG CCCAGAACGG GGAGGGCGCC 
CGACCGCGCC CCAAGGGCCT GGGCGCGGTC GGCTGGCTGC GCTGGATCTG GCGCACCCTG
ACCTCGATGC GCACCGCCCT GATCCTGCTG TTCCTGCTGG CCGTCGGGGC CATCCCGGGC
TCGATCCTGC CGCAGAACGT GGTGAGCGTC GACGAGGTCA ACACCTACTT CCAGGAGAAC
CCCGAACTGG CGCCCTGGCT GGACCGGTTC TACCTCTTCG ACGTGTTCTC CTCGCCCTGG
TACGCGGCGA TCTACCTGCT GCTGTTCGTC TCCCTGGCGG GCTGCGTGCT CCCCCGGGCG
CTGGCGCACG CCCGCGCGGT GCGGGCCCGC CCGGTGCACA CGCCCAAGAA CCTGGGCCGG
ATGCCCTACG CCGCCGCGTT CACCACCGAC GCCGACCCCG AGACGGTGCT GGAGCAGTCC
CGCAGGGTCC TGCGGGGCTA CCGGACCGAG CGCTACGGCG ACTCCCTCTC CACCGAGACC
GGCTACCTGC GCGAGGCCGG GAACGTCCTG TTCCACCTCG CCCTGCTGGG CCTGCTGCTC
GCCCTGGCCG CCGGGTCCTT CCTCGGCTAC CGCGGCAACA TGCTCCTGGT GGAGGGCGAC
GGGTTCGCCA ACACCCTGAC CTCCTACGAC GCGATCTACC CCGGGCACTG GACCGACACC
GACTCCCTGG AACCCTTCTC CATCCACCTG GACGACTTCG AGGCCTCCTT CATCGAGGAC
GGCGGCCTGC GCGGCCAGGC CGAGTCCTAC GTCGCCGACC TCACCTACAG GGAGGCGCCC
GACGCGCCGC AGGAGCGGCA CCGGCTGGAG GTCAACCACC CGTTGAGCGT GGCGGGCGTG
CAGGTCTACC TGCTCGGCCA CGGGTACGCG CCGGAGTTCG AGGTGCGCAA CGCCGAGGGC
GACCTGGTCT TCGACCAGGC GGTCCCCTTC CTGCACCGGG ACACGGCCTC CTACACCTCC
GACGGCGTGG TCAAGGTGCC CGACACCGGA GGCGAGCAGC TCGGCTTCGT CGGCGTGTTC
CTGCCCAGCG CGGTGGAGAC GCCCGAGGGG GAGATGGTCT CCAACTTTCC CGGGGCCCAG
AACCCCCGGA TCACCCTGGA GGGCTACCGG GGAGACCTGG GGCTGATCGA CCCGCAGTCG
GTCTACCAGC TGCGCACCGG CGGCATGGAG GAGCTGGGCA GCTCCCCCGT GATGGAGATC
GGGGACACCT GGGAGCTGCC CGAGGGCGCC GGGTCGATCA CCTTCTCCGG ATACAAGGAG
TACGTCAGCC TCCAGATGAA CCGCGACGGC GCCCGGCTGC CCGCGCTGAC CGCGGCCTCC
CTGGCCGTCG CCGGGCTGAT CGTCACCCTG TTCGTCCGCC CGCGCCGCGT GTGGGTGCGC
GCGACCAGGG GCGGGGACGG CCGCACCCAC GTGGAGCTGG CCGCCCTGGG CAAGACCGAG
GCGGCCGGGA ACAACGTGGA GTTCCACGAA CTCACCACGG AACTCGCCGG ACGGCTCCGG
AGCCGGTCCG ACGGGGAATC CGGCTCCGGA CCGGACAGCC GTCCGACTGC CGGCCCGGAC
ACCGACCCGG ACCCCGCTCC GGGCTCCGAC CCCCATCCGC CATCCGACCC GCCAGAGTCG
ACCACACAAG GGAGTGACCG GTGA
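
The gene sequence is displayed in space-separated blocks of 10 bases, so the whitespace has to be stripped before any computation. A minimal sketch of how a GC percentage such as the one reported in this record could be recomputed from the displayed text follows. The helper names are hypothetical, and the toy input is not the gene itself.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a block-formatted nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Toy input in the same block layout: 6 of 8 bases are G or C.
print(gc_percent("GGCC AT\nGC"))  # prints: 75.0
```

Passing the full gene sequence text to `gc_percent` gives a value that can be compared with the reported GC content.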
 
Protein sequence
MSTTANDQGA GTAAQNGEGA RPRPKGLGAV GWLRWIWRTL TSMRTALILL FLLAVGAIPG 
SILPQNVVSV DEVNTYFQEN PELAPWLDRF YLFDVFSSPW YAAIYLLLFV SLAGCVLPRA
LAHARAVRAR PVHTPKNLGR MPYAAAFTTD ADPETVLEQS RRVLRGYRTE RYGDSLSTET
GYLREAGNVL FHLALLGLLL ALAAGSFLGY RGNMLLVEGD GFANTLTSYD AIYPGHWTDT
DSLEPFSIHL DDFEASFIED GGLRGQAESY VADLTYREAP DAPQERHRLE VNHPLSVAGV
QVYLLGHGYA PEFEVRNAEG DLVFDQAVPF LHRDTASYTS DGVVKVPDTG GEQLGFVGVF
LPSAVETPEG EMVSNFPGAQ NPRITLEGYR GDLGLIDPQS VYQLRTGGME ELGSSPVMEI
GDTWELPEGA GSITFSGYKE YVSLQMNRDG ARLPALTAAS LAVAGLIVTL FVRPRRVWVR
ATRGGDGRTH VELAALGKTE AAGNNVEFHE LTTELAGRLR SRSDGESGSG PDSRPTAGPD
TDPDPAPGSD PHPPSDPPES TTQGSDR