Gene Ndas_0233 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_0233 
Symbol 
ID9244067 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp289976 
End bp291178 
Gene Length1203 bp 
Protein Length400 aa 
Translation table11 
GC content78% 
IMG OID 
ProductDNA protecting protein DprA 
Protein accessionYP_003678189 
Protein GI297559215 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAACACGG ACACGACGGA GGAGAGCGGG GGGACGGTGC GGGACGCGGA GGACGAGGCG 
GACGCGCGGG CGCGGGCCTG CCTGACAGCG GTGGCCCCGC CGGGGGACCT GTGGCTGGGG
GCGATGCTCG CCGAGCACGG CGCGGAACGG GTGTGGGCGC TGCTGGCGGC GGGGGCGCAC
CCGCCGCCGG TGCCTGTCGA AGCCGGGGAG GACGGCCCGG GACCGGAGGC GCAGACCCTT
CTGGAACGCA GGTGGGCGCG GTGGGGCGCC GCGGCGCGGG CGGTGGACCC CGACGGGCTG
CTCGGCGACT CGGCGGCGGC CGGCATCCGC TTCGTGGCCC CGCGGGACCC CGAGTGGCCG
GGCCGCCTCG ACGAACTGGA CCTGCCCGGA GGGCGGCGCT CGCACGGACT GTGGGTACGC
GGCGCGGGGG ACCTGCGTCA CCTGTGCCTG CGCTCGGTGG CCGTGGTGGG CGCGCGCTCG
GCCACGCCCT ACGGGGAGCA CGTGGCGGCG GAGATGGCCT ACGAGCTGGC CGAGCGCGCG
GTCGTGGTGG TCTCCGGCGG CGCCTACGGG ATCGACGGGG CCGCGCACCG GGCGGCCCAG
GCCCACGGCG GCACGGTCGT GGTGCTGGCC TGCGGGCTGG ACGTGGACTA CCCGCGCGGG
CACGCGGGCC TGTTCGCCGA CGTCGCCCGC ACCGGGGTGC TGGTGAGCGA GCGGCCGGTG
GGCGCCACCC CGCGCGCACC GGACTTCCTC GTACGCAACC GGCTGATCGC CGCGCTCACC
CCGGGCACGG TGGTGGTGGA GGCCGGACGG CGTAGCGGTG CCCTCAACAC CGCCTCGCAC
GCGGCCGAGC TCAACCGGGC GCTGATGGCG GTCCCCGGCC CGGTCACCTC GGCGATGTCG
GTGGGCTGCC ACCTGCTGCT CCGAGACTGG AACGCGAGCT GCGTCACCTG CGCGGACGAC
GTCGTCGCCC AGGTGAGCGC GCTGGGTGAG CTGCCGCCGG AGTCCGGGCC GCTGCGGGTG
TCGGCCGAGC TCGACCAGGA CAGCGCCCGC GTCCTGGCGG CGGTGCCCAG GTCCGGCGCC
GGGCCCGCGG TGATCGCCGT GGCCAGCGGG ACCCGTCTGG AGAGGACCCT GCGCTCCCTG
GGGATGCTGG CCGCGGCCGG ACTGGTGGAG CGCTGTCCGT CGGGCTGGCG GCTGCCGCAG
TGA
 
Protein sequence
MNTDTTEESG GTVRDAEDEA DARARACLTA VAPPGDLWLG AMLAEHGAER VWALLAAGAH 
PPPVPVEAGE DGPGPEAQTL LERRWARWGA AARAVDPDGL LGDSAAAGIR FVAPRDPEWP
GRLDELDLPG GRRSHGLWVR GAGDLRHLCL RSVAVVGARS ATPYGEHVAA EMAYELAERA
VVVVSGGAYG IDGAAHRAAQ AHGGTVVVLA CGLDVDYPRG HAGLFADVAR TGVLVSERPV
GATPRAPDFL VRNRLIAALT PGTVVVEAGR RSGALNTASH AAELNRALMA VPGPVTSAMS
VGCHLLLRDW NASCVTCADD VVAQVSALGE LPPESGPLRV SAELDQDSAR VLAAVPRSGA
GPAVIAVASG TRLERTLRSL GMLAAAGLVE RCPSGWRLPQ